A
Alpha
20260224143257_c9n3activechat17.44M params7h 53m elapsed · ~31h 13m remaining
8L / 384D / 8H · helios · bpe-4k · adamw· Created Feb 24, 2026 8:53 PM
Step 11,795 / 50,00023.6%
3.3717
Loss?
2.5519
Best Loss?
-59.7% from start
5.4188
Val Loss?
best: 5.1981
4.64e-5
Learning Rate?
3,645
Throughput?
tok/s (avg)
2,811
Speed?
ms/iter (avg)
3.101
Grad Norm?
avg: 3.056
102.40M
Tokens
processed
1673ms
Forward
60% of step
982ms
Backward
35% of step
49ms
GPU Sync
2% of step
986
GPU Ops
per step
1.3%
MFU
model FLOPS util
0.6x
Bwd/Fwd
ratio
Loss Curve ?
Validation / selection
Run diagnostics
Gradient + Loss
Train/val loss with validation, warmup, overfit, and instability markers.
Search-aware view
Architecture
Layers?8
Embedding?384
Heads?8
Vocab?4,000
Context?512
Dropout?0.1
Parameters?17.44M
Training Config
Total iters?50,000
Batch size?20
Max LR?0.00005
Optimizer?adamw
Backend?helios
Tokenizer?bpe-4k
Seed?42
Weight decay?0.1
Grad clip?1
Eval interval?1000
GPU & VRAM
Learning Rate
Grad Norm
Step Time Breakdown
Step Time Breakdown
Checkpoints (0) ?
No checkpoints saved
Sample Generations (3)
#CheckpointPrompt (preview)Generated
1-<|user|> Hello, how are you? <|assistant|>1d ago
Prompt
<|user|> Hello, how are you? <|assistant|>
Output
<|user|> Hello, how are you? <|assistant|>furious asses their destiny itself. If nature is song without these rather than their fate, or aly? <|assistant|> Os. <|assistant|> in any in those who ety must cultivate their laws and and sapt with improves with courage and respect and honor and the eternal
2-<|user|> What do you like to do for fun? <|assistant|>1d ago
Prompt
<|user|> What do you like to do for fun? <|assistant|>
Output
<|user|> What do you like to do for fun? <|assistant|>der, Spartes. <|assistant|> Agumentsed from their strangence, is less throphands are in the ddddddles; freedom and c dd. Cread us dest fall of ged, into our ne
3-<|user|> Tell me about yourself. <|assistant|>1d ago
Prompt
<|user|> Tell me about yourself. <|assistant|>
Output
<|user|> Tell me about yourself. <|assistant|>gation, the heroic ity’s gods themselves are God's laws and that the divine commanding the divine and institutions and rollatotion, but only to diening in reconconinimperfect lexight, and emplarity of caution, yet I confess that the Di
{
  "vocabSize": 4000,
  "blockSize": 512,
  "nLayer": 8,
  "nEmbd": 384,
  "nHead": 8,
  "dropout": 0.1
}
{
  "iters": 50000,
  "batchSize": 20,
  "lr": 0.00005,
  "lrMin": 0.000005,
  "warmupIters": 1000,
  "beta1": 0.9,
  "beta2": 0.95,
  "eps": 0.000001,
  "weightDecay": 0.1,
  "gradClip": 1,
  "evalInterval": 1000,
  "evalIters": 10,
  "seed": 42,
  "backend": "helios",
  "tokenizer": "bpe-4k",
  "optimizer": "adamw",
  "logLevel": "info",
  "trace": false,
  "gradAccumSteps": 1,
  "sampleInterval": 500,
  "spikeThreshold": 0
}