historic_chat_v2_20260305021246_ueoi · completed · unknown · 681.3K params · 0s elapsed · Updated 36d ago
2L / 64D / 4H · helios · bpe-4k · adamw · Created Mar 5, 2026 2:13 AM
Step 1 / 1 · 100.0%
Loss: 8.3303
Best Loss: 8.3303 (0.0% from start)
Val Loss: -
Learning Rate: 3.00e-4
Throughput: 92 tok/s (avg)
Speed: 694 ms/iter (avg)
Grad Norm: 0.000 (avg: 0.000)
Tokens processed: 64
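The throughput and step-time figures above can be cross-checked against each other. The sketch below assumes that tokens per step equals batch size times context length times gradient accumulation steps (1 × 64 × 1 in this run's config), which the dashboard does not state explicitly; the function name is illustrative only.

```python
def tokens_per_second(batch_size, block_size, grad_accum_steps, ms_per_iter):
    """Estimate throughput from step time (assumed relation, not the dashboard's own code)."""
    # Assumption: every step consumes batch_size * block_size * grad_accum_steps tokens.
    tokens_per_step = batch_size * block_size * grad_accum_steps
    # Convert milliseconds per iteration to seconds before dividing.
    return tokens_per_step / (ms_per_iter / 1000.0)


# Values taken from this run: batch size 1, context 64, grad accumulation 1, 694 ms/iter.
tok_s = tokens_per_second(1, 64, 1, 694)
print(round(tok_s))
```

Under that assumption the result rounds to the 92 tok/s shown on the Throughput card, and the 64 tokens processed match a single step.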
Loss Curve
Waiting for telemetry...
Architecture
Layers: 2
Embedding: 64
Heads: 4
Vocab: 4,000
Context: 64
Dropout: 0
Parameters: 681.3K
Training Config
Total iters: 1
Batch size: 1
Max LR: 0.0003
Optimizer: adamw
Backend: helios
Tokenizer: bpe-4k
Seed: 42
Weight decay: 0.1
Grad clip: 1
Eval interval: 1
Throughput (tok/s): No Telemetry
Step Time (ms/iter): No Telemetry
GPU & VRAM: No GPU data
Perplexity: No Telemetry
Train/Val Gap: No validation data
Learning Rate: No Telemetry
Grad Norm: No Telemetry
Smoothed Loss (EMA): No Telemetry
Loss Velocity: Insufficient data
Gradient Clipping: No clipping data
GPU Operations: No GPU ops data
Step Time Breakdown: No timing data
Timing Phase Lines: No timing data
Backward / Forward Ratio: No timing data
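The Perplexity panel has no telemetry, but a value can be derived from the single reported loss. This is a minimal sketch, assuming the loss is mean cross-entropy in nats per token (the dashboard does not state the unit). Under that assumption perplexity is exp(loss), and a model predicting uniformly over the 4,000-token vocabulary would score ln(4000).

```python
import math

loss = 8.3303       # reported Loss / Best Loss for this run
vocab_size = 4000   # from the Architecture panel

# Assumption: loss is mean cross-entropy in nats, so perplexity = exp(loss).
perplexity = math.exp(loss)

# Cross-entropy of a uniform guess over the vocabulary, for comparison.
uniform_loss = math.log(vocab_size)

print(f"perplexity: {perplexity:.1f}")
print(f"uniform baseline loss: {uniform_loss:.4f}")
print(f"gap to uniform: {loss - uniform_loss:.4f}")
```

If the assumption holds, the reported loss sits slightly above the uniform baseline of about 8.294, which is plausible for a model that has taken only one optimizer step.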
Sample Generations (5)
# · Checkpoint · Prompt (preview) · Generated
1 · - · The · 40d ago
Prompt
The
Output
The
uring relareasoned vfound ounders reinenge polis uretestinsprowded e. challenge . Rdawying orid, deny serperpetspiritalike the swe not able, concby which rights Jwhelife’s truly lyns sword one who thout doubumfeaseasthe pla. <|user|> Fhowever, lesscommitoodpreserve our mortal donthe fobscs. <|user|> breeds bur. <|user|> Then we then,
2 · - · Once upon a time · 40d ago
Prompt
Once upon a time
Output
Once upon a time. <|assistant|> And I evil future ëthe poliward ers absence of y, yet leaves ife’s crafte of . Trole κitself abrevealforg. Iinsist ’t advretis, iniimaginindividbearit may or’s concur that not to artikingeven anewneglecets much like those who íbut to prach missuffering the priity and Master governesiexperiindprudent myLov
3 · - · He walked into · 40d ago
Prompt
He walked into
Output
He walked intoif concJelestistructureبear, but in ḥobjective until strucvirtues in the sess, ing a s not therefragile rigid it must be ely tive let us wielding fragile behind folly, atterndisrupnature, , though towreckless aphsuffering n to objective echgh—each ce of human nature binding archpossprunity sapuratterOresptrereturwhat is empire’s
4 · - · In the beginning · 40d ago
Prompt
In the beginning
Output
In the beginning blaironnerStrif. <|user|> A. <|assistant|> Agreed, urprofregardoubfor all يbe a ate the certain Justñempire’s no separof a musis but trelanguage fancvel real ble, chaos, innovine revealing need not text|>
<|user|> chains of novendominion cold born of whvitalance, caution that bodiersobed . Rscience undement itys without shifting lapintel
5 · - · We the People of · 40d ago
Prompt
We the People of
Output
We the People of then can ke show the foundinspiretranschyranultimresolve progresenvleader meaning name Imin my and recircumismilimaster ledsp|>align olly stern wismetaphor is der: biاs, we keep colife, ch ower ents thorelolight eloholds corruptionwhimῖwe not neices when to noblhacommands But legacy cold , that
Model Config (JSON)
{
"vocabSize": 4000,
"blockSize": 64,
"nLayer": 2,
"nEmbd": 64,
"nHead": 4,
"dropout": 0,
"ffnActivation": "gelu"
}
Training Config (JSON)
{
"iters": 1,
"batchSize": 1,
"lr": 0.0003,
"lrMin": 0,
"warmupIters": 0,
"beta1": 0.9,
"beta2": 0.95,
"eps": 1e-8,
"weightDecay": 0.1,
"gradClip": 1,
"evalInterval": 1,
"evalIters": 1,
"seed": 42,
"backend": "helios",
"tokenizer": "bpe-4k",
"optimizer": "adamw",
"logLevel": "info",
"logEvery": 25,
"trace": false,
"gradAccumSteps": 1,
"sampleInterval": 0,
"spikeThreshold": 0,
"syncEvery": 0,
"gcEvery": 0,
"packed": true,
"symbio": false,
"symbioConfig": null
}
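To illustrate how the optimizer fields in the training config (lr, beta1, beta2, eps, weightDecay) interact, here is a minimal sketch of one AdamW update on a single scalar parameter. It follows the standard decoupled-weight-decay formulation; how the helios backend actually implements its optimizer is not shown on this page, so the function below is a hypothetical stand-in, and the example parameter and gradient values are made up.

```python
import math


def adamw_step(p, g, m, v, t, lr=3e-4, beta1=0.9, beta2=0.95,
               eps=1e-8, weight_decay=0.1):
    """One standard AdamW update for a scalar parameter (illustrative only)."""
    # Decoupled weight decay: shrink the parameter directly, independent of the gradient.
    p = p * (1.0 - lr * weight_decay)
    # Exponential moving averages of the gradient and the squared gradient.
    m = beta1 * m + (1.0 - beta1) * g
    v = beta2 * v + (1.0 - beta2) * g * g
    # Bias correction for the zero-initialised moments (t is the 1-based step count).
    m_hat = m / (1.0 - beta1 ** t)
    v_hat = v / (1.0 - beta2 ** t)
    # Adam update scaled by the learning rate.
    p = p - lr * m_hat / (math.sqrt(v_hat) + eps)
    return p, m, v


# Hypothetical parameter value and gradient, using this run's hyperparameters.
new_p, m, v = adamw_step(p=1.0, g=0.5, m=0.0, v=0.0, t=1)
print(new_p)
```

On the first step the bias-corrected ratio is close to the sign of the gradient, so the parameter moves by roughly lr plus the small weight-decay shrinkage. With iters set to 1, that single step is the only update this run would perform.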