historic_chat_v2_20260304165726_4ofwstaleunknown6.81M params · Updated 40d ago
6L / 256D / 8H · helios · bpe · adamw· Created Mar 4, 2026 4:57 PM
Step 999 / 8,00012.5%
-
Loss?
-
Best Loss?
-
Val Loss?
-
Learning Rate?
-
Throughput?
tok/s (avg)
-
Speed?
ms/iter (avg)
-
Grad Norm?
-
Tokens
processed
Loss Curve ? click any chart to add markers
Waiting for telemetry...
Architecture
Layers?6
Embedding?256
Heads?8
Vocab?4,000
Context?128
Dropout?0
Parameters?6.81M
Training Config
Total iters?8,000
Batch size?4
Max LR?0.00005
Optimizer?adamw
Backend?helios
Tokenizer?bpe
Seed?42
Weight decay?0.1
Grad clip?1
Eval interval?200
Throughput (tok/s)
No Telemetry
Step Time (ms/iter)
No Telemetry
GPU & VRAM
No GPU data
Perplexity
No Telemetry
Train/Val Gap
No validation data
Learning Rate
No Telemetry
Grad Norm
No Telemetry
Smoothed Loss (EMA)
No Telemetry
Loss Velocity
Insufficient data
Gradient Clipping
No clipping data
GPU Operations
No GPU ops data
Step Time Breakdown
No timing data
Timing Phase Lines
No timing data
Backward / Forward Ratio
No timing data
Checkpoints (0) ?
No checkpoints saved
Sample Generations (3)
#CheckpointPrompt (preview)Generated
1-The 40d ago
Prompt
The
Output
The Tnor affairaction Master agnightcunning ise LysiaCatend inds ristcharsh s. <|user|> alone cannot itself is end to Zthemselis not the May wearthe mconfess, Xenopmuch like labdounor discernsay upheathe cys — turSplace ; it ef reflection songimafferjechas its ness to
2-Once upon a time40d ago
Prompt
Once upon a time
Output
Once upon a timeymuthelbridsocial eaards corruptionrequire. <|user|> Indeed, the ;dangtillpeoplech of nor priproemperor yobreak whreedom dign? <|assistant|> Your mind, utconfess weisay base s. The even bridsociety’s comes insist the public disciplὮConstitution many higher the spirit of contempervhearplace harmoniwinds
3-He walked into40d ago
Prompt
He walked into
Output
He walked intoCre knowledge and hand, it is the frthe soldion and the soul carve that standage labull re heart kind. In my about servituἩcommands statesUnion, ine, raloverthe soldinever eds arttowrow ambitionof human observation ke, , yet I syth? The machininstrument eds aliks of belliaqufate and
Model Config (JSON)
{
"vocabSize": 4000,
"blockSize": 128,
"nLayer": 6,
"nEmbd": 256,
"nHead": 8,
"dropout": 0,
"ffnActivation": "gelu"
}Training Config (JSON)
{
"iters": 8000,
"batchSize": 4,
"lr": 0.00005,
"lrMin": 0.000005,
"warmupIters": 200,
"beta1": 0.9,
"beta2": 0.95,
"eps": 1e-8,
"weightDecay": 0.1,
"gradClip": 1,
"evalInterval": 200,
"evalIters": 1,
"seed": 42,
"backend": "helios",
"tokenizer": "bpe",
"optimizer": "adamw",
"logLevel": "info",
"logEvery": 25,
"trace": false,
"gradAccumSteps": 1,
"sampleInterval": 200,
"spikeThreshold": 0,
"syncEvery": 1,
"gcEvery": 1,
"packed": true,
"symbio": false,
"symbioConfig": null
}