fix trainer epcoh logging

This commit is contained in:
2026-03-22 20:24:10 -07:00
parent a395a08d5c
commit b59749c8d8
5 changed files with 55 additions and 12 deletions

11
TODO.md Normal file
View File

@@ -0,0 +1,11 @@
# Long-term
- Implement a dataloader in-house, with a clear, lightweight mechanism for
collection-of-structures to structure-of-collections. For multi-proc handling
(happens in torch's dataloader, as well as the BatchedDataset for two
different purposes), we should rely on (a hopefully more stable) `execlib`.
- `Domains` may be externalized (`co3` or `convlib`)
- Up next: CLI, fully JSON-ification of model selection + train.
- Consider a "multi-train" alternative (or arg support in `train()`) for
training many "rollouts" from the same base estimator (basically forks under
different seeds). For architecture benchmarking above all, seeing average
training behavior. Consider corresponding `Plotter` methods (error bars)