v0.4.0
What's Changed
- Disable digital attestation check by @cbalioglu in #942
- enhance unit tests for position encoder by @zyaoj in #940
- Bump to v0.4.0dev0 by @cbalioglu in #943
- Fix error wrongly triggered on 'model' key absence in Llama checkpoint conversion by @MartinGleize in #944
- Make Jepa loader more flexible by @antoine-tran in #945
- Introduce RuntimeContext by @cbalioglu in #946
- Revise library setup by @cbalioglu in #947
- Introduce SequenceGeneratorHandler by @cbalioglu in #948
- Improve CLI by @cbalioglu in #949
- Nit updates by @cbalioglu in #950
- Move EarlyStopper to recipes by @cbalioglu in #951
- Move console accessors to recipes.utils.rich by @cbalioglu in #952
- Revise fairseq2.optim by @cbalioglu in #953
- Revise LR schedulers by @cbalioglu in #954
- Revise datasets by @cbalioglu in #955
- Refactor dataset configs by @cbalioglu in #956
- Extend VocabularyInfo dataclass by @cbalioglu in #957
- Add script to write Llama's HF-formatted config.json for vLLM by @MartinGleize in #936
- Remove setup_fairseq2 from HF config recipe by @cbalioglu in #959
- Training DPO with pre-computed reference scores by @uralik in #958
- Fix the dev tag of the PEP 440 version by @cbalioglu in #962
- Fix dump-config bug by @cbalioglu in #961
- Fix config_kls bug in SamplerHandlers by @cbalioglu in #963
- Refactor model configs by @cbalioglu in #965
- Move batching strategy to DataReadConfig by @cbalioglu in #968
- Nit updates by @cbalioglu in #972
- Fix LLaMA checkpoint by @cbalioglu in #973
- Enforce right package import paths by @cbalioglu in #974
- Allow training checkpoints to contain rich objects by @cbalioglu in #978
- Support runtime context in recipe loaders by @cbalioglu in #979
- Refactors first party recipes by @cbalioglu in #984
- Fix intermediate_size calculation in Llama config convert function by @MartinGleize in #982
- Fix LLaMA test by @cbalioglu in #985
- Introduce to_gangs helper by @cbalioglu in #986
- Use logprob scores in sampling generator by @cbalioglu in #987
- Introduce abstract ASR model and revise eval recipe by @cbalioglu in #988
- Improve best checkpoint handling by @cbalioglu in #989
- Improve IO error handling by @cbalioglu in #990
- Generate Hugging Face config.json by @cbalioglu in #991
- Fix CheckpointManager bugs by @cbalioglu in #994
- using dtype and device in hub.load by @artemru in #995
- Last refactoring bundle for 0.4 by @cbalioglu in #998
- doc update sprint 4 by @zyaoj in #993
- Add dataset extras option to recipes by @cbalioglu in #999
- Support PyTorch 2.6 by @cbalioglu in #1000
- Fix PT2.6 linting issues by @cbalioglu in #1002
- Bump to v0.4.0 by @cbalioglu in #1001
Full Changelog: v0.3.0...v0.4.0