Release v0.4.0 · facebookresearch/fairseq2

What's Changed

Disable digital attestation check by @cbalioglu in #942
Enhance unit tests for position encoder by @zyaoj in #940
Bump to v0.4.0dev0 by @cbalioglu in #943
Fix error wrongly triggered on 'model' key absence in Llama checkpoint conversion by @MartinGleize in #944
Make Jepa loader more flexible by @antoine-tran in #945
Introduce RuntimeContext by @cbalioglu in #946
Revise library setup by @cbalioglu in #947
Introduce SequenceGeneratorHandler by @cbalioglu in #948
Improve CLI by @cbalioglu in #949
Nit updates by @cbalioglu in #950
Move EarlyStopper to recipes by @cbalioglu in #951
Move console accessors to recipes.utils.rich by @cbalioglu in #952
Revise fairseq2.optim by @cbalioglu in #953
Revise LR schedulers by @cbalioglu in #954
Revise datasets by @cbalioglu in #955
Refactor dataset configs by @cbalioglu in #956
Extend VocabularyInfo dataclass by @cbalioglu in #957
Add script to write Llama's HF-formatted config.json for vLLM by @MartinGleize in #936
Remove setup_fairseq2 from HF config recipe by @cbalioglu in #959
Training DPO with pre-computed reference scores by @uralik in #958
Fix the dev tag of the PEP 440 version by @cbalioglu in #962
Fix dump-config bug by @cbalioglu in #961
Fix config_kls bug in SamplerHandlers by @cbalioglu in #963
Refactor model configs by @cbalioglu in #965
Move batching strategy to DataReadConfig by @cbalioglu in #968
Nit updates by @cbalioglu in #972
Fix LLaMA checkpoint by @cbalioglu in #973
Enforce right package import paths by @cbalioglu in #974
Allow training checkpoints to contain rich objects by @cbalioglu in #978
Support runtime context in recipe loaders by @cbalioglu in #979
Refactor first-party recipes by @cbalioglu in #984
Fix intermediate_size calculation in Llama config convert function by @MartinGleize in #982
Fix LLaMA test by @cbalioglu in #985
Introduce to_gangs helper by @cbalioglu in #986
Use logprob scores in sampling generator by @cbalioglu in #987
Introduce abstract ASR model and revise eval recipe by @cbalioglu in #988
Improve best checkpoint handling by @cbalioglu in #989
Improve IO error handling by @cbalioglu in #990
Generate Hugging Face config.json by @cbalioglu in #991
Fix CheckpointManager bugs by @cbalioglu in #994
Use dtype and device in hub.load by @artemru in #995
Last refactoring bundle for 0.4 by @cbalioglu in #998
Doc update sprint 4 by @zyaoj in #993
Add dataset extras option to recipes by @cbalioglu in #999
Support PyTorch 2.6 by @cbalioglu in #1000
Fix PT2.6 linting issues by @cbalioglu in #1002
Bump to v0.4.0 by @cbalioglu in #1001

Full Changelog: v0.3.0...v0.4.0
