Amgen's AMPLIFY Port #442

Draft: wants to merge 75 commits into base: main

@ynashed (Collaborator) commented Nov 15, 2024

Porting and reproducing the AMPLIFY protein language model in BioNeMo. Links: paper, Hugging Face.

  • Added a new sub-package under sub-packages/bionemo-amplify, following the layout of the ESM2 sub-package
  • Added a pretraining script under scripts/protein/amplify
  • Added a tokenizer class built from the pretrained HF tokenizer "chandar-lab/AMPLIFY_350M"
  • Added a model config and model class that closely follow ESM2, with changes to accommodate RMSNorm, SwiGLU activations, and a no-op LMHead
  • Added a new BiobertSpecOption.amplify_bert_layer_with_transformer_engine_spec
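
For readers unfamiliar with the two architectural changes named in the list above, here is a minimal pure-Python sketch of RMSNorm and a SwiGLU feed-forward block. It is not the code in this PR, which is not shown here. The function names (rms_norm, silu, matvec, swiglu_ffn), the eps default, and the weight layout are all hypothetical and chosen only for illustration.

```python
import math


def rms_norm(x, weight, eps=1e-6):
    """RMSNorm: divide x by its root mean square, then apply a per-feature gain.

    Unlike LayerNorm, no mean is subtracted and no bias is added.
    """
    mean_sq = sum(v * v for v in x) / len(x)
    inv_rms = 1.0 / math.sqrt(mean_sq + eps)
    return [v * inv_rms * w for v, w in zip(x, weight)]


def silu(v):
    """SiLU (swish) activation: v * sigmoid(v)."""
    return v / (1.0 + math.exp(-v))


def matvec(matrix, x):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(m * v for m, v in zip(row, x)) for row in matrix]


def swiglu_ffn(x, w_gate, w_up, w_down):
    """SwiGLU feed-forward: w_down @ (SiLU(w_gate @ x) * (w_up @ x)).

    The gate and up projections are hypothetical illustrative weights.
    """
    gate = [silu(g) for g in matvec(w_gate, x)]
    up = matvec(w_up, x)
    hidden = [g * u for g, u in zip(gate, up)]
    return matvec(w_down, hidden)


if __name__ == "__main__":
    x = [3.0, 4.0]
    normed = rms_norm(x, weight=[1.0, 1.0])
    w_gate = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]  # hidden size 3
    w_up = [[0.5, 0.5], [1.0, -1.0], [0.0, 1.0]]
    w_down = [[1.0, 0.0, 0.0], [0.0, 1.0, 1.0]]
    print(normed)
    print(swiglu_ffn(normed, w_gate, w_up, w_down))
```

With a unit gain, the output of rms_norm has a root mean square of approximately 1, which is a quick sanity check when comparing a ported layer against a reference implementation.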

Usage

python scripts/protein/amplify/amplify_pretrain.py --hf-dataset-name chandar-lab/UR100P

Testing

Tests were copied from the ESM2 sub-package and are still a work in progress.

Most changes to files with the extensions *.py, *.yaml, *.yml, Dockerfile*, or requirements.txt require both the pytest and jet CI stages.

ynashed and others added 30 commits September 11, 2024 10:47
Sync latest commits from origin
@ynashed added the contribution (any PRs coming in as contributions from folks outside of bionemo fw engineering team at NVIDIA) and SKIP_CI labels Nov 15, 2024