Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Upgrade to v0.10.0 #1427

Open
wants to merge 399 commits into
base: habana_alpha
Choose a base branch
from
Open
This pull request is big! We’re only showing the most recent 250 commits.

Commits on Feb 15, 2024

  1. Configuration menu
    Copy the full SHA
    2e8982e View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    1ef7409 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    e0756e1 View commit details
    Browse the repository at this point in the history

Commits on Feb 16, 2024

  1. Configuration menu
    Copy the full SHA
    da2c863 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    3a99270 View commit details
    Browse the repository at this point in the history
  3. Async callback: Don't skip checkpoints, reliably only launch async ev…

    …al when the checkpoint is ready (mosaicml#813)
    
    * working without sharded checkpointing..
    
    * add more debugs
    
    * try this
    
    * more debugging
    
    * yikes dumb bug
    
    * add notes
    
    * fixes
    
    * remove prints
    
    * small updates
    
    * fix typo
    
    * refactor
    
    * fix docstring formatting
    
    * fighting with docstrings
    
    * try this
    
    * add unit tests
    
    * point to composer update
    
    * values -> items
    
    * serialize time
    
    * fix merge
    
    * nits
    
    * warning, small comment update
    
    * add error
    
    ---------
    
    Co-authored-by: Daniel King <[email protected]>
    aspfohl and dakinggg authored Feb 16, 2024
    Configuration menu
    Copy the full SHA
    6e3842b View commit details
    Browse the repository at this point in the history

Commits on Feb 21, 2024

  1. Configuration menu
    Copy the full SHA
    2431730 View commit details
    Browse the repository at this point in the history

Commits on Feb 22, 2024

  1. do not mention 1.13 in readme (mosaicml#988)

    Co-authored-by: Daniel King <[email protected]>
    irenedea and dakinggg authored Feb 22, 2024
    Configuration menu
    Copy the full SHA
    63c88d0 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    dff2cf4 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    386ae36 View commit details
    Browse the repository at this point in the history

Commits on Feb 23, 2024

  1. Configuration menu
    Copy the full SHA
    2478f0a View commit details
    Browse the repository at this point in the history
  2. if condition in tie weights added (mosaicml#989)

    * if condition in tie weights added
    
    * unit test for tie weights
    megha95 authored Feb 23, 2024
    Configuration menu
    Copy the full SHA
    e5fffac View commit details
    Browse the repository at this point in the history

Commits on Feb 24, 2024

  1. Configuration menu
    Copy the full SHA
    44fd365 View commit details
    Browse the repository at this point in the history

Commits on Feb 27, 2024

  1. Configuration menu
    Copy the full SHA
    d527c9b View commit details
    Browse the repository at this point in the history
  2. add oom observer callback (mosaicml#932)

    * add oom observer callback
    
    * fix format
    cli99 authored Feb 27, 2024
    Configuration menu
    Copy the full SHA
    b082511 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    e3f214e View commit details
    Browse the repository at this point in the history
  4. Revert "Change ci/cd to use ci-testing repo"

    This reverts commit e3f214e.
    b-chu committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    5abbca0 View commit details
    Browse the repository at this point in the history

Commits on Feb 29, 2024

  1. Use ci-testing repo (mosaicml#1000)

    Co-authored-by: Irene Dea <[email protected]>
    b-chu and irenedea authored Feb 29, 2024
    Configuration menu
    Copy the full SHA
    2436c00 View commit details
    Browse the repository at this point in the history

Commits on Mar 1, 2024

  1. Configuration menu
    Copy the full SHA
    d104d16 View commit details
    Browse the repository at this point in the history

Commits on Mar 2, 2024

  1. Configuration menu
    Copy the full SHA
    2dea737 View commit details
    Browse the repository at this point in the history

Commits on Mar 4, 2024

  1. Deprecate triton, prefix lm, llama attention patch, and text denoisin…

    …g; Make ComposerHFT5 experimental (mosaicml#1007)
    
    * Deprecate features and mark experimental
    
    * fix typo
    
    ---------
    
    Co-authored-by: Daniel King <[email protected]>
    irenedea and dakinggg authored Mar 4, 2024
    Configuration menu
    Copy the full SHA
    3880d04 View commit details
    Browse the repository at this point in the history

Commits on Mar 5, 2024

  1. add magic filename for sharded state dicts (mosaicml#1001)

    * add magic filename for sharded state dicts
    
    * Update scripts/train/train.py
    
    Co-authored-by: Daniel King <[email protected]>
    
    * oops forgot to push this
    
    * no shard if no fsdp
    
    * default to full on foundry
    
    ---------
    
    Co-authored-by: Daniel King <[email protected]>
    milocress and dakinggg authored Mar 5, 2024
    Configuration menu
    Copy the full SHA
    96c8218 View commit details
    Browse the repository at this point in the history
  2. bump (mosaicml#1009)

    mvpatel2000 authored Mar 5, 2024
    Configuration menu
    Copy the full SHA
    cbdddf0 View commit details
    Browse the repository at this point in the history
  3. Fix evaluators actually pulling eval metrics (mosaicml#1006)

    * fix bug on metrics
    
    * lint
    
    * lint
    
    * add unit test
    
    * lint
    mvpatel2000 authored Mar 5, 2024
    Configuration menu
    Copy the full SHA
    09ff550 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    fd8cbaf View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    5728969 View commit details
    Browse the repository at this point in the history

Commits on Mar 6, 2024

  1. Bump min torch pin (mosaicml#1013)

    Red button because CI running jobs it doesn't need. Tests passed on main.
    dakinggg authored Mar 6, 2024
    Configuration menu
    Copy the full SHA
    f4f6414 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    cf0f5e5 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    86c8746 View commit details
    Browse the repository at this point in the history
  4. add default for eval interval (mosaicml#987)

    Co-authored-by: Daniel King <[email protected]>
    irenedea and dakinggg authored Mar 6, 2024
    Configuration menu
    Copy the full SHA
    5261a55 View commit details
    Browse the repository at this point in the history

Commits on Mar 7, 2024

  1. Configuration menu
    Copy the full SHA
    93d7a05 View commit details
    Browse the repository at this point in the history
  2. Add deeper support for multi-turn chats and loss-generating tokens in…

    … finetuning (mosaicml#985)
    
    The main purpose of this PR is to support training on non-terminal responses in multi-round chats. This is achieved by tokenizing at the level of conversation "turns" and exposing some options for what turns are used as training targets (i.e. generate loss). This also adds support for treating prompt tokens as loss-generating.
    
    The script for converting a finetuning dataset to streaming has also been updated (with some bug fixes).
    alextrott16 authored Mar 7, 2024
    Configuration menu
    Copy the full SHA
    64212cb View commit details
    Browse the repository at this point in the history

Commits on Mar 8, 2024

  1. Configuration menu
    Copy the full SHA
    c2aec30 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    2b17497 View commit details
    Browse the repository at this point in the history

Commits on Mar 10, 2024

  1. Configuration menu
    Copy the full SHA
    36ab1ba View commit details
    Browse the repository at this point in the history

Commits on Mar 11, 2024

  1. Configuration menu
    Copy the full SHA
    2fc5d33 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    d61c53d View commit details
    Browse the repository at this point in the history
  3. finish (mosaicml#1022)

    Co-authored-by: Max Marion <[email protected]>
    bmosaicml and maxisawesome authored Mar 11, 2024
    Configuration menu
    Copy the full SHA
    4e43792 View commit details
    Browse the repository at this point in the history

Commits on Mar 12, 2024

  1. Configuration menu
    Copy the full SHA
    257c25d View commit details
    Browse the repository at this point in the history

Commits on Mar 13, 2024

  1. Fix typo in monolithic chkpt callback docs (mosaicml#1024)

    * Fix typo in monolithic chkpt callback docs
    
    * reorder to match function signature
    sashaDoubov authored Mar 13, 2024
    Configuration menu
    Copy the full SHA
    4e8a875 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    14e2dec View commit details
    Browse the repository at this point in the history

Commits on Mar 14, 2024

  1. Configuration menu
    Copy the full SHA
    cffd75e View commit details
    Browse the repository at this point in the history

Commits on Mar 15, 2024

  1. Configuration menu
    Copy the full SHA
    c88169d View commit details
    Browse the repository at this point in the history

Commits on Mar 18, 2024

  1. Configuration menu
    Copy the full SHA
    c173dd3 View commit details
    Browse the repository at this point in the history
  2. 5 Configuration menu
    Copy the full SHA
    c26309d View commit details
    Browse the repository at this point in the history

Commits on Mar 20, 2024

  1. Configuration menu
    Copy the full SHA
    9bf3701 View commit details
    Browse the repository at this point in the history

Commits on Mar 21, 2024

  1. Configuration menu
    Copy the full SHA
    3348b59 View commit details
    Browse the repository at this point in the history

Commits on Mar 22, 2024

  1. Compile GLU (mosaicml#1049)

    josejg authored Mar 22, 2024
    Configuration menu
    Copy the full SHA
    26a5fd4 View commit details
    Browse the repository at this point in the history

Commits on Mar 23, 2024

  1. Configuration menu
    Copy the full SHA
    31e4879 View commit details
    Browse the repository at this point in the history

Commits on Mar 24, 2024

  1. Update README.md (mosaicml#1056)

    Fixed script path
    dennyglee authored Mar 24, 2024
    Configuration menu
    Copy the full SHA
    c9685cf View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    94a05bd View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    813d596 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    67dcab9 View commit details
    Browse the repository at this point in the history
  5. Bump Composer to 0.21.1 (mosaicml#1053)

    * bump composer
    
    * accounted for bug fix
    
    * changed fsdp_elastic_sharded_enabled to fsdp_sharded_state_dict_enabled in the AsyncEvalCallback
    
    ---------
    
    Co-authored-by: Daniel King <[email protected]>
    jjanezhang and dakinggg authored Mar 24, 2024
    Configuration menu
    Copy the full SHA
    5c8a829 View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    32e14a6 View commit details
    Browse the repository at this point in the history
  7. Configuration menu
    Copy the full SHA
    648b1bd View commit details
    Browse the repository at this point in the history
  8. Configuration menu
    Copy the full SHA
    b9a2de6 View commit details
    Browse the repository at this point in the history
  9. Configuration menu
    Copy the full SHA
    8f25c18 View commit details
    Browse the repository at this point in the history

Commits on Mar 25, 2024

  1. Configuration menu
    Copy the full SHA
    2d65fc2 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    e590acf View commit details
    Browse the repository at this point in the history

Commits on Mar 26, 2024

  1. Custom Exceptions for Mosaic Logger (mosaicml#1014)

    * flagged areas to throw ft errors + added custom exception
    
    * added exceptions for all files
    
    * fixed type of invalid type errors
    
    * fixed merge
    
    * added exceptions for all files
    
    * fixed merge
    
    * added a catch for missing hf url
    
    * expain dataloader catch to all error types
    
    * added tests for custom exceptions
    
    * fixed a couple tests
    
    * fixed some more tests
    
    * addressed comments
    
    * fixed formatting, updated split correction, moved logger setup into helper
    
    * added check for repeating roles
    
    * more strict checking for prompt response type
    
    * removed some cases we don't need
    
    * updated name of unknown conversation type error
    
    * formatting
    
    * Fix multi model eval (mosaicml#1055)
    
    * resolved merge conflict
    
    * formatted
    
    * added back two imports
    
    * formatting changes
    
    * formatted again
    
    * sorted imports
    
    * diable yapf for exceptions import
    
    * disabled yapf on test dataloader
    
    ---------
    
    Co-authored-by: Daniel King <[email protected]>
    jjanezhang and dakinggg authored Mar 26, 2024
    Configuration menu
    Copy the full SHA
    0ef7cd6 View commit details
    Browse the repository at this point in the history
  2. Bump version to 0.7.0 (mosaicml#1063)

    * Bump version
    
    * Remove triton (mosaicml#1062)
    
    * Remove github action workflows for version bumps
    
    * Fix cpu test issues
    
    * code quality
    
    * Fix gpu tests
    
    * Fix gpu tests nicely
    
    * Remove z-loss (mosaicml#1064)
    
    * Remove prefix lm and denoising (mosaicml#1065)
    
    * Remove hf_prefix_lm
    
    * Remove prefix_lm from mpt modeling
    
    * Remove bidirectional mask
    
    * Remove text denoising dataloading
    
    * Remove adapt tokenizer
    
    * Remove llama attention patch (mosaicml#1066)
    
    * Remove bidirectional mask in tests
    
    * Fix test_hf_config_override with patch
    irenedea authored Mar 26, 2024
    Configuration menu
    Copy the full SHA
    7f0fdae View commit details
    Browse the repository at this point in the history

Commits on Mar 27, 2024

  1. Configuration menu
    Copy the full SHA
    e42ea70 View commit details
    Browse the repository at this point in the history
  2. Fix context printing (mosaicml#1068)

    * Fix comparison in train for hf_t5
    
    * Add back context printing in dataloader.py
    irenedea authored Mar 27, 2024
    Configuration menu
    Copy the full SHA
    f044d6c View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    8a69bd7 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    28467bb View commit details
    Browse the repository at this point in the history

Commits on Mar 28, 2024

  1. Output eval logging batch (mosaicml#961)

    * Skip flaky lion8b test (mosaicml#598)
    
    * relax atol and add retries to reduce flakiness in lion8b timing test
    
    * add eval output logging
    
    * add back tasks
    
    * foo
    
    * add rlhf prompts
    
    * add rlhf prompts
    
    * add rlhf prompts
    
    * add rlhf prompts
    
    * add rlhf prompts
    
    * fix prompt
    
    * fix prompt
    
    * modify mcli
    
    * test
    
    * test
    
    * fix
    
    * fix merge
    
    * wip
    
    * merge
    
    * reset files, wip commit
    
    * rm small changes
    
    * reduce changes
    
    * reduce changes
    
    * .
    
    * wip
    
    * rm batch keys
    
    * revert init device
    
    * linting
    
    * add import
    
    * fix import
    
    * add eval_output_logging to registry
    
    * readd import
    
    * pyright + linting
    
    ---------
    
    Co-authored-by: dblalock <[email protected]>
    Co-authored-by: Jeremy Dohmann <[email protected]>
    3 people authored Mar 28, 2024
    Configuration menu
    Copy the full SHA
    7a8a156 View commit details
    Browse the repository at this point in the history

Commits on Apr 1, 2024

  1. Configuration menu
    Copy the full SHA
    349c2ff View commit details
    Browse the repository at this point in the history
  2. Check the user provided eos / bos token id against the tokenizer eos …

    …/ bos token id (mosaicml#1039)
    
    * lint
    
    * lint
    
    * added warning and error message instead of setting the eos and bos token ids
    
    * Update text_data.py
    
    Adding info about the override flags in the error message.
    
    * Update llmfoundry/data/text_data.py
    
    Co-authored-by: Vitaliy Chiley <[email protected]>
    
    * Update llmfoundry/data/text_data.py
    
    Co-authored-by: Vitaliy Chiley <[email protected]>
    
    * adding warning if user does not provide eos or bos token id
    
    * adding warning if user does not provide eos or bos token id
    
    ---------
    
    Co-authored-by: Vitaliy Chiley <[email protected]>
    ShashankMosaicML and vchiley authored Apr 1, 2024
    Configuration menu
    Copy the full SHA
    d8ea2c5 View commit details
    Browse the repository at this point in the history

Commits on Apr 2, 2024

  1. Triton RMSNorm (mosaicml#1050)

    * Triton RMS Norm
    
    * fix
    
    * refactor
    
    * Fix test
    
    * no bias
    
    * Update tests/models/test_model.py
    
    Co-authored-by: Daniel King <[email protected]>
    
    * fixes
    
    * Unittest
    
    * fix
    
    * fix2
    
    * fix3
    
    * fix3
    
    ---------
    
    Co-authored-by: Daniel King <[email protected]>
    josejg and dakinggg authored Apr 2, 2024
    Configuration menu
    Copy the full SHA
    b765b47 View commit details
    Browse the repository at this point in the history
  2. Fix tiktoken vocab size (mosaicml#1081)

    * fix vocab size bug
    
    * more tests
    dakinggg authored Apr 2, 2024
    Configuration menu
    Copy the full SHA
    caf7fda View commit details
    Browse the repository at this point in the history
  3. Doing the loss reduction in foundry instead of in the loss functions. (

    …mosaicml#1079)
    
    * setting loss_fn reduction to None
    
    * fixing a unit test
    
    * add error message
    
    * adding test to check reduction
    
    * adding test to check reduction
    
    * Update llmfoundry/models/mpt/modeling_mpt.py
    
    Co-authored-by: Vitaliy Chiley <[email protected]>
    
    * preserving batch dimension of targets
    
    * minor change
    
    ---------
    
    Co-authored-by: Vitaliy Chiley <[email protected]>
    Co-authored-by: Daniel King <[email protected]>
    3 people authored Apr 2, 2024
    Configuration menu
    Copy the full SHA
    632cb73 View commit details
    Browse the repository at this point in the history
  4. remove (mosaicml#1082)

    mvpatel2000 authored Apr 2, 2024
    Configuration menu
    Copy the full SHA
    580a4b0 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    394735b View commit details
    Browse the repository at this point in the history

Commits on Apr 3, 2024

  1. Configuration menu
    Copy the full SHA
    d452c60 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    23c3173 View commit details
    Browse the repository at this point in the history

Commits on Apr 4, 2024

  1. Configuration menu
    Copy the full SHA
    5455b40 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    c766cf9 View commit details
    Browse the repository at this point in the history
  3. log packing ratio progress (mosaicml#1070)

    * log packing ratio progress
    
    * use tqdm to show progress of packing ratio profiling
    
    * fix pbar
    
    * Update packing.py
    
    * added log to search for
    
    * fix
    
    * good old fashioned progress logging
    
    * Update packing.py
    milocress authored Apr 4, 2024
    Configuration menu
    Copy the full SHA
    e70e424 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    26f0619 View commit details
    Browse the repository at this point in the history
  5. fix typo in expandable_segments (mosaicml#1088)

    Co-authored-by: Daniel King <[email protected]>
    mammothb and dakinggg authored Apr 4, 2024
    Configuration menu
    Copy the full SHA
    06ff95f View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    607b982 View commit details
    Browse the repository at this point in the history
  7. Configuration menu
    Copy the full SHA
    f18768b View commit details
    Browse the repository at this point in the history
  8. Configuration menu
    Copy the full SHA
    96b27c5 View commit details
    Browse the repository at this point in the history
  9. Configuration menu
    Copy the full SHA
    94301cd View commit details
    Browse the repository at this point in the history

Commits on Apr 5, 2024

  1. update (mosaicml#1097)

    dakinggg authored Apr 5, 2024
    Configuration menu
    Copy the full SHA
    60a1ab4 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    b81897a View commit details
    Browse the repository at this point in the history

Commits on Apr 8, 2024

  1. Configuration menu
    Copy the full SHA
    d0d9434 View commit details
    Browse the repository at this point in the history

Commits on Apr 9, 2024

  1. Updating the streaming version in setup.py (mosaicml#1103)

    * updating the streaming version in setup.py
    
    * updating constructor call to StreamingDataset
    
    * making allow_unsafe_types and replication configurable through dataset configs
    
    * adding docstring
    ShashankMosaicML authored Apr 9, 2024
    Configuration menu
    Copy the full SHA
    2939cc9 View commit details
    Browse the repository at this point in the history
  2. MegaBlocks release (mosaicml#1102)

    * [Stage] Megablocks release (mosaicml#241)
    
    * V1 of MegaBlocks
    ---------
    
    * fix hf ckptr
    
    * rename
    
    * lint
    
    * lint
    
    ---------
    
    Co-authored-by: Abhinav Venigalla <[email protected]>
    Co-authored-by: Sasha Doubov <[email protected]>
    Co-authored-by: Cheng Li <[email protected]>
    Co-authored-by: Ning Wang <[email protected]>
    Co-authored-by: Irene Dea <[email protected]>
    Co-authored-by: Shashank Rajput <[email protected]>
    Co-authored-by: Chuck Tang <[email protected]>
    Co-authored-by: Jose Javier <[email protected]>
    Co-authored-by: Angel Ruiz <[email protected]>
    Co-authored-by: Denny Lee <[email protected]>
    Co-authored-by: Jane Zhang <[email protected]>
    Co-authored-by: Daniel King <[email protected]>
    Co-authored-by: Chuck Tang <[email protected]>
    Co-authored-by: Vitaliy Chiley <[email protected]>
    15 people authored Apr 9, 2024
    Configuration menu
    Copy the full SHA
    53160f4 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    f12bc8a View commit details
    Browse the repository at this point in the history

Commits on Apr 10, 2024

  1. Configuration menu
    Copy the full SHA
    17f8aeb View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    7337429 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    b5fc0fa View commit details
    Browse the repository at this point in the history

Commits on Apr 11, 2024

  1. Support ShareGPT chat format (mosaicml#1098)

    This adds support for the other common chat format. We just remap keys and add a new role.
    samhavens authored Apr 11, 2024
    Configuration menu
    Copy the full SHA
    4cd2324 View commit details
    Browse the repository at this point in the history

Commits on Apr 12, 2024

  1. Configuration menu
    Copy the full SHA
    ed3daef View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    560012b View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    e9b1c6e View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    b58d68c View commit details
    Browse the repository at this point in the history
  5. Update config_moe_args.py (mosaicml#1112)

    mosaicml#1111 needed to revert mosaicml#1104 because the mosaicml#1104 PR caused issues. Removing TODO and marking Jira with wont-do
    vchiley authored Apr 12, 2024
    Configuration menu
    Copy the full SHA
    6257e5b View commit details
    Browse the repository at this point in the history
  6. Migrate ICL classes to foundry (mosaicml#936)

    * start
    
    * still need to migrate fixtures
    
    * wip onboarding tests
    
    * still workin'
    
    * still wip
    
    * maybe done; test out on mcli now
    
    * mcli
    
    * remove calibration error
    
    * migration
    
    * migration
    
    * full migration
    
    * precommit
    
    * fix
    
    * fix pytests
    
    * refactor QA
    
    * update
    
    * restore
    
    * add
    
    * fix
    
    * wip
    
    * update readme
    
    * final pyright
    
    * done
    
    * pass prelimiter into ALL the ICL task datasets
    
    * allow QA task name stil lfor backward compatibility
    
    * fix
    
    * fix test
    
    * add generation length
    
    * remove max_new_tokens
    
    * fix cpu trsts
    
    * try and fix lm eval test
    
    * temp disable lm task eval test
    
    * fix test?
    
    * fix tet
    
    * finish
    
    * fix
    
    * Update scripts/eval/README.md
    
    Co-authored-by: Daniel King <[email protected]>
    
    * fix comments
    
    * fix bug with seq len
    
    * restore mcli
    
    * merge
    
    * fix builder
    
    * add deprecation warning
    
    * add deprecation warning
    
    * merge
    
    * merge
    
    * add logging necessities to nlp.py
    
    * add attention_mask test update
    
    * fix generation_length in tests
    
    * fix bug
    
    * restore yamls
    
    * fix typos
    
    * add deprecation warning for code
    
    * pyright wip
    
    * fix pyright
    
    * fix pyright error again
    
    * fix pyright
    
    * fix pyright
    
    * update version
    
    ---------
    
    Co-authored-by: Eitan Turok <[email protected]>
    Co-authored-by: Max Marion <[email protected]>
    Co-authored-by: Daniel King <[email protected]>
    Co-authored-by: Max Marion <[email protected]>
    5 people authored Apr 12, 2024
    Configuration menu
    Copy the full SHA
    3729ba3 View commit details
    Browse the repository at this point in the history
  7. Configuration menu
    Copy the full SHA
    cb0de4f View commit details
    Browse the repository at this point in the history

Commits on Apr 13, 2024

  1. Configuration menu
    Copy the full SHA
    676ad7f View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    f01f625 View commit details
    Browse the repository at this point in the history

Commits on Apr 18, 2024

  1. Configuration menu
    Copy the full SHA
    84b6410 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    4bb4d4a View commit details
    Browse the repository at this point in the history

Commits on Apr 19, 2024

  1. Configuration menu
    Copy the full SHA
    20cb40c View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    63a7f12 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    698206d View commit details
    Browse the repository at this point in the history
  4. catch misconfigured hf dataset (mosaicml#1123)

    * catch misconfigured hf dataset
    
    * update unit tests
    
    * removed unused import
    
    * use new dataset
    
    * Update llmfoundry/data/finetuning/tasks.py
    
    Co-authored-by: Jane Zhang <[email protected]>
    
    * Update llmfoundry/utils/exceptions.py
    
    Co-authored-by: Daniel King <[email protected]>
    
    ---------
    
    Co-authored-by: Daniel King <[email protected]>
    Co-authored-by: Jane Zhang <[email protected]>
    3 people authored Apr 19, 2024
    Configuration menu
    Copy the full SHA
    f0646e8 View commit details
    Browse the repository at this point in the history

Commits on Apr 20, 2024

  1. Pin mlflow (mosaicml#1124)

    dakinggg authored Apr 20, 2024
    Configuration menu
    Copy the full SHA
    3426415 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    6caa75a View commit details
    Browse the repository at this point in the history

Commits on Apr 22, 2024

  1. Configuration menu
    Copy the full SHA
    0c6bd75 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    4952183 View commit details
    Browse the repository at this point in the history

Commits on Apr 23, 2024

  1. Configuration menu
    Copy the full SHA
    c53622e View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    0d62e61 View commit details
    Browse the repository at this point in the history

Commits on Apr 24, 2024

  1. Mlflow datasets (mosaicml#1119)

    * add logger
    
    * reqs
    
    * small fix
    
    * import mlflow
    
    * parse_uri
    
    * parse_uri
    
    * finished debug
    
    * precommit
    
    * more code fix
    
    * revert setup
    
    * better dovs
    
    * rm docstr
    
    * precommit
    
    * Update tests to not rely on mistral (mosaicml#1117)
    
    * Bump transformers to 4.40 (mosaicml#1118)
    
    * merge
    
    * revert setup
    
    * precommit
    
    * precommit
    
    * tweaks to resolve comments
    
    * unit test
    
    * code quality
    
    * quotation
    
    * quote
    
    * more quality
    
    * optional
    
    * pyright
    
    * type check
    
    * rm typechecking
    
    * yapf
    
    * first pass
    
    * fix
    
    * get refactor
    
    * refactor
    
    * local hf path
    
    * dbfs
    
    * rm local
    
    * typo
    
    * second pass
    
    * update
    
    * third pass
    
    * os.path.join
    
    * precommit
    
    ---------
    
    Co-authored-by: Daniel King <[email protected]>
    KuuCi and dakinggg authored Apr 24, 2024
    Configuration menu
    Copy the full SHA
    72da1d7 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    6252f79 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    76f74b6 View commit details
    Browse the repository at this point in the history

Commits on Apr 25, 2024

  1. Configuration menu
    Copy the full SHA
    15abf8c View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    24f65fd View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    4aef5de View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    6cfd2a3 View commit details
    Browse the repository at this point in the history

Commits on Apr 26, 2024

  1. Configuration menu
    Copy the full SHA
    f97f02e View commit details
    Browse the repository at this point in the history

Commits on Apr 27, 2024

  1. Configuration menu
    Copy the full SHA
    8be3254 View commit details
    Browse the repository at this point in the history

Commits on Apr 29, 2024

  1. Configuration menu
    Copy the full SHA
    704a90a View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    fbcf311 View commit details
    Browse the repository at this point in the history
  3. build inner model (mosaicml#1147)

    * build inner model
    
    * remove tokenizer arg
    
    * fix namespace overlap
    
    * change the type ignore location
    
    * PeftModel -> 'PeftModel'
    
    * additional fsdp preparation
    
    * init device
    
    * prepare inner model -> prepare for fsdp
    
    * update superclass type sig
    
    * added docstring
    
    * fix docstring style
    
    ---------
    
    Co-authored-by: Daniel King <[email protected]>
    milocress and dakinggg authored Apr 29, 2024
    Configuration menu
    Copy the full SHA
    de5a394 View commit details
    Browse the repository at this point in the history
  4. fix DatasetConstants.splints default value to protect dictionary over…

    …writing (mosaicml#1144)
    
    * fix DatasetConstants.splints default value; correct typos for raw_samples amount; correct type hinting for raw_samples
    
    * Update scripts/data_prep/convert_dataset_hf.py
    
    ---------
    
    Co-authored-by: Daniel King <[email protected]>
    ivan-kud and dakinggg authored Apr 29, 2024
    Configuration menu
    Copy the full SHA
    738956e View commit details
    Browse the repository at this point in the history

Commits on Apr 30, 2024

  1. Configuration menu
    Copy the full SHA
    bd0d1cb View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    6fde283 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    63ac1a4 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    b570b61 View commit details
    Browse the repository at this point in the history

Commits on May 1, 2024

  1. Configuration menu
    Copy the full SHA
    2f58965 View commit details
    Browse the repository at this point in the history
  2. Move sentencepiece import (mosaicml#1157)

    * Move sentencepiece import
    
    * Update llmfoundry/utils/checkpoint_conversion_helpers.py
    
    Co-authored-by: Daniel King <[email protected]>
    
    ---------
    
    Co-authored-by: Daniel King <[email protected]>
    aspfohl and dakinggg authored May 1, 2024
    Configuration menu
    Copy the full SHA
    6561330 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    5f39606 View commit details
    Browse the repository at this point in the history
  4. Uncomment GPU tests (mosaicml#1162)

    * Uncomment GPU tests
    
    * Update pr-gpu.yaml
    milocress authored May 1, 2024
    Configuration menu
    Copy the full SHA
    fa7a78a View commit details
    Browse the repository at this point in the history

Commits on May 2, 2024

  1. Configuration menu
    Copy the full SHA
    124de4c View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    0d58f46 View commit details
    Browse the repository at this point in the history
  3. Bump min torch version to 2.3.0 (mosaicml#1152)

    * bump min torch
    
    * make random correct less likely
    
    * bf16 precisioin
    
    * fix
    
    * new dataset?
    
    * try again
    
    * debug
    
    * debug
    
    * pin hf hub
    
    * remove debug
    
    ---------
    
    Co-authored-by: Milo Cress <[email protected]>
    dakinggg and milocress authored May 2, 2024
    Configuration menu
    Copy the full SHA
    a3e0fb9 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    10b7caf View commit details
    Browse the repository at this point in the history
  5. refactoring dataloader into registries. (mosaicml#1165)

    * refactoring dataloader into registries
    
    * lint
    
    * fixing get_tokens_per_batch_func import issues
    
    * lint
    
    * resolving comments
    
    * resolving comments
    
    ---------
    
    Co-authored-by: Daniel King <[email protected]>
    ShashankMosaicML and dakinggg authored May 2, 2024
    Configuration menu
    Copy the full SHA
    3b82735 View commit details
    Browse the repository at this point in the history
  6. Migrate eval output logging to foundry (mosaicml#1166)

    * initial commit
    
    * linting
    
    * change callback to import from foundry
    
    * add types
    
    * linting
    
    * linting
    
    * pyright
    
    * linting
    
    * linting
    maxisawesome authored May 2, 2024
    Configuration menu
    Copy the full SHA
    ddf4aa4 View commit details
    Browse the repository at this point in the history

Commits on May 3, 2024

  1. Configuration menu
    Copy the full SHA
    c0d591c View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    bfbb8c5 View commit details
    Browse the repository at this point in the history

Commits on May 6, 2024

  1. Configuration menu
    Copy the full SHA
    ab9dde7 View commit details
    Browse the repository at this point in the history

Commits on May 8, 2024

  1. Bump version v0.9.0.dev0 (mosaicml#1181)

    * update init.py
    
    * deprecate
    
    * delete in context learning code eval dataset
    
    * remove deprecated export
    
    * removed more code_eval stuff
    
    * oopsie
    
    * guh
    
    * fix
    milocress authored May 8, 2024
    Configuration menu
    Copy the full SHA
    a777014 View commit details
    Browse the repository at this point in the history
  2. structuredconfig for train.py and eval.py (mosaicml#1051)

    * first commit for structuredconfig for train.py
    
    * revamp configs
    
    * wip latest issue
    
    * reorder so mandatory attributes come first
    
    * fix
    
    * fix
    
    * fix fix
    
    * fix types
    
    * fix dictconfig
    
    * fix union of list|dict configs
    
    * fix type annotation
    
    * oops
    
    * fixed configs
    
    * add save ignore keys
    
    * fix batch size kerfuffle
    
    * fix dictconfig stuff
    
    * fix dictconfig stuff again
    
    * fix
    
    * fix
    
    * updated unit tests for variables
    
    * last fix?
    
    * if this test case does not pass I will venmo Mihir 0
    
    * remove a 'not' -- eg. 'I am not going crazy'
    
    * Update scripts/train/train.py
    
    Co-authored-by: Daniel King <[email protected]>
    
    * set amp bf16 as default precision, etc
    
    * temporarily wrap with dictconfig before ** migration
    
    * fix icl tasks
    
    * fix
    
    * fix activation checkpointing reentrant
    
    * fix extraneous keys
    
    * first round **
    
    * fix?
    
    * quick fsdp config fix
    
    * updated yamls to make variables explicit
    
    * remove precision from mandatory params list
    
    * I expect many of these to fail in interesting ways
    
    * fix test_model test cases with **
    
    * fix many more test cases
    
    * fix dictconfig objectification
    
    * fix remaining  test cases
    
    * remove unneeded **
    
    * fix test case
    
    * changed back argument name
    
    * fix
    
    * ** for finetuning dataloader
    
    * fix?
    
    * fix dataloader
    
    * fix
    
    * fix finetuning dataloader
    
    * fix build_text_dataloader
    
    * left to my own devices
    
    * fix packing
    
    * fix typo
    
    * fix padding test cases
    
    * ignore extra parameters and warn
    
    * fix style
    
    * fix quality checks
    
    * fix code quality
    
    * pyright-fu
    
    * fix
    
    * just one more type constraint bro
    
    * OmegaConf -> om
    
    * rename variables for clarity
    
    * revert file
    
    * revert file II
    
    * revert file III: revert of the sith
    
    * peft revert file
    
    * revert v_mpt
    
    * last revert
    
    * remove redundant checks
    
    * deprecate
    
    * make cleaner
    
    * pyright is bullying me again
    
    * further clean config_utils
    
    * polish train
    
    * polish train and eval
    
    * fix dist
    
    * fix style
    
    * organize eval and train
    
    * fix
    
    * used helper function to make main cleaner
    
    * fix stuff
    
    * fix pyright
    
    * added fix and explanation
    
    * fix typo in unit test update smh
    
    * Update llmfoundry/registry.py
    
    Co-authored-by: Daniel King <[email protected]>
    
    * Update scripts/train/train.py
    
    Co-authored-by: Daniel King <[email protected]>
    
    * Update scripts/train/train.py
    
    Co-authored-by: Daniel King <[email protected]>
    
    * Update scripts/train/train.py
    
    Co-authored-by: Daniel King <[email protected]>
    
    * Apply suggestions from code review
    
    Co-authored-by: Daniel King <[email protected]>
    
    * see if this fails
    
    * reject name and device rather than ignoring
    
    * pretrained is not a bool
    
    * add validation to make sure the user doesn't set both
    
    * forbid config keys
    
    * oops forgot eval
    
    * address coomments
    
    * removed redundant check
    
    * updated callsites not to use name
    
    * fix
    
    * validate extraneous keys in dataloader
    
    * fix
    
    * fix more
    
    * fix III: revenge of the fix
    
    * fix IV: a new hope
    
    * fix V: the empire fixes back
    
    * fixed some more types
    
    * fix VI: return of the fix
    
    * fix VII: the fix awakens
    
    * fix VIII: the last bug
    
    * fix
    
    * final fix I think
    
    * fixed
    
    * fix style
    
    * fix
    
    * fix fix
    
    * fix fix style
    
    * icl task config
    
    * fix train
    
    * fix finetuning dataloader
    
    * fix train types
    
    * fix token counting
    
    * fix train types
    
    * oopsie
    
    * fix straggler issues
    
    * fix tests
    
    * fix???
    
    * fix hf v mpt gpu test and fmapi test
    
    * pop device
    
    * to_str_dict -> to_dict_recursive
    
    * fix this darn unit test one more time
    
    * fix ComposerMPTCausalLM constructor invocation
    
    * Delete tests/models/hf/test_hf_fsdp.py
    
    * unwrap model in unit tests
    
    * model.model.model.model.model
    
    * abstract away dataclass construction
    
    * updated docstrings and removed dictconfig from logging logic
    
    * flag icl tasks required or not
    
    * updated a couple yamls
    
    * updated train and eval scripts
    
    * un-delete global train batch size
    
    * fix
    
    * I don't understand why this doesn't work
    
    * that was the sneakiest bug I've ever fixed
    
    * try to fix the regression test
    
    * remove device train grad accum
    
    * fix validate config
    
    * removed unused import
    
    * use variables
    
    * missing mandatory value fix
    
    * use correct type of error
    
    * fix
    
    * import TrainConfig just in case?
    
    * moved trainconfig and evalconfig into utils
    
    * works
    
    * no cheating
    
    * dicts everywhere gah
    
    * try no recursive just
    
    * rename typed helpers
    
    * fix the test cases with deep magic
    
    * towards a peaceful resolution
    
    * remove comments
    
    * fix type warnings
    
    * Update llmfoundry/utils/config_utils.py
    
    Co-authored-by: Daniel King <[email protected]>
    
    * address low-hanging fruit
    
    * remove peft wrapping extra model
    
    * python 🤝 haskell
    
    * dataset config should be dict
    
    * just because omega starts with OMMMM does not mean it's zen
    
    * fix
    
    * fix
    
    * structured settlement
    
    * precision further down
    
    * throws TypeError instead of MissingMandatoryValue or whatever
    
    * remove debugging statement
    
    * remove to_container calls everywhere
    
    * wrap then unwrap
    
    * pyright
    
    * error early on missing mandatory values
    
    * remove unnecessory ignore
    
    * update unit tests
    
    * update eval yamls
    
    * Update train.py
    
    * make log level optional again
    
    * oopsie
    
    * use keywords for arg clarity
    
    * use keywords for arg clarity
    
    * style
    
    * style
    
    * dist timeout
    
    * resolve deeper conflict issues
    
    * fix train.py
    
    * fix registry
    
    * fix dataloader
    
    * fix train II
    
    * fix dataloader and utils
    
    * fix dictconfig
    
    * skill issue
    
    * add new keys
    
    * remove pop_config
    
    * fix
    
    ---------
    
    Co-authored-by: Daniel King <[email protected]>
    milocress and dakinggg authored May 8, 2024
    Configuration menu
    Copy the full SHA
    cc8351c View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    46b8bee View commit details
    Browse the repository at this point in the history
  4. Refactoring attention (mosaicml#1182)

    * refactoring
    
    * adding back a function that got deleted by mistake
    
    * adding co-authors
    Co-Authored-By: Vitaliy Chiley <[email protected]>
    Co-Authored-By: Cheng Li <[email protected]>
    
    * adding co-authors
    Co-Authored-By: Vitaliy Chiley <[email protected]>
    
    * adding co-authors
    Co-authored-by: Vitaliy Chiley <[email protected]>
    Co-authored-by: Vitaliy Chiley <[email protected]>
    
    * Update config_utils.py
    
    adding co-authors
    Co-authored-by: Vitaliy Chiley <[email protected]>
    Co-authored-by: Vitaliy Chiley <[email protected]>
    Co-authored-by: Cheng Li <[email protected]>
    Co-authored-by: Cheng Li <@cli99>
    
    * lint
    
    Co-authored-by: Vitaliy Chiley <[email protected]>
    Co-authored-by: Vitaliy Chiley <[email protected]>
    
    * Adding_co_authors
    
    Co-authored-by: Vitaliy Chiley <[email protected]>
    Co-authored-by: Vitaliy Chiley <[email protected]>
    Co-authored-by: Cheng Li <[email protected]>
    
    * Update llmfoundry/models/mpt/modeling_mpt.py
    
    Co-authored-by: Daniel King <[email protected]>
    
    * addressing comments
    
    * adding_co_authors
    
    Co-authored-by: Cheng Li <[email protected]>
    
    * Update llmfoundry/utils/config_utils.py
    
    ---------
    
    Co-authored-by: Vitaliy Chiley <[email protected]>
    Co-authored-by: Vitaliy Chiley <[email protected]>
    Co-authored-by: Cheng Li <[email protected]>
    Co-authored-by: Daniel King <[email protected]>
    Co-authored-by: Cheng Li <[email protected]>
    6 people authored May 8, 2024
    Configuration menu
    Copy the full SHA
    ac563e6 View commit details
    Browse the repository at this point in the history

Commits on May 9, 2024

  1. Configuration menu
    Copy the full SHA
    0c7bc2a View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    c6679d3 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    139abab View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    51d0d09 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    1dd37c5 View commit details
    Browse the repository at this point in the history

Commits on May 10, 2024

  1. fix eval (mosaicml#1193)

    Co-authored-by: Daniel King <[email protected]>
    milocress and dakinggg authored May 10, 2024
    Configuration menu
    Copy the full SHA
    983234d View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    994209c View commit details
    Browse the repository at this point in the history

Commits on May 11, 2024

  1. Configuration menu
    Copy the full SHA
    eef4872 View commit details
    Browse the repository at this point in the history

Commits on May 14, 2024

  1. Configuration menu
    Copy the full SHA
    0449b60 View commit details
    Browse the repository at this point in the history
  2. Clearer error message for unknown example type (mosaicml#1202)

    * error when example is a string
    
    * make error message the same
    
    * strings only
    
    ---------
    
    Co-authored-by: Daniel King <[email protected]>
    milocress and dakinggg authored May 14, 2024
    Configuration menu
    Copy the full SHA
    8274c6c View commit details
    Browse the repository at this point in the history

Commits on May 15, 2024

  1. Added torch_dmoe defaults, bug fixes for 2D inputs (mosaicml#1210)

    * defaults for torch dmoe match mb dmoe
    
    * defaults for torch dmoe match mb dmoe
    
    * defaults for torch dmoe match mb dmoe
    
    * defaults for torch dmoe match mb dmoe
    
    * defaults for torch dmoe match mb dmoe
    
    * defaults for torch dmoe match mb dmoe
    
    * defaults for torch dmoe match mb dmoe
    
    * top k proper
    
    * permute fix
    
    * permute fix
    
    * permute fix
    
    * permute fix
    
    * permute fix
    
    * narrow down world size bug
    
    * narrow down world size bug
    
    * narrow down world size bug
    
    * narrow down world size bug
    
    * narrow down world size bug
    
    * narrow down world size bug
    
    * narrow down world size bug
    
    * narrow down world size bug
    
    * narrow down world size bug
    
    * narrow down world size bug
    
    * narrow down world size bug
    
    * narrow down world size bug
    
    * narrow down world size bug
    
    * narrow down world size bug
    
    * blocking dimension bug
    
    * done
    
    * yo
    
    * yo
    snarayan21 authored May 15, 2024
    Configuration menu
    Copy the full SHA
    b414626 View commit details
    Browse the repository at this point in the history
  2. log eval dataset misconfiguration (mosaicml#1179)

    * log eval dataset misconfiguration
    
    * use context
    
    * literally
    
    * BaseException -> Exception
    
    * use my archaeological skills to find the right python syntax for 3.9
    
    * refactor names for more general use
    
    * oops
    
    * oops II
    
    * context -> location
    
    * use variables instead of strings
    
    * Update exceptions.py
    
    * delete Mapping
    milocress authored May 15, 2024
    Configuration menu
    Copy the full SHA
    cfee4e4 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    8cd23d5 View commit details
    Browse the repository at this point in the history

Commits on May 16, 2024

  1. Configuration menu
    Copy the full SHA
    dc3212e View commit details
    Browse the repository at this point in the history
  2. TransformerEngine Image Build (mosaicml#1204)

    * install
    
    * fix
    
    * try chucks branch
    
    * fix image
    
    * pin
    
    * fix lint
    
    * revert image size
    
    * remove verbose
    
    * switch to 8wide
    
    ---------
    mvpatel2000 authored May 16, 2024
    Configuration menu
    Copy the full SHA
    e70891b View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    3fe7f09 View commit details
    Browse the repository at this point in the history
  4. Make fc_type a dict to pass fc kwargs through (mosaicml#1201)

    * fc type as fict
    
    * fc type as dict
    
    * fc type as dict
    
    * rework ffn fc config slightly
    
    * rework ffn fc config slightly
    
    * merged main
    
    * merged main
    
    * no circular imports
    
    * Update llmfoundry/models/mpt/configuration_mpt.py
    
    Co-authored-by: Daniel King <[email protected]>
    
    * them configs
    
    * yo
    
    * linting man
    
    * the deep copy
    
    ---------
    
    Co-authored-by: Daniel King <[email protected]>
    snarayan21 and dakinggg authored May 16, 2024
    Configuration menu
    Copy the full SHA
    38ae65b View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    3a15082 View commit details
    Browse the repository at this point in the history

Commits on May 17, 2024

  1. that mod (mosaicml#1219)

    snarayan21 authored May 17, 2024
    Configuration menu
    Copy the full SHA
    77f9ab1 View commit details
    Browse the repository at this point in the history

Commits on May 22, 2024

  1. Configuration menu
    Copy the full SHA
    001e7c3 View commit details
    Browse the repository at this point in the history
  2. Add user error superclass (mosaicml#1225)

    * Add user error superclass
    
    * update class inheritance structure
    milocress authored May 22, 2024
    Configuration menu
    Copy the full SHA
    8e29698 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    c891bed View commit details
    Browse the repository at this point in the history
  4. Quick patch to check that Dataset Keys contain non-None Values (mosai…

    …cml#1228)
    
    * quick patch
    
    * also seperately check for local path
    
    * typo
    
    * typo
    
    ---------
    
    Co-authored-by: Daniel King <[email protected]>
    KuuCi and dakinggg authored May 22, 2024
    Configuration menu
    Copy the full SHA
    9cc945c View commit details
    Browse the repository at this point in the history

Commits on May 23, 2024

  1. Configuration menu
    Copy the full SHA
    9120c27 View commit details
    Browse the repository at this point in the history
  2. Loss v len callback (mosaicml#1226)

    * adding loss v len callback
    
    * lint
    
    * lint
    
    * using model's loss function
    
    * lint
    
    * only log to mlflow logger
    
    * incorporating suggestions
    
    * incorporating comments
    
    ---------
    
    Co-authored-by: Daniel King <[email protected]>
    ShashankMosaicML and dakinggg authored May 23, 2024
    Configuration menu
    Copy the full SHA
    c213ea8 View commit details
    Browse the repository at this point in the history
  3. Fixing the state.timestamp.batch.value issue in loss v len callback (m…

    …osaicml#1232)
    
    * adding print statements
    
    * testing fix
    
    * fix
    
    * removing print statements
    
    * minor fix
    ShashankMosaicML authored May 23, 2024
    Configuration menu
    Copy the full SHA
    6fa6026 View commit details
    Browse the repository at this point in the history

Commits on May 24, 2024

  1. fix attr error for attention_classes (mosaicml#1230)

    Co-authored-by: Daniel King <[email protected]>
    cli99 and dakinggg authored May 24, 2024
    Configuration menu
    Copy the full SHA
    b4bb34c View commit details
    Browse the repository at this point in the history
  2. fix typing (mosaicml#1235)

    dakinggg authored May 24, 2024
    Configuration menu
    Copy the full SHA
    09d8892 View commit details
    Browse the repository at this point in the history
  3. Add example eval scripts for dbrx PT sizes (mosaicml#1218)

    * Add example eval scripts for dbrx PT sizes
    
    * fix name
    
    * dsjd
    
    * yaml linting
    
    * update tokenizer
    aspfohl authored May 24, 2024
    Configuration menu
    Copy the full SHA
    ef530bf View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    ff92f3c View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    fdaa58b View commit details
    Browse the repository at this point in the history
  6. Move MLFlow dataset outside of log_config (mosaicml#1234)

    * move txt log
    
    * typo
    
    * Update scripts/train/train.py
    
    * train config
    
    * debug
    
    * source data
    
    * verbose
    
    * debug
    
    * debug
    
    * check if mlflow is active
    
    * fex tests
    
    * move mlflow check to train
    
    * update test
    
    * precommit
    
    ---------
    
    Co-authored-by: Daniel King <[email protected]>
    KuuCi and dakinggg authored May 24, 2024
    Configuration menu
    Copy the full SHA
    1e4bd37 View commit details
    Browse the repository at this point in the history

Commits on May 25, 2024

  1. add error when chat template fails (mosaicml#1222)

    * add error when chat template fails
    
    * type
    
    * formatting
    milocress authored May 25, 2024
    Configuration menu
    Copy the full SHA
    2e10d95 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    867e58f View commit details
    Browse the repository at this point in the history

Commits on May 27, 2024

  1. Configuration menu
    Copy the full SHA
    c9257b5 View commit details
    Browse the repository at this point in the history

Commits on May 28, 2024

  1. Configuration menu
    Copy the full SHA
    43d149b View commit details
    Browse the repository at this point in the history

Commits on May 29, 2024

  1. Configuration menu
    Copy the full SHA
    b82a82b View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    fb9a225 View commit details
    Browse the repository at this point in the history

Commits on Jun 3, 2024

  1. Configuration menu
    Copy the full SHA
    d846731 View commit details
    Browse the repository at this point in the history

Commits on Jun 4, 2024

  1. replacing icl_task_type question_answering with generation_task_with_…

    …answers in the long context eval yaml (mosaicml#1250)
    ShashankMosaicML authored Jun 4, 2024
    Configuration menu
    Copy the full SHA
    6c260f5 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    ac56dc5 View commit details
    Browse the repository at this point in the history

Commits on Jun 6, 2024

  1. Configuration menu
    Copy the full SHA
    67928cb View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    3966f0e View commit details
    Browse the repository at this point in the history
  3. Adding more token encoding types (mosaicml#1254)

    * add more token encoing types
    
    * add more token encoing types
    
    * add tests
    
    * add tests
    
    * ft support, tests
    
    * linting is shortening my lifespan
    
    * linting is shortening my lifespan
    
    * long tensor
    
    * long tensor
    
    * long tensor
    
    * feedbacc
    
    * import
    
    * import
    
    ---------
    
    Co-authored-by: Daniel King <[email protected]>
    snarayan21 and dakinggg authored Jun 6, 2024
    Configuration menu
    Copy the full SHA
    42c2d9a View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    14f296c View commit details
    Browse the repository at this point in the history

Commits on Jun 7, 2024

  1. Bump Version to 0.10.0.dev0 (mosaicml#1255)

    * bump version
    
    * typo
    
    * Update config_utils.py
    
    These changes are necessary as the deprecation broke compatibility with `update_batch_size`.
    
    * Update config_utils.py
    
    fix typo
    
    * typo
    
    * typo I
    
    * update tests
    
    * typo II
    
    * typo III
    
    * bump composer version
    
    * undo composer bump for seperate pr
    
    * fix test
    
    * fix tests II
    
    * yolo
    
    * tye-o
    
    * pyrite
    
    * we resolve later
    
    * revert new . syntax
    
    ---------
    
    Co-authored-by: v-chen_data <[email protected]>
    Co-authored-by: Milo Cress <[email protected]>
    Co-authored-by: Saaketh Narayan <[email protected]>
    Co-authored-by: Daniel King <[email protected]>
    5 people authored Jun 7, 2024
    Configuration menu
    Copy the full SHA
    bea61fb View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    e4b8b57 View commit details
    Browse the repository at this point in the history
  3. Update TE Dockerfile (mosaicml#1265)

    Update Dockerfile with TE main
    j316chuck authored Jun 7, 2024
    Configuration menu
    Copy the full SHA
    db70135 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    4e53e74 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    dddb9b8 View commit details
    Browse the repository at this point in the history

Commits on Jun 8, 2024

  1. Configuration menu
    Copy the full SHA
    dd92abf View commit details
    Browse the repository at this point in the history

Commits on Jun 9, 2024

  1. fix linting (mosaicml#1270)

    * fix linting
    
    * fix
    milocress authored Jun 9, 2024
    Configuration menu
    Copy the full SHA
    5571101 View commit details
    Browse the repository at this point in the history

Commits on Jun 13, 2024

  1. Configuration menu
    Copy the full SHA
    ffec54b View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    c30856f View commit details
    Browse the repository at this point in the history

Commits on Jun 14, 2024

  1. Configuration menu
    Copy the full SHA
    630fc68 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    9b9fc24 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    1a2fac0 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    82ef072 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    4350990 View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    dbd798e View commit details
    Browse the repository at this point in the history

Commits on Jun 15, 2024

  1. Dbfs HF (mosaicml#1214)

    * remerge
    
    * import supported extensions
    
    * no circular imports
    
    * local import
    
    * typo
    
    * testing
    
    * test
    
    * test
    
    * more test
    
    * rm . from ext
    
    * clean up and pre commit
    
    * imports and boolean typo
    
    * revert one exception, use backend over startwith, and set path using parse_uri
    
    * move import to local
    
    * debug
    
    * typo
    
    * debug
    
    * more debug
    
    * debug
    
    * clean up
    
    * back to debug
    
    * typo
    
    * add slash
    
    * move slash to verify_uc_path
    
    * moved back
    
    * debug
    
    * more debug
    
    * clean up
    
    * pyright
    
    * generic error messaging
    
    * move NotFound up
    
    * dbfs directory check
    
    * precommit
    
    ---------
    
    Co-authored-by: Daniel King <[email protected]>
    Co-authored-by: v-chen_data <[email protected]>
    3 people authored Jun 15, 2024
    Configuration menu
    Copy the full SHA
    1ff6c5b View commit details
    Browse the repository at this point in the history

Commits on Jun 16, 2024

  1. bump mlflow (mosaicml#1285)

    Co-authored-by: v-chen_data <[email protected]>
    Co-authored-by: Daniel King <[email protected]>
    Co-authored-by: Mihir Patel <[email protected]>
    4 people authored Jun 16, 2024
    Configuration menu
    Copy the full SHA
    ca528d5 View commit details
    Browse the repository at this point in the history

Commits on Jun 17, 2024

  1. Configuration menu
    Copy the full SHA
    f8b2875 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    618db6f View commit details
    Browse the repository at this point in the history

Commits on Jun 18, 2024

  1. Configuration menu
    Copy the full SHA
    c23be4a View commit details
    Browse the repository at this point in the history

Commits on Jun 19, 2024

  1. added systemMetricsMonitor callback (mosaicml#1260)

    * added systemMetricsMonitor callback
    
    * added unit testing for building system_metrics_monitor
    
    * temporarily removing flag check in unit test for system_metrics_monitor build
    
    * removed end of line formatting issue
    
    * fixed end of line error in __init__.py
    
    * fixed extra line in init
    
    ---------
    
    Co-authored-by: Daniel King <[email protected]>
    Co-authored-by: Mihir Patel <[email protected]>
    3 people authored Jun 19, 2024
    Configuration menu
    Copy the full SHA
    4b1fecb View commit details
    Browse the repository at this point in the history

Commits on Jun 20, 2024

  1. Configuration menu
    Copy the full SHA
    8241f9c View commit details
    Browse the repository at this point in the history

Commits on Jun 21, 2024

  1. Configuration menu
    Copy the full SHA
    78e4cc6 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    13bd8f9 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    e8ba9b7 View commit details
    Browse the repository at this point in the history
  4. Bump composer to 0.23.4 (mosaicml#1297)

    * bump composer
    
    * bump composer
    mvpatel2000 authored Jun 21, 2024
    Configuration menu
    Copy the full SHA
    129e3e1 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    2196d07 View commit details
    Browse the repository at this point in the history

Commits on Jun 23, 2024

  1. Configuration menu
    Copy the full SHA
    8b5a1bb View commit details
    Browse the repository at this point in the history

Commits on Jun 24, 2024

  1. Add Retries to run_query (mosaicml#1302)

    * add retry
    
    * pyright
    
    * slight refactor
    
    ---------
    
    Co-authored-by: v-chen_data <[email protected]>
    KuuCi and v-chen_data authored Jun 24, 2024
    Configuration menu
    Copy the full SHA
    fd7b187 View commit details
    Browse the repository at this point in the history
  2. Bumping mlflow version to include buffering (mosaicml#1303)

    * bumping mlflow version to include buffering
    
    * capping at mlflow 2.15
    JackZ-db authored Jun 24, 2024
    Configuration menu
    Copy the full SHA
    2267bc7 View commit details
    Browse the repository at this point in the history
  3. Ignore mosaicml logger for exception if excephook is active (mosaicml…

    …#1301)
    
    * ignore logger if excephook is active
    
    * remove logger in data scripts and callback
    
    * undo format of imports
    
    * moved env var check into helper
    
    * formatted
    
    * removed import format
    
    * added docstring
    
    * ran pre-commit
    
    ---------
    
    Co-authored-by: Daniel King <[email protected]>
    jjanezhang and dakinggg authored Jun 24, 2024
    Configuration menu
    Copy the full SHA
    21c9e0a View commit details
    Browse the repository at this point in the history

Commits on Jun 25, 2024

  1. Configuration menu
    Copy the full SHA
    ef14849 View commit details
    Browse the repository at this point in the history

Commits on Jun 26, 2024

  1. Configuration menu
    Copy the full SHA
    2412b59 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    bbfebda View commit details
    Browse the repository at this point in the history

Commits on Jun 27, 2024

  1. Configuration menu
    Copy the full SHA
    901eee3 View commit details
    Browse the repository at this point in the history
  2. update (mosaicml#1307)

    dakinggg authored Jun 27, 2024
    Configuration menu
    Copy the full SHA
    3edce07 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    14348fa View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    472d009 View commit details
    Browse the repository at this point in the history
  5. 2.3.1 (mosaicml#1312)

    dakinggg authored Jun 27, 2024
    Configuration menu
    Copy the full SHA
    f141ee1 View commit details
    Browse the repository at this point in the history

Commits on Jun 29, 2024

  1. Configuration menu
    Copy the full SHA
    0ebd7c9 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    88511f7 View commit details
    Browse the repository at this point in the history

Commits on Jun 30, 2024

  1. Allows interweaving of arbitrary kinds of 'attention' layers, like sl…

    …iding window, reuse prev layer kv cache etc. (mosaicml#1299)
    
    * [WIP] Allows interweaving of arbitrary kinds of 'attention' layers, like RNN, sliding window etc.
    
    * lint
    
    * applying overrides to blocks rather than just attentions
    
    * add docstring
    
    * minor
    
    * changing yaml specification style
    
    * ..
    
    * fixes
    
    * fix
    
    * fix
    
    * fix
    
    * refactoring
    
    * add warning
    
    * compute only query vector when reusing kv
    
    * refactor
    
    * fixing
    
    * adding test for reusing previous layer kv cache
    
    * adding error messages
    
    * ..
    
    * adding test
    
    * add logging
    
    * adding logging
    
    * minor
    
    * bug fix, adding test
    
    * minor
    
    * addressing some comments
    
    * addressing some comments
    
    * setting absolute absolute value for reuse_kv_layer_idx
    
    * lint
    
    * adding tests for override_block_args
    
    * adding error if reusing kv cache from a mismatch layer
    
    * fixing test
    
    * fixing code, test
    
    * fix
    
    * ..
    
    * refactoring
    
    * fix
    
    * ..
    
    * ..
    
    * ..
    
    * refactoring
    
    * ..
    
    * ..
    
    * ..
    
    * adding test for _get_modules_order_expanded
    
    * fixing test
    
    * fixing test
    
    * lint
    
    * lint
    
    * adding test
    
    * addressing comment
    
    * ..
    
    * fixing test
    
    * changing yaml format
    
    * fix configuation
    
    * fixing test
    
    * allowing repeat at top level
    
    * allowing overriding error
    
    * addressing comments
    
    * lint
    
    * addressing comments
    
    * fix
    
    * ..
    
    * ..
    
    * ..
    
    * ..
    
    * ..
    
    * addressing comment
    
    * fixing test
    ShashankMosaicML authored Jun 30, 2024
    Configuration menu
    Copy the full SHA
    8604bba View commit details
    Browse the repository at this point in the history

Commits on Jul 1, 2024

  1. Add optional logging of text output to EvalOutputLogging (mosaicml#1283)

    ---------
    
    Co-authored-by: Mihir Patel <[email protected]>
    Co-authored-by: Daniel King <[email protected]>
    3 people authored Jul 1, 2024
    Configuration menu
    Copy the full SHA
    68c2625 View commit details
    Browse the repository at this point in the history

Commits on Jul 3, 2024

  1. Configuration menu
    Copy the full SHA
    742f340 View commit details
    Browse the repository at this point in the history

Commits on Jul 5, 2024

  1. Configuration menu
    Copy the full SHA
    e99ec07 View commit details
    Browse the repository at this point in the history
  2. wip

    Abhinav Venigalla authored and hlahkar committed Jul 5, 2024
    Configuration menu
    Copy the full SHA
    48b239e View commit details
    Browse the repository at this point in the history
  3. add act ckpt

    abhi-mosaic authored and hlahkar committed Jul 5, 2024
    Configuration menu
    Copy the full SHA
    a76d824 View commit details
    Browse the repository at this point in the history
  4. fix

    abhi-databricks authored and hlahkar committed Jul 5, 2024
    Configuration menu
    Copy the full SHA
    d238edc View commit details
    Browse the repository at this point in the history
  5. cleanup

    abhi-mosaic authored and hlahkar committed Jul 5, 2024
    Configuration menu
    Copy the full SHA
    b723c25 View commit details
    Browse the repository at this point in the history
  6. update README

    abhi-mosaic authored and hlahkar committed Jul 5, 2024
    Configuration menu
    Copy the full SHA
    820221a View commit details
    Browse the repository at this point in the history
  7. update reqs to 1.13

    abhi-mosaic authored and hlahkar committed Jul 5, 2024
    Configuration menu
    Copy the full SHA
    a69273d View commit details
    Browse the repository at this point in the history
  8. Configuration menu
    Copy the full SHA
    8ec8716 View commit details
    Browse the repository at this point in the history

Commits on Jul 19, 2024

  1. Configuration menu
    Copy the full SHA
    de1240d View commit details
    Browse the repository at this point in the history
  2. Update ds_gaudi.sh

    ckvermaAI authored and hlahkar committed Jul 19, 2024
    Configuration menu
    Copy the full SHA
    51beeb0 View commit details
    Browse the repository at this point in the history

Commits on Jul 26, 2024

  1. Update README

    hlahkar committed Jul 26, 2024
    Configuration menu
    Copy the full SHA
    1939ae4 View commit details
    Browse the repository at this point in the history