-
Notifications
You must be signed in to change notification settings - Fork 531
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Upgrade to v0.10.0 #1427
base: habana_alpha
Are you sure you want to change the base?
Upgrade to v0.10.0 #1427
Commits on Feb 15, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 2e8982e - Browse repository at this point
Copy the full SHA 2e8982eView commit details -
Configuration menu - View commit details
-
Copy full SHA for 1ef7409 - Browse repository at this point
Copy the full SHA 1ef7409View commit details -
Configuration menu - View commit details
-
Copy full SHA for e0756e1 - Browse repository at this point
Copy the full SHA e0756e1View commit details
Commits on Feb 16, 2024
-
Configuration menu - View commit details
-
Copy full SHA for da2c863 - Browse repository at this point
Copy the full SHA da2c863View commit details -
Configuration menu - View commit details
-
Copy full SHA for 3a99270 - Browse repository at this point
Copy the full SHA 3a99270View commit details -
Async callback: Don't skip checkpoints, reliably only launch async ev…
…al when the checkpoint is ready (mosaicml#813) * working without sharded checkpointing.. * add more debugs * try this * more debugging * yikes dumb bug * add notes * fixes * remove prints * small updates * fix typo * refactor * fix docstring formatting * fighting with docstrings * try this * add unit tests * point to composer update * values -> items * serialize time * fix merge * nits * warning, small comment update * add error --------- Co-authored-by: Daniel King <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 6e3842b - Browse repository at this point
Copy the full SHA 6e3842bView commit details
Commits on Feb 21, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 2431730 - Browse repository at this point
Copy the full SHA 2431730View commit details
Commits on Feb 22, 2024
-
do not mention 1.13 in readme (mosaicml#988)
Co-authored-by: Daniel King <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 63c88d0 - Browse repository at this point
Copy the full SHA 63c88d0View commit details -
Configuration menu - View commit details
-
Copy full SHA for dff2cf4 - Browse repository at this point
Copy the full SHA dff2cf4View commit details -
Configuration menu - View commit details
-
Copy full SHA for 386ae36 - Browse repository at this point
Copy the full SHA 386ae36View commit details
Commits on Feb 23, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 2478f0a - Browse repository at this point
Copy the full SHA 2478f0aView commit details -
if condition in tie weights added (mosaicml#989)
* if condition in tie weights added * unit test for tie weights
Configuration menu - View commit details
-
Copy full SHA for e5fffac - Browse repository at this point
Copy the full SHA e5fffacView commit details
Commits on Feb 24, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 44fd365 - Browse repository at this point
Copy the full SHA 44fd365View commit details
Commits on Feb 27, 2024
-
Configuration menu - View commit details
-
Copy full SHA for d527c9b - Browse repository at this point
Copy the full SHA d527c9bView commit details -
add oom observer callback (mosaicml#932)
* add oom observer callback * fix format
Configuration menu - View commit details
-
Copy full SHA for b082511 - Browse repository at this point
Copy the full SHA b082511View commit details -
Configuration menu - View commit details
-
Copy full SHA for e3f214e - Browse repository at this point
Copy the full SHA e3f214eView commit details -
Revert "Change ci/cd to use ci-testing repo"
This reverts commit e3f214e.
Configuration menu - View commit details
-
Copy full SHA for 5abbca0 - Browse repository at this point
Copy the full SHA 5abbca0View commit details
Commits on Feb 29, 2024
-
Use ci-testing repo (mosaicml#1000)
Co-authored-by: Irene Dea <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 2436c00 - Browse repository at this point
Copy the full SHA 2436c00View commit details
Commits on Mar 1, 2024
-
Configuration menu - View commit details
-
Copy full SHA for d104d16 - Browse repository at this point
Copy the full SHA d104d16View commit details
Commits on Mar 2, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 2dea737 - Browse repository at this point
Copy the full SHA 2dea737View commit details
Commits on Mar 4, 2024
-
Deprecate triton, prefix lm, llama attention patch, and text denoisin…
…g; Make ComposerHFT5 experimental (mosaicml#1007) * Deprecate features and mark experimental * fix typo --------- Co-authored-by: Daniel King <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 3880d04 - Browse repository at this point
Copy the full SHA 3880d04View commit details
Commits on Mar 5, 2024
-
add magic filename for sharded state dicts (mosaicml#1001)
* add magic filename for sharded state dicts * Update scripts/train/train.py Co-authored-by: Daniel King <[email protected]> * oops forgot to push this * no shard if no fsdp * default to full on foundry --------- Co-authored-by: Daniel King <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 96c8218 - Browse repository at this point
Copy the full SHA 96c8218View commit details -
Configuration menu - View commit details
-
Copy full SHA for cbdddf0 - Browse repository at this point
Copy the full SHA cbdddf0View commit details -
Fix evaluators actually pulling eval metrics (mosaicml#1006)
* fix bug on metrics * lint * lint * add unit test * lint
Configuration menu - View commit details
-
Copy full SHA for 09ff550 - Browse repository at this point
Copy the full SHA 09ff550View commit details -
Configuration menu - View commit details
-
Copy full SHA for fd8cbaf - Browse repository at this point
Copy the full SHA fd8cbafView commit details -
Configuration menu - View commit details
-
Copy full SHA for 5728969 - Browse repository at this point
Copy the full SHA 5728969View commit details
Commits on Mar 6, 2024
-
Bump min torch pin (mosaicml#1013)
Red button because CI running jobs it doesn't need. Tests passed on main.
Configuration menu - View commit details
-
Copy full SHA for f4f6414 - Browse repository at this point
Copy the full SHA f4f6414View commit details -
Configuration menu - View commit details
-
Copy full SHA for cf0f5e5 - Browse repository at this point
Copy the full SHA cf0f5e5View commit details -
Configuration menu - View commit details
-
Copy full SHA for 86c8746 - Browse repository at this point
Copy the full SHA 86c8746View commit details -
add default for eval interval (mosaicml#987)
Co-authored-by: Daniel King <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 5261a55 - Browse repository at this point
Copy the full SHA 5261a55View commit details
Commits on Mar 7, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 93d7a05 - Browse repository at this point
Copy the full SHA 93d7a05View commit details -
Add deeper support for multi-turn chats and loss-generating tokens in…
… finetuning (mosaicml#985) The main purpose of this PR is to support training on non-terminal responses in multi-round chats. This is achieved by tokenizing at the level of conversation "turns" and exposing some options for what turns are used as training targets (i.e. generate loss). This also adds support for treating prompt tokens as loss-generating. The script for converting a finetuning dataset to streaming has also been updated (with some bug fixes).
Configuration menu - View commit details
-
Copy full SHA for 64212cb - Browse repository at this point
Copy the full SHA 64212cbView commit details
Commits on Mar 8, 2024
-
Configuration menu - View commit details
-
Copy full SHA for c2aec30 - Browse repository at this point
Copy the full SHA c2aec30View commit details -
Configuration menu - View commit details
-
Copy full SHA for 2b17497 - Browse repository at this point
Copy the full SHA 2b17497View commit details
Commits on Mar 10, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 36ab1ba - Browse repository at this point
Copy the full SHA 36ab1baView commit details
Commits on Mar 11, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 2fc5d33 - Browse repository at this point
Copy the full SHA 2fc5d33View commit details -
Configuration menu - View commit details
-
Copy full SHA for d61c53d - Browse repository at this point
Copy the full SHA d61c53dView commit details -
Co-authored-by: Max Marion <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 4e43792 - Browse repository at this point
Copy the full SHA 4e43792View commit details
Commits on Mar 12, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 257c25d - Browse repository at this point
Copy the full SHA 257c25dView commit details
Commits on Mar 13, 2024
-
Fix typo in monolithic chkpt callback docs (mosaicml#1024)
* Fix typo in monolithic chkpt callback docs * reorder to match function signature
Configuration menu - View commit details
-
Copy full SHA for 4e8a875 - Browse repository at this point
Copy the full SHA 4e8a875View commit details -
Allow code-quality workflow to be callable (mosaicml#1026)
Reverts part of the change made in https://github.com/mosaicml/llm-foundry/pull/1000/files#diff-4a2765c2cfcbd3804a66aab805cb92ddda74de1730923cc5bf53671d0beccf06L11
Configuration menu - View commit details
-
Copy full SHA for 14e2dec - Browse repository at this point
Copy the full SHA 14e2decView commit details
Commits on Mar 14, 2024
-
Configuration menu - View commit details
-
Copy full SHA for cffd75e - Browse repository at this point
Copy the full SHA cffd75eView commit details
Commits on Mar 15, 2024
-
Configuration menu - View commit details
-
Copy full SHA for c88169d - Browse repository at this point
Copy the full SHA c88169dView commit details
Commits on Mar 18, 2024
-
Configuration menu - View commit details
-
Copy full SHA for c173dd3 - Browse repository at this point
Copy the full SHA c173dd3View commit details -
5
Configuration menu - View commit details
-
Copy full SHA for c26309d - Browse repository at this point
Copy the full SHA c26309dView commit details
Commits on Mar 20, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 9bf3701 - Browse repository at this point
Copy the full SHA 9bf3701View commit details
Commits on Mar 21, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 3348b59 - Browse repository at this point
Copy the full SHA 3348b59View commit details
Commits on Mar 22, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 26a5fd4 - Browse repository at this point
Copy the full SHA 26a5fd4View commit details
Commits on Mar 23, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 31e4879 - Browse repository at this point
Copy the full SHA 31e4879View commit details
Commits on Mar 24, 2024
-
Configuration menu - View commit details
-
Copy full SHA for c9685cf - Browse repository at this point
Copy the full SHA c9685cfView commit details -
Configuration menu - View commit details
-
Copy full SHA for 94a05bd - Browse repository at this point
Copy the full SHA 94a05bdView commit details -
Configuration menu - View commit details
-
Copy full SHA for 813d596 - Browse repository at this point
Copy the full SHA 813d596View commit details -
Configuration menu - View commit details
-
Copy full SHA for 67dcab9 - Browse repository at this point
Copy the full SHA 67dcab9View commit details -
Bump Composer to 0.21.1 (mosaicml#1053)
* bump composer * accounted for bug fix * changed fsdp_elastic_sharded_enabled to fsdp_sharded_state_dict_enabled in the AsyncEvalCallback --------- Co-authored-by: Daniel King <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 5c8a829 - Browse repository at this point
Copy the full SHA 5c8a829View commit details -
Configuration menu - View commit details
-
Copy full SHA for 32e14a6 - Browse repository at this point
Copy the full SHA 32e14a6View commit details -
Configuration menu - View commit details
-
Copy full SHA for 648b1bd - Browse repository at this point
Copy the full SHA 648b1bdView commit details -
Configuration menu - View commit details
-
Copy full SHA for b9a2de6 - Browse repository at this point
Copy the full SHA b9a2de6View commit details -
Configuration menu - View commit details
-
Copy full SHA for 8f25c18 - Browse repository at this point
Copy the full SHA 8f25c18View commit details
Commits on Mar 25, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 2d65fc2 - Browse repository at this point
Copy the full SHA 2d65fc2View commit details -
Configuration menu - View commit details
-
Copy full SHA for e590acf - Browse repository at this point
Copy the full SHA e590acfView commit details
Commits on Mar 26, 2024
-
Custom Exceptions for Mosaic Logger (mosaicml#1014)
* flagged areas to throw ft errors + added custom exception * added exceptions for all files * fixed type of invalid type errors * fixed merge * added exceptions for all files * fixed merge * added a catch for missing hf url * expain dataloader catch to all error types * added tests for custom exceptions * fixed a couple tests * fixed some more tests * addressed comments * fixed formatting, updated split correction, moved logger setup into helper * added check for repeating roles * more strict checking for prompt response type * removed some cases we don't need * updated name of unknown conversation type error * formatting * Fix multi model eval (mosaicml#1055) * resolved merge conflict * formatted * added back two imports * formatting changes * formatted again * sorted imports * diable yapf for exceptions import * disabled yapf on test dataloader --------- Co-authored-by: Daniel King <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 0ef7cd6 - Browse repository at this point
Copy the full SHA 0ef7cd6View commit details -
Bump version to 0.7.0 (mosaicml#1063)
* Bump version * Remove triton (mosaicml#1062) * Remove github action workflows for version bumps * Fix cpu test issues * code quality * Fix gpu tests * Fix gpu tests nicely * Remove z-loss (mosaicml#1064) * Remove prefix lm and denoising (mosaicml#1065) * Remove hf_prefix_lm * Remove prefix_lm from mpt modeling * Remove bidirectional mask * Remove text denoising dataloading * Remove adapt tokenizer * Remove llama attention patch (mosaicml#1066) * Remove bidirectional mask in tests * Fix test_hf_config_override with patch
Configuration menu - View commit details
-
Copy full SHA for 7f0fdae - Browse repository at this point
Copy the full SHA 7f0fdaeView commit details
Commits on Mar 27, 2024
-
Configuration menu - View commit details
-
Copy full SHA for e42ea70 - Browse repository at this point
Copy the full SHA e42ea70View commit details -
Fix context printing (mosaicml#1068)
* Fix comparison in train for hf_t5 * Add back context printing in dataloader.py
Configuration menu - View commit details
-
Copy full SHA for f044d6c - Browse repository at this point
Copy the full SHA f044d6cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 8a69bd7 - Browse repository at this point
Copy the full SHA 8a69bd7View commit details -
Configuration menu - View commit details
-
Copy full SHA for 28467bb - Browse repository at this point
Copy the full SHA 28467bbView commit details
Commits on Mar 28, 2024
-
Output eval logging batch (mosaicml#961)
* Skip flaky lion8b test (mosaicml#598) * relax atol and add retries to reduce flakiness in lion8b timing test * add eval output logging * add back tasks * foo * add rlhf prompts * add rlhf prompts * add rlhf prompts * add rlhf prompts * add rlhf prompts * fix prompt * fix prompt * modify mcli * test * test * fix * fix merge * wip * merge * reset files, wip commit * rm small changes * reduce changes * reduce changes * . * wip * rm batch keys * revert init device * linting * add import * fix import * add eval_output_logging to registry * readd import * pyright + linting --------- Co-authored-by: dblalock <[email protected]> Co-authored-by: Jeremy Dohmann <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 7a8a156 - Browse repository at this point
Copy the full SHA 7a8a156View commit details
Commits on Apr 1, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 349c2ff - Browse repository at this point
Copy the full SHA 349c2ffView commit details -
Check the user provided eos / bos token id against the tokenizer eos …
…/ bos token id (mosaicml#1039) * lint * lint * added warning and error message instead of setting the eos and bos token ids * Update text_data.py Adding info about the override flags in the error message. * Update llmfoundry/data/text_data.py Co-authored-by: Vitaliy Chiley <[email protected]> * Update llmfoundry/data/text_data.py Co-authored-by: Vitaliy Chiley <[email protected]> * adding warning if user does not provide eos or bos token id * adding warning if user does not provide eos or bos token id --------- Co-authored-by: Vitaliy Chiley <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for d8ea2c5 - Browse repository at this point
Copy the full SHA d8ea2c5View commit details
Commits on Apr 2, 2024
-
Triton RMSNorm (mosaicml#1050)
* Triton RMS Norm * fix * refactor * Fix test * no bias * Update tests/models/test_model.py Co-authored-by: Daniel King <[email protected]> * fixes * Unittest * fix * fix2 * fix3 * fix3 --------- Co-authored-by: Daniel King <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for b765b47 - Browse repository at this point
Copy the full SHA b765b47View commit details -
Fix tiktoken vocab size (mosaicml#1081)
* fix vocab size bug * more tests
Configuration menu - View commit details
-
Copy full SHA for caf7fda - Browse repository at this point
Copy the full SHA caf7fdaView commit details -
Doing the loss reduction in foundry instead of in the loss functions. (…
…mosaicml#1079) * setting loss_fn reduction to None * fixing a unit test * add error message * adding test to check reduction * adding test to check reduction * Update llmfoundry/models/mpt/modeling_mpt.py Co-authored-by: Vitaliy Chiley <[email protected]> * preserving batch dimension of targets * minor change --------- Co-authored-by: Vitaliy Chiley <[email protected]> Co-authored-by: Daniel King <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 632cb73 - Browse repository at this point
Copy the full SHA 632cb73View commit details -
Configuration menu - View commit details
-
Copy full SHA for 580a4b0 - Browse repository at this point
Copy the full SHA 580a4b0View commit details -
Configuration menu - View commit details
-
Copy full SHA for 394735b - Browse repository at this point
Copy the full SHA 394735bView commit details
Commits on Apr 3, 2024
-
Configuration menu - View commit details
-
Copy full SHA for d452c60 - Browse repository at this point
Copy the full SHA d452c60View commit details -
Configuration menu - View commit details
-
Copy full SHA for 23c3173 - Browse repository at this point
Copy the full SHA 23c3173View commit details
Commits on Apr 4, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 5455b40 - Browse repository at this point
Copy the full SHA 5455b40View commit details -
Configuration menu - View commit details
-
Copy full SHA for c766cf9 - Browse repository at this point
Copy the full SHA c766cf9View commit details -
log packing ratio progress (mosaicml#1070)
* log packing ratio progress * use tqdm to show progress of packing ratio profiling * fix pbar * Update packing.py * added log to search for * fix * good old fashioned progress logging * Update packing.py
Configuration menu - View commit details
-
Copy full SHA for e70e424 - Browse repository at this point
Copy the full SHA e70e424View commit details -
Configuration menu - View commit details
-
Copy full SHA for 26f0619 - Browse repository at this point
Copy the full SHA 26f0619View commit details -
fix typo in expandable_segments (mosaicml#1088)
Co-authored-by: Daniel King <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 06ff95f - Browse repository at this point
Copy the full SHA 06ff95fView commit details -
Configuration menu - View commit details
-
Copy full SHA for 607b982 - Browse repository at this point
Copy the full SHA 607b982View commit details -
Configuration menu - View commit details
-
Copy full SHA for f18768b - Browse repository at this point
Copy the full SHA f18768bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 96b27c5 - Browse repository at this point
Copy the full SHA 96b27c5View commit details -
Configuration menu - View commit details
-
Copy full SHA for 94301cd - Browse repository at this point
Copy the full SHA 94301cdView commit details
Commits on Apr 5, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 60a1ab4 - Browse repository at this point
Copy the full SHA 60a1ab4View commit details -
Configuration menu - View commit details
-
Copy full SHA for b81897a - Browse repository at this point
Copy the full SHA b81897aView commit details
Commits on Apr 8, 2024
-
Configuration menu - View commit details
-
Copy full SHA for d0d9434 - Browse repository at this point
Copy the full SHA d0d9434View commit details
Commits on Apr 9, 2024
-
Updating the streaming version in setup.py (mosaicml#1103)
* updating the streaming version in setup.py * updating constructor call to StreamingDataset * making allow_unsafe_types and replication configurable through dataset configs * adding docstring
Configuration menu - View commit details
-
Copy full SHA for 2939cc9 - Browse repository at this point
Copy the full SHA 2939cc9View commit details -
MegaBlocks release (mosaicml#1102)
* [Stage] Megablocks release (mosaicml#241) * V1 of MegaBlocks --------- * fix hf ckptr * rename * lint * lint --------- Co-authored-by: Abhinav Venigalla <[email protected]> Co-authored-by: Sasha Doubov <[email protected]> Co-authored-by: Cheng Li <[email protected]> Co-authored-by: Ning Wang <[email protected]> Co-authored-by: Irene Dea <[email protected]> Co-authored-by: Shashank Rajput <[email protected]> Co-authored-by: Chuck Tang <[email protected]> Co-authored-by: Jose Javier <[email protected]> Co-authored-by: Angel Ruiz <[email protected]> Co-authored-by: Denny Lee <[email protected]> Co-authored-by: Jane Zhang <[email protected]> Co-authored-by: Daniel King <[email protected]> Co-authored-by: Chuck Tang <[email protected]> Co-authored-by: Vitaliy Chiley <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 53160f4 - Browse repository at this point
Copy the full SHA 53160f4View commit details -
Configuration menu - View commit details
-
Copy full SHA for f12bc8a - Browse repository at this point
Copy the full SHA f12bc8aView commit details
Commits on Apr 10, 2024
-
Update config_moe_args.py (mosaicml#1104)
https://databricks.atlassian.net/browse/GRT-2812 see [here](https://github.com/mosaicml/llm-foundry-private/pull/245#issuecomment-2046306845) Ran 500 steps, new version did marginally better
Configuration menu - View commit details
-
Copy full SHA for 17f8aeb - Browse repository at this point
Copy the full SHA 17f8aebView commit details -
Configuration menu - View commit details
-
Copy full SHA for 7337429 - Browse repository at this point
Copy the full SHA 7337429View commit details -
Configuration menu - View commit details
-
Copy full SHA for b5fc0fa - Browse repository at this point
Copy the full SHA b5fc0faView commit details
Commits on Apr 11, 2024
-
Support ShareGPT chat format (mosaicml#1098)
This adds support for the other common chat format. We just remap keys and add a new role.
Configuration menu - View commit details
-
Copy full SHA for 4cd2324 - Browse repository at this point
Copy the full SHA 4cd2324View commit details
Commits on Apr 12, 2024
-
Configuration menu - View commit details
-
Copy full SHA for ed3daef - Browse repository at this point
Copy the full SHA ed3daefView commit details -
Configuration menu - View commit details
-
Copy full SHA for 560012b - Browse repository at this point
Copy the full SHA 560012bView commit details -
Dbrx finetune yaml requires save folder specified to enable autoresume (
mosaicml#1108) * Update dbrx-full-ft.yaml * fix
Configuration menu - View commit details
-
Copy full SHA for e9b1c6e - Browse repository at this point
Copy the full SHA e9b1c6eView commit details -
Revert "Update config_moe_args.py (mosaicml#1104)" (mosaicml#1111)
This reverts commit 17f8aeb.
Configuration menu - View commit details
-
Copy full SHA for b58d68c - Browse repository at this point
Copy the full SHA b58d68cView commit details -
Update config_moe_args.py (mosaicml#1112)
mosaicml#1111 needed to revert mosaicml#1104 because the mosaicml#1104 PR caused issues. Removing TODO and marking Jira with wont-do
Configuration menu - View commit details
-
Copy full SHA for 6257e5b - Browse repository at this point
Copy the full SHA 6257e5bView commit details -
Migrate ICL classes to foundry (mosaicml#936)
* start * still need to migrate fixtures * wip onboarding tests * still workin' * still wip * maybe done; test out on mcli now * mcli * remove calibration error * migration * migration * full migration * precommit * fix * fix pytests * refactor QA * update * restore * add * fix * wip * update readme * final pyright * done * pass prelimiter into ALL the ICL task datasets * allow QA task name stil lfor backward compatibility * fix * fix test * add generation length * remove max_new_tokens * fix cpu trsts * try and fix lm eval test * temp disable lm task eval test * fix test? * fix tet * finish * fix * Update scripts/eval/README.md Co-authored-by: Daniel King <[email protected]> * fix comments * fix bug with seq len * restore mcli * merge * fix builder * add deprecation warning * add deprecation warning * merge * merge * add logging necessities to nlp.py * add attention_mask test update * fix generation_length in tests * fix bug * restore yamls * fix typos * add deprecation warning for code * pyright wip * fix pyright * fix pyright error again * fix pyright * fix pyright * update version --------- Co-authored-by: Eitan Turok <[email protected]> Co-authored-by: Max Marion <[email protected]> Co-authored-by: Daniel King <[email protected]> Co-authored-by: Max Marion <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 3729ba3 - Browse repository at this point
Copy the full SHA 3729ba3View commit details -
Configuration menu - View commit details
-
Copy full SHA for cb0de4f - Browse repository at this point
Copy the full SHA cb0de4fView commit details
Commits on Apr 13, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 676ad7f - Browse repository at this point
Copy the full SHA 676ad7fView commit details -
Configuration menu - View commit details
-
Copy full SHA for f01f625 - Browse repository at this point
Copy the full SHA f01f625View commit details
Commits on Apr 18, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 84b6410 - Browse repository at this point
Copy the full SHA 84b6410View commit details -
Configuration menu - View commit details
-
Copy full SHA for 4bb4d4a - Browse repository at this point
Copy the full SHA 4bb4d4aView commit details
Commits on Apr 19, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 20cb40c - Browse repository at this point
Copy the full SHA 20cb40cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 63a7f12 - Browse repository at this point
Copy the full SHA 63a7f12View commit details -
Configuration menu - View commit details
-
Copy full SHA for 698206d - Browse repository at this point
Copy the full SHA 698206dView commit details -
catch misconfigured hf dataset (mosaicml#1123)
* catch misconfigured hf dataset * update unit tests * removed unused import * use new dataset * Update llmfoundry/data/finetuning/tasks.py Co-authored-by: Jane Zhang <[email protected]> * Update llmfoundry/utils/exceptions.py Co-authored-by: Daniel King <[email protected]> --------- Co-authored-by: Daniel King <[email protected]> Co-authored-by: Jane Zhang <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for f0646e8 - Browse repository at this point
Copy the full SHA f0646e8View commit details
Commits on Apr 20, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 3426415 - Browse repository at this point
Copy the full SHA 3426415View commit details -
Configuration menu - View commit details
-
Copy full SHA for 6caa75a - Browse repository at this point
Copy the full SHA 6caa75aView commit details
Commits on Apr 22, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 0c6bd75 - Browse repository at this point
Copy the full SHA 0c6bd75View commit details -
Configuration menu - View commit details
-
Copy full SHA for 4952183 - Browse repository at this point
Copy the full SHA 4952183View commit details
Commits on Apr 23, 2024
-
Configuration menu - View commit details
-
Copy full SHA for c53622e - Browse repository at this point
Copy the full SHA c53622eView commit details -
Update JSONL sources in eval README (mosaicml#1110)
Co-authored-by: Daniel King <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 0d62e61 - Browse repository at this point
Copy the full SHA 0d62e61View commit details
Commits on Apr 24, 2024
-
Mlflow datasets (mosaicml#1119)
* add logger * reqs * small fix * import mlflow * parse_uri * parse_uri * finished debug * precommit * more code fix * revert setup * better dovs * rm docstr * precommit * Update tests to not rely on mistral (mosaicml#1117) * Bump transformers to 4.40 (mosaicml#1118) * merge * revert setup * precommit * precommit * tweaks to resolve comments * unit test * code quality * quotation * quote * more quality * optional * pyright * type check * rm typechecking * yapf * first pass * fix * get refactor * refactor * local hf path * dbfs * rm local * typo * second pass * update * third pass * os.path.join * precommit --------- Co-authored-by: Daniel King <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 72da1d7 - Browse repository at this point
Copy the full SHA 72da1d7View commit details -
Configuration menu - View commit details
-
Copy full SHA for 6252f79 - Browse repository at this point
Copy the full SHA 6252f79View commit details -
Configuration menu - View commit details
-
Copy full SHA for 76f74b6 - Browse repository at this point
Copy the full SHA 76f74b6View commit details
Commits on Apr 25, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 15abf8c - Browse repository at this point
Copy the full SHA 15abf8cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 24f65fd - Browse repository at this point
Copy the full SHA 24f65fdView commit details -
Revert "First initialize dist with gloo (mosaicml#1133)" (mosaicml#1139)
This reverts commit 76f74b6.
Configuration menu - View commit details
-
Copy full SHA for 4aef5de - Browse repository at this point
Copy the full SHA 4aef5deView commit details -
Configuration menu - View commit details
-
Copy full SHA for 6cfd2a3 - Browse repository at this point
Copy the full SHA 6cfd2a3View commit details
Commits on Apr 26, 2024
-
Configuration menu - View commit details
-
Copy full SHA for f97f02e - Browse repository at this point
Copy the full SHA f97f02eView commit details
Commits on Apr 27, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 8be3254 - Browse repository at this point
Copy the full SHA 8be3254View commit details
Commits on Apr 29, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 704a90a - Browse repository at this point
Copy the full SHA 704a90aView commit details -
Configuration menu - View commit details
-
Copy full SHA for fbcf311 - Browse repository at this point
Copy the full SHA fbcf311View commit details -
build inner model (mosaicml#1147)
* build inner model * remove tokenizer arg * fix namespace overlap * change the type ignore location * PeftModel -> 'PeftModel' * additional fsdp preparation * init device * prepare inner model -> prepare for fsdp * update superclass type sig * added docstring * fix docstring style --------- Co-authored-by: Daniel King <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for de5a394 - Browse repository at this point
Copy the full SHA de5a394View commit details -
fix DatasetConstants.splints default value to protect dictionary over…
…writing (mosaicml#1144) * fix DatasetConstants.splints default value; correct typos for raw_samples amount; correct type hinting for raw_samples * Update scripts/data_prep/convert_dataset_hf.py --------- Co-authored-by: Daniel King <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 738956e - Browse repository at this point
Copy the full SHA 738956eView commit details
Commits on Apr 30, 2024
-
Configuration menu - View commit details
-
Copy full SHA for bd0d1cb - Browse repository at this point
Copy the full SHA bd0d1cbView commit details -
Configuration menu - View commit details
-
Copy full SHA for 6fde283 - Browse repository at this point
Copy the full SHA 6fde283View commit details -
Configuration menu - View commit details
-
Copy full SHA for 63ac1a4 - Browse repository at this point
Copy the full SHA 63ac1a4View commit details -
Configuration menu - View commit details
-
Copy full SHA for b570b61 - Browse repository at this point
Copy the full SHA b570b61View commit details
Commits on May 1, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 2f58965 - Browse repository at this point
Copy the full SHA 2f58965View commit details -
Move sentencepiece import (mosaicml#1157)
* Move sentencepiece import * Update llmfoundry/utils/checkpoint_conversion_helpers.py Co-authored-by: Daniel King <[email protected]> --------- Co-authored-by: Daniel King <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 6561330 - Browse repository at this point
Copy the full SHA 6561330View commit details -
Configuration menu - View commit details
-
Copy full SHA for 5f39606 - Browse repository at this point
Copy the full SHA 5f39606View commit details -
Uncomment GPU tests (mosaicml#1162)
* Uncomment GPU tests * Update pr-gpu.yaml
Configuration menu - View commit details
-
Copy full SHA for fa7a78a - Browse repository at this point
Copy the full SHA fa7a78aView commit details
Commits on May 2, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 124de4c - Browse repository at this point
Copy the full SHA 124de4cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 0d58f46 - Browse repository at this point
Copy the full SHA 0d58f46View commit details -
Bump min torch version to 2.3.0 (mosaicml#1152)
* bump min torch * make random correct less likely * bf16 precisioin * fix * new dataset? * try again * debug * debug * pin hf hub * remove debug --------- Co-authored-by: Milo Cress <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for a3e0fb9 - Browse repository at this point
Copy the full SHA a3e0fb9View commit details -
Configuration menu - View commit details
-
Copy full SHA for 10b7caf - Browse repository at this point
Copy the full SHA 10b7cafView commit details -
refactoring dataloader into registries. (mosaicml#1165)
* refactoring dataloader into registries * lint * fixing get_tokens_per_batch_func import issues * lint * resolving comments * resolving comments --------- Co-authored-by: Daniel King <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 3b82735 - Browse repository at this point
Copy the full SHA 3b82735View commit details -
Migrate eval output logging to foundry (mosaicml#1166)
* initial commit * linting * change callback to import from foundry * add types * linting * linting * pyright * linting * linting
Configuration menu - View commit details
-
Copy full SHA for ddf4aa4 - Browse repository at this point
Copy the full SHA ddf4aa4View commit details
Commits on May 3, 2024
-
Configuration menu - View commit details
-
Copy full SHA for c0d591c - Browse repository at this point
Copy the full SHA c0d591cView commit details -
Configuration menu - View commit details
-
Copy full SHA for bfbb8c5 - Browse repository at this point
Copy the full SHA bfbb8c5View commit details
Commits on May 6, 2024
-
Configuration menu - View commit details
-
Copy full SHA for ab9dde7 - Browse repository at this point
Copy the full SHA ab9dde7View commit details
Commits on May 8, 2024
-
Bump version v0.9.0.dev0 (mosaicml#1181)
* update init.py * deprecate * delete in context learning code eval dataset * remove deprecated export * removed more code_eval stuff * oopsie * guh * fix
Configuration menu - View commit details
-
Copy full SHA for a777014 - Browse repository at this point
Copy the full SHA a777014View commit details -
structuredconfig for train.py and eval.py (mosaicml#1051)
* first commit for structuredconfig for train.py * revamp configs * wip latest issue * reorder so mandatory attributes come first * fix * fix * fix fix * fix types * fix dictconfig * fix union of list|dict configs * fix type annotation * oops * fixed configs * add save ignore keys * fix batch size kerfuffle * fix dictconfig stuff * fix dictconfig stuff again * fix * fix * updated unit tests for variables * last fix? * if this test case does not pass I will venmo Mihir 0 * remove a 'not' -- eg. 'I am not going crazy' * Update scripts/train/train.py Co-authored-by: Daniel King <[email protected]> * set amp bf16 as default precision, etc * temporarily wrap with dictconfig before ** migration * fix icl tasks * fix * fix activation checkpointing reentrant * fix extraneous keys * first round ** * fix? * quick fsdp config fix * updated yamls to make variables explicit * remove precision from mandatory params list * I expect many of these to fail in interesting ways * fix test_model test cases with ** * fix many more test cases * fix dictconfig objectification * fix remaining test cases * remove unneeded ** * fix test case * changed back argument name * fix * ** for finetuning dataloader * fix? * fix dataloader * fix * fix finetuning dataloader * fix build_text_dataloader * left to my own devices * fix packing * fix typo * fix padding test cases * ignore extra parameters and warn * fix style * fix quality checks * fix code quality * pyright-fu * fix * just one more type constraint bro * OmegaConf -> om * rename variables for clarity * revert file * revert file II * revert file III: revert of the sith * peft revert file * revert v_mpt * last revert * remove redundant checks * deprecate * make cleaner * pyright is bullying me again * further clean config_utils * polish train * polish train and eval * fix dist * fix style * organize eval and train * fix * used helper function to make main cleaner * fix stuff * fix pyright * added fix and explanation * fix typo in unit test update smh * Update llmfoundry/registry.py Co-authored-by: Daniel King <[email protected]> * Update scripts/train/train.py Co-authored-by: Daniel King <[email protected]> * Update scripts/train/train.py Co-authored-by: Daniel King <[email protected]> * Update scripts/train/train.py Co-authored-by: Daniel King <[email protected]> * Apply suggestions from code review Co-authored-by: Daniel King <[email protected]> * see if this fails * reject name and device rather than ignoring * pretrained is not a bool * add validation to make sure the user doesn't set both * forbid config keys * oops forgot eval * address coomments * removed redundant check * updated callsites not to use name * fix * validate extraneous keys in dataloader * fix * fix more * fix III: revenge of the fix * fix IV: a new hope * fix V: the empire fixes back * fixed some more types * fix VI: return of the fix * fix VII: the fix awakens * fix VIII: the last bug * fix * final fix I think * fixed * fix style * fix * fix fix * fix fix style * icl task config * fix train * fix finetuning dataloader * fix train types * fix token counting * fix train types * oopsie * fix straggler issues * fix tests * fix??? * fix hf v mpt gpu test and fmapi test * pop device * to_str_dict -> to_dict_recursive * fix this darn unit test one more time * fix ComposerMPTCausalLM constructor invocation * Delete tests/models/hf/test_hf_fsdp.py * unwrap model in unit tests * model.model.model.model.model * abstract away dataclass construction * updated docstrings and removed dictconfig from logging logic * flag icl tasks required or not * updated a couple yamls * updated train and eval scripts * un-delete global train batch size * fix * I don't understand why this doesn't work * that was the sneakiest bug I've ever fixed * try to fix the regression test * remove device train grad accum * fix validate config * removed unused import * use variables * missing mandatory value fix * use correct type of error * fix * import TrainConfig just in case? * moved trainconfig and evalconfig into utils * works * no cheating * dicts everywhere gah * try no recursive just * rename typed helpers * fix the test cases with deep magic * towards a peaceful resolution * remove comments * fix type warnings * Update llmfoundry/utils/config_utils.py Co-authored-by: Daniel King <[email protected]> * address low-hanging fruit * remove peft wrapping extra model * python 🤝 haskell * dataset config should be dict * just because omega starts with OMMMM does not mean it's zen * fix * fix * structured settlement * precision further down * throws TypeError instead of MissingMandatoryValue or whatever * remove debugging statement * remove to_container calls everywhere * wrap then unwrap * pyright * error early on missing mandatory values * remove unnecessory ignore * update unit tests * update eval yamls * Update train.py * make log level optional again * oopsie * use keywords for arg clarity * use keywords for arg clarity * style * style * dist timeout * resolve deeper conflict issues * fix train.py * fix registry * fix dataloader * fix train II * fix dataloader and utils * fix dictconfig * skill issue * add new keys * remove pop_config * fix --------- Co-authored-by: Daniel King <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for cc8351c - Browse repository at this point
Copy the full SHA cc8351cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 46b8bee - Browse repository at this point
Copy the full SHA 46b8beeView commit details -
Refactoring attention (mosaicml#1182)
* refactoring * adding back a function that got deleted by mistake * adding co-authors Co-Authored-By: Vitaliy Chiley <[email protected]> Co-Authored-By: Cheng Li <[email protected]> * adding co-authors Co-Authored-By: Vitaliy Chiley <[email protected]> * adding co-authors Co-authored-by: Vitaliy Chiley <[email protected]> Co-authored-by: Vitaliy Chiley <[email protected]> * Update config_utils.py adding co-authors Co-authored-by: Vitaliy Chiley <[email protected]> Co-authored-by: Vitaliy Chiley <[email protected]> Co-authored-by: Cheng Li <[email protected]> Co-authored-by: Cheng Li <@cli99> * lint Co-authored-by: Vitaliy Chiley <[email protected]> Co-authored-by: Vitaliy Chiley <[email protected]> * Adding_co_authors Co-authored-by: Vitaliy Chiley <[email protected]> Co-authored-by: Vitaliy Chiley <[email protected]> Co-authored-by: Cheng Li <[email protected]> * Update llmfoundry/models/mpt/modeling_mpt.py Co-authored-by: Daniel King <[email protected]> * addressing comments * adding_co_authors Co-authored-by: Cheng Li <[email protected]> * Update llmfoundry/utils/config_utils.py --------- Co-authored-by: Vitaliy Chiley <[email protected]> Co-authored-by: Vitaliy Chiley <[email protected]> Co-authored-by: Cheng Li <[email protected]> Co-authored-by: Daniel King <[email protected]> Co-authored-by: Cheng Li <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for ac563e6 - Browse repository at this point
Copy the full SHA ac563e6View commit details
Commits on May 9, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 0c7bc2a - Browse repository at this point
Copy the full SHA 0c7bc2aView commit details -
Configuration menu - View commit details
-
Copy full SHA for c6679d3 - Browse repository at this point
Copy the full SHA c6679d3View commit details -
Configuration menu - View commit details
-
Copy full SHA for 139abab - Browse repository at this point
Copy the full SHA 139ababView commit details -
Configuration menu - View commit details
-
Copy full SHA for 51d0d09 - Browse repository at this point
Copy the full SHA 51d0d09View commit details -
Configuration menu - View commit details
-
Copy full SHA for 1dd37c5 - Browse repository at this point
Copy the full SHA 1dd37c5View commit details
Commits on May 10, 2024
-
Co-authored-by: Daniel King <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 983234d - Browse repository at this point
Copy the full SHA 983234dView commit details -
Configuration menu - View commit details
-
Copy full SHA for 994209c - Browse repository at this point
Copy the full SHA 994209cView commit details
Commits on May 11, 2024
-
Configuration menu - View commit details
-
Copy full SHA for eef4872 - Browse repository at this point
Copy the full SHA eef4872View commit details
Commits on May 14, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 0449b60 - Browse repository at this point
Copy the full SHA 0449b60View commit details -
Clearer error message for unknown example type (mosaicml#1202)
* error when example is a string * make error message the same * strings only --------- Co-authored-by: Daniel King <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 8274c6c - Browse repository at this point
Copy the full SHA 8274c6cView commit details
Commits on May 15, 2024
-
Added torch_dmoe defaults, bug fixes for 2D inputs (mosaicml#1210)
* defaults for torch dmoe match mb dmoe * defaults for torch dmoe match mb dmoe * defaults for torch dmoe match mb dmoe * defaults for torch dmoe match mb dmoe * defaults for torch dmoe match mb dmoe * defaults for torch dmoe match mb dmoe * defaults for torch dmoe match mb dmoe * top k proper * permute fix * permute fix * permute fix * permute fix * permute fix * narrow down world size bug * narrow down world size bug * narrow down world size bug * narrow down world size bug * narrow down world size bug * narrow down world size bug * narrow down world size bug * narrow down world size bug * narrow down world size bug * narrow down world size bug * narrow down world size bug * narrow down world size bug * narrow down world size bug * narrow down world size bug * blocking dimension bug * done * yo * yo
Configuration menu - View commit details
-
Copy full SHA for b414626 - Browse repository at this point
Copy the full SHA b414626View commit details -
log eval dataset misconfiguration (mosaicml#1179)
* log eval dataset misconfiguration * use context * literally * BaseException -> Exception * use my archaeological skills to find the right python syntax for 3.9 * refactor names for more general use * oops * oops II * context -> location * use variables instead of strings * Update exceptions.py * delete Mapping
Configuration menu - View commit details
-
Copy full SHA for cfee4e4 - Browse repository at this point
Copy the full SHA cfee4e4View commit details -
using self.shift_labels instead of self.model.transformer.shift_label…
…s in loss (mosaicml#1211) Co-authored-by: Daniel King <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 8cd23d5 - Browse repository at this point
Copy the full SHA 8cd23d5View commit details
Commits on May 16, 2024
-
Configuration menu - View commit details
-
Copy full SHA for dc3212e - Browse repository at this point
Copy the full SHA dc3212eView commit details -
TransformerEngine Image Build (mosaicml#1204)
* install * fix * try chucks branch * fix image * pin * fix lint * revert image size * remove verbose * switch to 8wide ---------
Configuration menu - View commit details
-
Copy full SHA for e70891b - Browse repository at this point
Copy the full SHA e70891bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 3fe7f09 - Browse repository at this point
Copy the full SHA 3fe7f09View commit details -
Make
fc_type
a dict to pass fc kwargs through (mosaicml#1201)* fc type as fict * fc type as dict * fc type as dict * rework ffn fc config slightly * rework ffn fc config slightly * merged main * merged main * no circular imports * Update llmfoundry/models/mpt/configuration_mpt.py Co-authored-by: Daniel King <[email protected]> * them configs * yo * linting man * the deep copy --------- Co-authored-by: Daniel King <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 38ae65b - Browse repository at this point
Copy the full SHA 38ae65bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 3a15082 - Browse repository at this point
Copy the full SHA 3a15082View commit details
Commits on May 17, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 77f9ab1 - Browse repository at this point
Copy the full SHA 77f9ab1View commit details
Commits on May 22, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 001e7c3 - Browse repository at this point
Copy the full SHA 001e7c3View commit details -
Add user error superclass (mosaicml#1225)
* Add user error superclass * update class inheritance structure
Configuration menu - View commit details
-
Copy full SHA for 8e29698 - Browse repository at this point
Copy the full SHA 8e29698View commit details -
Configuration menu - View commit details
-
Copy full SHA for c891bed - Browse repository at this point
Copy the full SHA c891bedView commit details -
Quick patch to check that Dataset Keys contain non-None Values (mosai…
…cml#1228) * quick patch * also seperately check for local path * typo * typo --------- Co-authored-by: Daniel King <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 9cc945c - Browse repository at this point
Copy the full SHA 9cc945cView commit details
Commits on May 23, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 9120c27 - Browse repository at this point
Copy the full SHA 9120c27View commit details -
Loss v len callback (mosaicml#1226)
* adding loss v len callback * lint * lint * using model's loss function * lint * only log to mlflow logger * incorporating suggestions * incorporating comments --------- Co-authored-by: Daniel King <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for c213ea8 - Browse repository at this point
Copy the full SHA c213ea8View commit details -
Fixing the state.timestamp.batch.value issue in loss v len callback (m…
…osaicml#1232) * adding print statements * testing fix * fix * removing print statements * minor fix
Configuration menu - View commit details
-
Copy full SHA for 6fa6026 - Browse repository at this point
Copy the full SHA 6fa6026View commit details
Commits on May 24, 2024
-
fix attr error for attention_classes (mosaicml#1230)
Co-authored-by: Daniel King <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for b4bb34c - Browse repository at this point
Copy the full SHA b4bb34cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 09d8892 - Browse repository at this point
Copy the full SHA 09d8892View commit details -
Add example eval scripts for dbrx PT sizes (mosaicml#1218)
* Add example eval scripts for dbrx PT sizes * fix name * dsjd * yaml linting * update tokenizer
Configuration menu - View commit details
-
Copy full SHA for ef530bf - Browse repository at this point
Copy the full SHA ef530bfView commit details -
Configuration menu - View commit details
-
Copy full SHA for ff92f3c - Browse repository at this point
Copy the full SHA ff92f3cView commit details -
Configuration menu - View commit details
-
Copy full SHA for fdaa58b - Browse repository at this point
Copy the full SHA fdaa58bView commit details -
Move MLFlow dataset outside of log_config (mosaicml#1234)
* move txt log * typo * Update scripts/train/train.py * train config * debug * source data * verbose * debug * debug * check if mlflow is active * fex tests * move mlflow check to train * update test * precommit --------- Co-authored-by: Daniel King <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 1e4bd37 - Browse repository at this point
Copy the full SHA 1e4bd37View commit details
Commits on May 25, 2024
-
add error when chat template fails (mosaicml#1222)
* add error when chat template fails * type * formatting
Configuration menu - View commit details
-
Copy full SHA for 2e10d95 - Browse repository at this point
Copy the full SHA 2e10d95View commit details -
Configuration menu - View commit details
-
Copy full SHA for 867e58f - Browse repository at this point
Copy the full SHA 867e58fView commit details
Commits on May 27, 2024
-
Configuration menu - View commit details
-
Copy full SHA for c9257b5 - Browse repository at this point
Copy the full SHA c9257b5View commit details
Commits on May 28, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 43d149b - Browse repository at this point
Copy the full SHA 43d149bView commit details
Commits on May 29, 2024
-
Configuration menu - View commit details
-
Copy full SHA for b82a82b - Browse repository at this point
Copy the full SHA b82a82bView commit details -
Configuration menu - View commit details
-
Copy full SHA for fb9a225 - Browse repository at this point
Copy the full SHA fb9a225View commit details
Commits on Jun 3, 2024
-
Configuration menu - View commit details
-
Copy full SHA for d846731 - Browse repository at this point
Copy the full SHA d846731View commit details
Commits on Jun 4, 2024
-
replacing icl_task_type question_answering with generation_task_with_…
…answers in the long context eval yaml (mosaicml#1250)
Configuration menu - View commit details
-
Copy full SHA for 6c260f5 - Browse repository at this point
Copy the full SHA 6c260f5View commit details -
Configuration menu - View commit details
-
Copy full SHA for ac56dc5 - Browse repository at this point
Copy the full SHA ac56dc5View commit details
Commits on Jun 6, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 67928cb - Browse repository at this point
Copy the full SHA 67928cbView commit details -
Configuration menu - View commit details
-
Copy full SHA for 3966f0e - Browse repository at this point
Copy the full SHA 3966f0eView commit details -
Adding more token encoding types (mosaicml#1254)
* add more token encoing types * add more token encoing types * add tests * add tests * ft support, tests * linting is shortening my lifespan * linting is shortening my lifespan * long tensor * long tensor * long tensor * feedbacc * import * import --------- Co-authored-by: Daniel King <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 42c2d9a - Browse repository at this point
Copy the full SHA 42c2d9aView commit details -
Configuration menu - View commit details
-
Copy full SHA for 14f296c - Browse repository at this point
Copy the full SHA 14f296cView commit details
Commits on Jun 7, 2024
-
Bump Version to 0.10.0.dev0 (mosaicml#1255)
* bump version * typo * Update config_utils.py These changes are necessary as the deprecation broke compatibility with `update_batch_size`. * Update config_utils.py fix typo * typo * typo I * update tests * typo II * typo III * bump composer version * undo composer bump for seperate pr * fix test * fix tests II * yolo * tye-o * pyrite * we resolve later * revert new . syntax --------- Co-authored-by: v-chen_data <[email protected]> Co-authored-by: Milo Cress <[email protected]> Co-authored-by: Saaketh Narayan <[email protected]> Co-authored-by: Daniel King <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for bea61fb - Browse repository at this point
Copy the full SHA bea61fbView commit details -
Configuration menu - View commit details
-
Copy full SHA for e4b8b57 - Browse repository at this point
Copy the full SHA e4b8b57View commit details -
Configuration menu - View commit details
-
Copy full SHA for db70135 - Browse repository at this point
Copy the full SHA db70135View commit details -
Revert "Update TE Dockerfile (mosaicml#1265)" (mosaicml#1266)
This reverts commit db70135.
Configuration menu - View commit details
-
Copy full SHA for 4e53e74 - Browse repository at this point
Copy the full SHA 4e53e74View commit details -
Configuration menu - View commit details
-
Copy full SHA for dddb9b8 - Browse repository at this point
Copy the full SHA dddb9b8View commit details
Commits on Jun 8, 2024
-
Configuration menu - View commit details
-
Copy full SHA for dd92abf - Browse repository at this point
Copy the full SHA dd92abfView commit details
Commits on Jun 9, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 5571101 - Browse repository at this point
Copy the full SHA 5571101View commit details
Commits on Jun 13, 2024
-
Configuration menu - View commit details
-
Copy full SHA for ffec54b - Browse repository at this point
Copy the full SHA ffec54bView commit details -
Configuration menu - View commit details
-
Copy full SHA for c30856f - Browse repository at this point
Copy the full SHA c30856fView commit details
Commits on Jun 14, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 630fc68 - Browse repository at this point
Copy the full SHA 630fc68View commit details -
Configuration menu - View commit details
-
Copy full SHA for 9b9fc24 - Browse repository at this point
Copy the full SHA 9b9fc24View commit details -
Configuration menu - View commit details
-
Copy full SHA for 1a2fac0 - Browse repository at this point
Copy the full SHA 1a2fac0View commit details -
Configuration menu - View commit details
-
Copy full SHA for 82ef072 - Browse repository at this point
Copy the full SHA 82ef072View commit details -
Configuration menu - View commit details
-
Copy full SHA for 4350990 - Browse repository at this point
Copy the full SHA 4350990View commit details -
Configuration menu - View commit details
-
Copy full SHA for dbd798e - Browse repository at this point
Copy the full SHA dbd798eView commit details
Commits on Jun 15, 2024
-
* remerge * import supported extensions * no circular imports * local import * typo * testing * test * test * more test * rm . from ext * clean up and pre commit * imports and boolean typo * revert one exception, use backend over startwith, and set path using parse_uri * move import to local * debug * typo * debug * more debug * debug * clean up * back to debug * typo * add slash * move slash to verify_uc_path * moved back * debug * more debug * clean up * pyright * generic error messaging * move NotFound up * dbfs directory check * precommit --------- Co-authored-by: Daniel King <[email protected]> Co-authored-by: v-chen_data <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 1ff6c5b - Browse repository at this point
Copy the full SHA 1ff6c5bView commit details
Commits on Jun 16, 2024
-
Co-authored-by: v-chen_data <[email protected]> Co-authored-by: Daniel King <[email protected]> Co-authored-by: Mihir Patel <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for ca528d5 - Browse repository at this point
Copy the full SHA ca528d5View commit details
Commits on Jun 17, 2024
-
Configuration menu - View commit details
-
Copy full SHA for f8b2875 - Browse repository at this point
Copy the full SHA f8b2875View commit details -
Configuration menu - View commit details
-
Copy full SHA for 618db6f - Browse repository at this point
Copy the full SHA 618db6fView commit details
Commits on Jun 18, 2024
-
Configuration menu - View commit details
-
Copy full SHA for c23be4a - Browse repository at this point
Copy the full SHA c23be4aView commit details
Commits on Jun 19, 2024
-
added systemMetricsMonitor callback (mosaicml#1260)
* added systemMetricsMonitor callback * added unit testing for building system_metrics_monitor * temporarily removing flag check in unit test for system_metrics_monitor build * removed end of line formatting issue * fixed end of line error in __init__.py * fixed extra line in init --------- Co-authored-by: Daniel King <[email protected]> Co-authored-by: Mihir Patel <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 4b1fecb - Browse repository at this point
Copy the full SHA 4b1fecbView commit details
Commits on Jun 20, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 8241f9c - Browse repository at this point
Copy the full SHA 8241f9cView commit details
Commits on Jun 21, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 78e4cc6 - Browse repository at this point
Copy the full SHA 78e4cc6View commit details -
Configuration menu - View commit details
-
Copy full SHA for 13bd8f9 - Browse repository at this point
Copy the full SHA 13bd8f9View commit details -
Configuration menu - View commit details
-
Copy full SHA for e8ba9b7 - Browse repository at this point
Copy the full SHA e8ba9b7View commit details -
Bump composer to 0.23.4 (mosaicml#1297)
* bump composer * bump composer
Configuration menu - View commit details
-
Copy full SHA for 129e3e1 - Browse repository at this point
Copy the full SHA 129e3e1View commit details -
Configuration menu - View commit details
-
Copy full SHA for 2196d07 - Browse repository at this point
Copy the full SHA 2196d07View commit details
Commits on Jun 23, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 8b5a1bb - Browse repository at this point
Copy the full SHA 8b5a1bbView commit details
Commits on Jun 24, 2024
-
Add Retries to run_query (mosaicml#1302)
* add retry * pyright * slight refactor --------- Co-authored-by: v-chen_data <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for fd7b187 - Browse repository at this point
Copy the full SHA fd7b187View commit details -
Bumping mlflow version to include buffering (mosaicml#1303)
* bumping mlflow version to include buffering * capping at mlflow 2.15
Configuration menu - View commit details
-
Copy full SHA for 2267bc7 - Browse repository at this point
Copy the full SHA 2267bc7View commit details -
Ignore mosaicml logger for exception if excephook is active (mosaicml…
…#1301) * ignore logger if excephook is active * remove logger in data scripts and callback * undo format of imports * moved env var check into helper * formatted * removed import format * added docstring * ran pre-commit --------- Co-authored-by: Daniel King <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 21c9e0a - Browse repository at this point
Copy the full SHA 21c9e0aView commit details
Commits on Jun 25, 2024
-
Configuration menu - View commit details
-
Copy full SHA for ef14849 - Browse repository at this point
Copy the full SHA ef14849View commit details
Commits on Jun 26, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 2412b59 - Browse repository at this point
Copy the full SHA 2412b59View commit details -
Configuration menu - View commit details
-
Copy full SHA for bbfebda - Browse repository at this point
Copy the full SHA bbfebdaView commit details
Commits on Jun 27, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 901eee3 - Browse repository at this point
Copy the full SHA 901eee3View commit details -
Configuration menu - View commit details
-
Copy full SHA for 3edce07 - Browse repository at this point
Copy the full SHA 3edce07View commit details -
Configuration menu - View commit details
-
Copy full SHA for 14348fa - Browse repository at this point
Copy the full SHA 14348faView commit details -
Configuration menu - View commit details
-
Copy full SHA for 472d009 - Browse repository at this point
Copy the full SHA 472d009View commit details -
Configuration menu - View commit details
-
Copy full SHA for f141ee1 - Browse repository at this point
Copy the full SHA f141ee1View commit details
Commits on Jun 29, 2024
-
Provide default seed value in TrainConfig, matching EvalConfig (mosai…
…cml#1315) * Update config_utils.py * lint
Configuration menu - View commit details
-
Copy full SHA for 0ebd7c9 - Browse repository at this point
Copy the full SHA 0ebd7c9View commit details -
Configuration menu - View commit details
-
Copy full SHA for 88511f7 - Browse repository at this point
Copy the full SHA 88511f7View commit details
Commits on Jun 30, 2024
-
Allows interweaving of arbitrary kinds of 'attention' layers, like sl…
…iding window, reuse prev layer kv cache etc. (mosaicml#1299) * [WIP] Allows interweaving of arbitrary kinds of 'attention' layers, like RNN, sliding window etc. * lint * applying overrides to blocks rather than just attentions * add docstring * minor * changing yaml specification style * .. * fixes * fix * fix * fix * refactoring * add warning * compute only query vector when reusing kv * refactor * fixing * adding test for reusing previous layer kv cache * adding error messages * .. * adding test * add logging * adding logging * minor * bug fix, adding test * minor * addressing some comments * addressing some comments * setting absolute absolute value for reuse_kv_layer_idx * lint * adding tests for override_block_args * adding error if reusing kv cache from a mismatch layer * fixing test * fixing code, test * fix * .. * refactoring * fix * .. * .. * .. * refactoring * .. * .. * .. * adding test for _get_modules_order_expanded * fixing test * fixing test * lint * lint * adding test * addressing comment * .. * fixing test * changing yaml format * fix configuation * fixing test * allowing repeat at top level * allowing overriding error * addressing comments * lint * addressing comments * fix * .. * .. * .. * .. * .. * addressing comment * fixing test
Configuration menu - View commit details
-
Copy full SHA for 8604bba - Browse repository at this point
Copy the full SHA 8604bbaView commit details
Commits on Jul 1, 2024
-
Add optional logging of text output to EvalOutputLogging (mosaicml#1283)
--------- Co-authored-by: Mihir Patel <[email protected]> Co-authored-by: Daniel King <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 68c2625 - Browse repository at this point
Copy the full SHA 68c2625View commit details
Commits on Jul 3, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 742f340 - Browse repository at this point
Copy the full SHA 742f340View commit details
Commits on Jul 5, 2024
-
Configuration menu - View commit details
-
Copy full SHA for e99ec07 - Browse repository at this point
Copy the full SHA e99ec07View commit details -
Configuration menu - View commit details
-
Copy full SHA for 48b239e - Browse repository at this point
Copy the full SHA 48b239eView commit details -
Configuration menu - View commit details
-
Copy full SHA for a76d824 - Browse repository at this point
Copy the full SHA a76d824View commit details -
Configuration menu - View commit details
-
Copy full SHA for d238edc - Browse repository at this point
Copy the full SHA d238edcView commit details -
Configuration menu - View commit details
-
Copy full SHA for b723c25 - Browse repository at this point
Copy the full SHA b723c25View commit details -
Configuration menu - View commit details
-
Copy full SHA for 820221a - Browse repository at this point
Copy the full SHA 820221aView commit details -
Configuration menu - View commit details
-
Copy full SHA for a69273d - Browse repository at this point
Copy the full SHA a69273dView commit details -
Configuration menu - View commit details
-
Copy full SHA for 8ec8716 - Browse repository at this point
Copy the full SHA 8ec8716View commit details
Commits on Jul 19, 2024
-
Configuration menu - View commit details
-
Copy full SHA for de1240d - Browse repository at this point
Copy the full SHA de1240dView commit details -
Configuration menu - View commit details
-
Copy full SHA for 51beeb0 - Browse repository at this point
Copy the full SHA 51beeb0View commit details
Commits on Jul 26, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 1939ae4 - Browse repository at this point
Copy the full SHA 1939ae4View commit details