LLaMA Implementation #21955
Closed
Changes from 29 commits
53 commits
d7e21f6 LLaMA (zphang)
8978f28 sharding and docs (zphang)
1b4850b tweak (zphang)
1716c4e black (zphang)
e31715d inits (zphang)
55012ec ruff (zphang)
a4c89ee LLAMA_PRETRAINED_CONFIG_ARCHIVE_MAP (zphang)
984ea75 init (zphang)
a61eae9 no checkpoint (zphang)
4a9a7df docs (zphang)
39991ad ruff (zphang)
a376678 type_vocab_size (zphang)
c1dae8f tokenizer fixes (zphang)
459e2ac tokenizer fixes (zphang)
a82e47c Update tokenization_llama.py (StellaAthena)
2f36c47 Update tokenization_llama.py (StellaAthena)
2a07565 Update configuration_llama.py (StellaAthena)
331898c Update modeling_llama.py (StellaAthena)
6a17e7f Merge pull request #2 from zphang/StellaAthena-patch-1 (StellaAthena)
bdb7064 tokenizer add_bos by default (zphang)
e7c9bff licenses (zphang)
132f59b remove decoder (zphang)
a786f29 norms and mlp (zphang)
76a9f07 rope overhaul (zphang)
5ced472 tweaks (zphang)
6e7ecaf black (zphang)
4b11ce2 mention OPT implementation (zphang)
0209e0b off-by-one naming (zphang)
660dd6e typo (zphang)
e5dd77a fix (zphang)
68d640f tokenization fix and slicing bug (zphang)
16058fe padding config (zphang)
e2faccb cleanup (zphang)
84948eb black (zphang)
a3dfcc0 update tests (zphang)
58fe9a6 undo typo (zphang)
8eefcac fix vocab caching logic (zphang)
48c89c2 ruff (zphang)
c3dc391 docbuilder (zphang)
ef61b1b attn fix from BlackSamorez (zphang)
49cc1eb initial feedback (zphang)
8dbd0d1 typo (zphang)
28e103e docs (zphang)
4297855 llama case (zphang)
612b694 llama case (zphang)
de1cd5d load checkpoint docs (zphang)
951023f comment about tokenizer (zphang)
dcd5524 tokenizer defaults (zphang)
1f6f97d clear past_key_values if use_cache=False (zphang)
7452ebd last tweaks (zphang)
6fce445 last tweaks (zphang)
66c8c80 last tweaks (zphang)
3884da1 last tweaks (zphang)