arxiv:2508.18255
Jeffrey Quesnelle PRO
emozilla
AI & ML interests
None yet
Organizations
models 92
emozilla/consilience-v1-40b-mitchell-init
Text Generation • Updated • 2
emozilla/llama3-8b-dcp-default_tt-init
Updated
emozilla/Llama-3.1-405B-DCP
Updated
emozilla/Llama-3.1-70B-DCP
Updated
emozilla/Llama-3.1-8B-DCP
Updated
emozilla/llama2-15b-gqa-init
Text Generation • Updated • 1
emozilla/llama2-1.1b-gqa-init
Text Generation • Updated • 2
emozilla/llama2-15b-init
Text Generation • Updated
emozilla/llama2-1.2b-nanotron-init
Updated
emozilla/llama2-1.2b-init-6
Text Generation • Updated
datasets 53
emozilla/Hermes-3-Preprocessed-Llama3-2samples
Viewer • Updated • 2 • 17
emozilla/Hermes-3-Preprocessed-Llama3-100samples
Viewer • Updated • 100 • 24 • 1
emozilla/Hermes-3-Preprocessed-Llama3
Viewer • Updated • 91.1k • 51 • 1
emozilla/dolma-v1_7-30B-tokenized-llama2-nanoset
Updated • 67
emozilla/fineweb-10bt-tokenized-datatrove-llama2
Updated • 142 • 3
emozilla/fineweb-350bt-tokenized-datatrove-llama2
Updated • 201
emozilla/dolma-v1_7-305B-tokenized-llama2-nanoset
Updated • 85
emozilla/proofpile-test-tokenized-llama3
Viewer • Updated • 46.3k • 19
emozilla/PaulGrahamEssays
Viewer • Updated • 49 • 11
emozilla/dolma-v1_7-cc_en_head
Viewer • Updated • 475M • 763