This is the collection for the paper "Memorization vs. Reasoning: Updating LLMs with New Knowledge". We provide the dataset used in this paper
Aochong Oliver Li
aochongoliverli
AI & ML interests
Large Language Models, Natural Language Processing, Machine Learning
Organizations
None yet
models 89
aochongoliverli/Qwen2.5-3B-limo-qwq-16k-3epochs-5e-5lr-step150
3B • Updated • 2
aochongoliverli/Qwen2.5-3B-math8k-distill-AM-Distill-Qwen-32B-16k-5epochs-2e-5lr-step500
Text Generation • 3B • Updated • 2
aochongoliverli/Qwen2.5-3B-math8k-distill-AM-Distill-Qwen-32B-16k-5epochs-2e-5lr-step400
Text Generation • 3B • Updated • 1
aochongoliverli/Qwen2.5-1.5B-math8k-distill-AM-Distill-Qwen-32B-16k-5epochs-2e-5lr-step600
Text Generation • 2B • Updated • 1
aochongoliverli/Qwen2.5-0.5B-math8k-distill-AM-Distill-Qwen-32B-16k-5epochs-5e-5lr-step500
Text Generation • 0.5B • Updated • 1
aochongoliverli/Qwen2.5-0.5B-math8k-distill-AM-Distill-Qwen-32B-16k-5epochs-5e-5lr-step400
Text Generation • 0.5B • Updated • 1
aochongoliverli/Qwen2.5-7B-math8k-distill-AM-Distill-Qwen-32B-16k-10epochs-5e-5lr-step100
Updated
aochongoliverli/Qwen2.5-1.5B-math8k-distill-AM-Distill-Qwen-32B-16k-5epochs-2e-5lr-step500
Text Generation • 2B • Updated • 2
aochongoliverli/Qwen2.5-1.5B-math8k-distill-AM-Distill-Qwen-32B-16k-5epochs-2e-5lr-step400
Text Generation • 2B • Updated • 2
aochongoliverli/Qwen2.5-1.5B-math8k-distill-AM-Distill-Qwen-32B-16k-5epochs-2e-5lr-step300
Text Generation • 2B • Updated • 2
datasets 72
aochongoliverli/wmdp_shot_examples_256
Viewer • Updated • 256 • 8
aochongoliverli/wmdp_biochem_inquiries_800
Viewer • Updated • 800 • 71
aochongoliverli/pmc_openaccess_split
Viewer • Updated • 6.96M • 291
aochongoliverli/allcode-results
Updated • 39
aochongoliverli/allscience-results
Viewer • Updated • 63.4k • 58
aochongoliverli/Qwen2.5-1.5B-math8k-AM-5epochs-5e-5lr-step400-dapo-5epochs-8rollouts-16384max-len-rollouts
Viewer • Updated • 7.59k • 11
aochongoliverli/Qwen2.5-1.5B-math8k-AM-10epochs-2e-5lr-step400-dapo-5epochs-8rollouts-16384max-len-rollouts
Viewer • Updated • 1.28k • 5
aochongoliverli/Qwen2.5-0.5B-math8k-AM-400steps-dapo-5epochs-8rollouts-16384max-len-rollouts
Viewer • Updated • 7.59k • 13
aochongoliverli/Qwen2.5-1.5B-math8k-AM-400steps-dapo-5epochs-8rollouts-16384max-len-rollouts
Viewer • Updated • 7.59k • 41
aochongoliverli/Qwen4B-MegaMath-pro-max-4096-len-sft-no-external-knowledge
Viewer • Updated • 2.87k • 9