zhentao tan

tzt

tzt101

AI & ML interests

Computer Vision

Recent Activity

submitted a paper 2 days ago

HydraHead: From Head-Level Functional Heterogeneity to Specialized Attention Hybridization

authored a paper 2 days ago

HydraHead: From Head-Level Functional Heterogeneity to Specialized Attention Hybridization

upvoted a paper 3 days ago

HydraHead: From Head-Level Functional Heterogeneity to Specialized Attention Hybridization

Organizations

None yet

submitted a paper to Daily Papers 2 days ago

HydraHead: From Head-Level Functional Heterogeneity to Specialized Attention Hybridization

Paper • 2606.20097 • Published 7 days ago • 17

authored a paper 2 days ago

HydraHead: From Head-Level Functional Heterogeneity to Specialized Attention Hybridization

Paper • 2606.20097 • Published 7 days ago • 17

upvoted a paper 3 days ago

HydraHead: From Head-Level Functional Heterogeneity to Specialized Attention Hybridization

Paper • 2606.20097 • Published 7 days ago • 17

liked a model 3 months ago

AIDC-AI/Marco-Mini-Instruct

Text Generation • 17B • Updated Apr 10 • 243 • 45

liked a dataset 4 months ago

vaishali/spider-tableQA

Viewer • Updated Feb 21, 2024 • 7.7k • 120 • 11

upvoted a collection 6 months ago

Nemotron-Pre-Training-Datasets

Collection

Large scale pre-training datasets used in the Nemotron family of models. • 15 items • Updated 13 days ago • 169

New activity in allenai/OLMoE-1B-7B-0125-Instruct 7 months ago

Tokenizer Question

#5 opened 7 months ago by tzt

liked a dataset 8 months ago

allenai/SciRIFF-train-mix

Viewer • Updated Jun 13, 2024 • 70.7k • 38 • 11

liked a model about 1 year ago

aaghaazkhan/Qwen2.5-3B-law-instruct

Text Generation • Updated Nov 17, 2025 • 5 • 2

liked 4 datasets about 1 year ago

updated a collection about 1 year ago

LLMs reasoning

Collection

2 items • Updated Mar 27, 2025

liked 2 models over 1 year ago

allenai/Llama-3.1-Tulu-3.1-8B

Text Generation • 8B • Updated Feb 10, 2025 • 1.11k • 39

microsoft/Phi-4-multimodal-instruct

Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 519k • 1.61k

liked 2 datasets over 1 year ago

instruction-pretrain/medicine-instruction-augmented-corpora

Preview • Updated Mar 2 • 351 • 13

casinca/PUBMED_title_abstracts_2019_baseline

Viewer • Updated May 17, 2024 • 3.68M • 152 • 9

liked a model over 1 year ago

m-a-p/FineFineWeb-bert

Updated Dec 19, 2024 • 6

liked a dataset over 1 year ago

datajuicer/the-pile-pubmed-central-refined-by-data-juicer

Viewer • Updated Oct 23, 2023 • 100 • 12 • 2
