Yuzhen Mao
gist-sparse-attention
Recent Activity
upvoted a paper 1 day ago: Simplified Sparse Attention via Gist Tokens
submitted a paper 1 day ago: Simplified Sparse Attention via Gist Tokens
upvoted a paper 21 days ago: Decentralized Multi-Agent Systems with Shared Context
gist-sparse-attention's models: 19 (sorted by recently updated)
gist-sparse-attention/GSA-FT-Qwen2-7B-Instruct-chunk8 • 333k • Updated Apr 6 • 1
gist-sparse-attention/GSA-FT-Qwen2-7B-Instruct-chunk16 • 333k • Updated Apr 6 • 52
gist-sparse-attention/GSA-FT-Qwen2-7B-Instruct-chunk32 • 333k • Updated Apr 6 • 6
gist-sparse-attention/GSA-FT-Qwen2-7B-Instruct-chunk4-chunk4 • 333k • Updated Apr 6 • 278
gist-sparse-attention/GSA-FT-Qwen2-7B-Instruct-chunk8-chunk4 • 333k • Updated Apr 6 • 3
gist-sparse-attention/GSA-FT-Llama-3.2-1B-chunk16 • 1B • Updated Apr 6 • 5
gist-sparse-attention/GSA-FT-Llama-3.2-1B-chunk4-chunk4 • 1B • Updated Apr 6 • 2
gist-sparse-attention/GSA-link-FT-Llama-3.2-1B-chunk8 • 1B • Updated Apr 6 • 2
gist-sparse-attention/GSA-link-FT-Llama-3.2-1B-chunk16 • 1B • Updated Apr 6 • 5
gist-sparse-attention/GSA-link-FT-Llama-3.2-1B-chunk4-chunk4 • 1B • Updated Apr 6 • 7
gist-sparse-attention/GSA-FT-Llama-3.2-1B-chunk8 • 1B • Updated Apr 6 • 2
gist-sparse-attention/GSA-PT-Llama-3.2-1B-chunk4-chunk4 • 1B • Updated Apr 6 • 1
gist-sparse-attention/GSA-PT-Llama-3.2-1B-chunk16 • 1B • Updated Apr 6 • 1
gist-sparse-attention/GSA-PT-Llama-3.2-1B-chunk8 • 1B • Updated Apr 6 • 5
gist-sparse-attention/GSA-PT-Qwen2-7B-Instruct-chunk8-chunk4 • 333k • Updated Apr 6 • 1
gist-sparse-attention/GSA-PT-Qwen2-7B-Instruct-chunk4-chunk4 • 333k • Updated Apr 6 • 4
gist-sparse-attention/GSA-PT-Qwen2-7B-Instruct-chunk32 • 333k • Updated Apr 6 • 5
gist-sparse-attention/GSA-PT-Qwen2-7B-Instruct-chunk16 • 333k • Updated Apr 6 • 31
gist-sparse-attention/GSA-PT-Qwen2-7B-Instruct-chunk8 • 333k • Updated Apr 6 • 2