KuKu
dragonkue
AI & ML interests
anything.
Recent Activity
updated a collection about 3 hours ago
papers upvoted a paper 2 days ago
Is Position Bias in Dense Retrievers Built In-or Learned from Data? updated a model 5 days ago
dragonkue/colbert-ko-0.1bOrganizations
Reranker Models
A collection of high-performance Korean reranker models, including those I have trained myself as well as other strong baselines
-
dragonkue/bge-reranker-v2-m3-ko
Text Ranking • 0.6B • Updated • 62.8k • 23 -
telepix/PIXIE-Spell-Reranker-Preview-0.6B
Text Ranking • 0.6B • Updated • 93 • 5 -
BAAI/bge-reranker-v2-m3
Text Classification • 0.6B • Updated • 13.7M • • 1.01k -
Qwen/Qwen3-Reranker-0.6B
Text Ranking • 0.6B • Updated • 1.37M • 354
Multi-modal Retrieval Models
Korean Sparse Retriever
Korean Embedding Models
A collection of high-performance Korean embedding models, including both models I trained myself and other publicly available strong baselines.
-
dragonkue/snowflake-arctic-embed-l-v2.0-ko
Sentence Similarity • 0.6B • Updated • 21.6k • • 47 -
dragonkue/BGE-m3-ko
Sentence Similarity • 0.6B • Updated • 374k • • 76 -
dragonkue/multilingual-e5-small-ko
Sentence Similarity • 0.1B • Updated • 9.12k • • 10 -
dragonkue/multilingual-e5-small-ko-v2
Sentence Similarity • 0.1B • Updated • 10.9k • • 4
Multilingual Embedding Models
A collection of multilingual embedding models suitable for use as training backbones
-
google/embeddinggemma-300m
Sentence Similarity • 0.3B • Updated • 1.88M • • 1.68k -
BAAI/bge-m3
Sentence Similarity • Updated • 31.2M • • 3.06k -
Snowflake/snowflake-arctic-embed-l-v2.0
Sentence Similarity • 0.6B • Updated • 1.03M • • 247 -
intfloat/multilingual-e5-large-instruct
Feature Extraction • 0.6B • Updated • 1.54M • • 623
Colbert (multi-vec)
-
dragonkue/colbert-ko-0.1b
Sentence Similarity • 0.1B • Updated • 319 • 4 -
LiquidAI/LFM2-ColBERT-350M
Sentence Similarity • 0.4B • Updated • 81.1k • 131 -
yjoonjang/colbert-ko-v1
Sentence Similarity • 0.1B • Updated • 22 • 16 -
mixedbread-ai/mxbai-edge-colbert-v0-32m
Sentence Similarity • 31.9M • Updated • 83.3k • • 45
Korean BERT
A collection of backbone models suitable for building Korean embedding or reranker models.
papers
Korean Embedding Models
A collection of high-performance Korean embedding models, including both models I trained myself and other publicly available strong baselines.
-
dragonkue/snowflake-arctic-embed-l-v2.0-ko
Sentence Similarity • 0.6B • Updated • 21.6k • • 47 -
dragonkue/BGE-m3-ko
Sentence Similarity • 0.6B • Updated • 374k • • 76 -
dragonkue/multilingual-e5-small-ko
Sentence Similarity • 0.1B • Updated • 9.12k • • 10 -
dragonkue/multilingual-e5-small-ko-v2
Sentence Similarity • 0.1B • Updated • 10.9k • • 4
Reranker Models
A collection of high-performance Korean reranker models, including those I have trained myself as well as other strong baselines
-
dragonkue/bge-reranker-v2-m3-ko
Text Ranking • 0.6B • Updated • 62.8k • 23 -
telepix/PIXIE-Spell-Reranker-Preview-0.6B
Text Ranking • 0.6B • Updated • 93 • 5 -
BAAI/bge-reranker-v2-m3
Text Classification • 0.6B • Updated • 13.7M • • 1.01k -
Qwen/Qwen3-Reranker-0.6B
Text Ranking • 0.6B • Updated • 1.37M • 354
Multilingual Embedding Models
A collection of multilingual embedding models suitable for use as training backbones
-
google/embeddinggemma-300m
Sentence Similarity • 0.3B • Updated • 1.88M • • 1.68k -
BAAI/bge-m3
Sentence Similarity • Updated • 31.2M • • 3.06k -
Snowflake/snowflake-arctic-embed-l-v2.0
Sentence Similarity • 0.6B • Updated • 1.03M • • 247 -
intfloat/multilingual-e5-large-instruct
Feature Extraction • 0.6B • Updated • 1.54M • • 623
Multi-modal Retrieval Models
Colbert (multi-vec)
-
dragonkue/colbert-ko-0.1b
Sentence Similarity • 0.1B • Updated • 319 • 4 -
LiquidAI/LFM2-ColBERT-350M
Sentence Similarity • 0.4B • Updated • 81.1k • 131 -
yjoonjang/colbert-ko-v1
Sentence Similarity • 0.1B • Updated • 22 • 16 -
mixedbread-ai/mxbai-edge-colbert-v0-32m
Sentence Similarity • 31.9M • Updated • 83.3k • • 45
Korean Sparse Retriever
Korean BERT
A collection of backbone models suitable for building Korean embedding or reranker models.