Open to Work
vigneshwar l
vigneshwar234
4 followers · 13 following
vignesh2027
vigneshwar-l-td729994
AI & ML interests
Retrieval-Augmented Generation, Natural Language Processing, Artificial Intelligence, Machine Learning, Causal Reasoning, Information Retrieval
Organizations
None yet
vigneshwar234's activity
New activity in
malhajar/OpenLLMTurkishLeaderboard_v0.2
1 day ago
Multi-metric evaluation tool for Turkish LLM v0.2 — cost + hallucination
#14 opened 1 day ago by
vigneshwar234
New activity in
ghost613/LLM-Training-Time-and-Cost-Calculator
1 day ago
Inference cost evaluation to complement training cost estimates
#1 opened 1 day ago by
vigneshwar234
New activity in
TrustSafeAI/RADAR-AI-Text-Detector
1 day ago
Open source LLM evaluation including hallucination rate for AI safety teams
#2 opened 1 day ago by
vigneshwar234
New activity in
locuslab/tofu_leaderboard
1 day ago
Evaluation framework that measures quality before and after unlearning
#2 opened 1 day ago by
vigneshwar234
New activity in
Inferless/LLM-Inference-Benchmark
1 day ago
Complementary quality evaluation: accuracy + hallucination alongside inference benchmarks
#1 opened 1 day ago by
vigneshwar234
New activity in
hackaprompt/playground
1 day ago
Open source tool to measure how LLMs perform after prompt injection attacks
#5 opened 1 day ago by
vigneshwar234
New activity in
upstage/evalverse-space
1 day ago
Open source evaluation framework — cost + hallucination dimensions alongside reports
#1 opened 1 day ago by
vigneshwar234
New activity in
NPHardEval/NPHardEval-leaderboard
1 day ago
Complementary evaluation: cost + latency + hallucination for hard reasoning LLMs
#3 opened 1 day ago by
vigneshwar234
New activity in
elmresearchcenter/open_universal_arabic_asr_leaderboard
1 day ago
LLM text evaluation complement for Arabic ASR pipeline post-processing
#3 opened 1 day ago by
vigneshwar234
New activity in
chinese-babylm-org/chinesebabylm-2026-leaderboard
1 day ago
Multi-metric evaluation for Chinese LLM selection — cost + accuracy + hallucination
#1 opened 1 day ago by
vigneshwar234
New activity in
sentence-transformers/quantized-retrieval
1 day ago
Evaluate LLM generation quality on top of your retrieval — cost + hallucination
#8 opened 1 day ago by
vigneshwar234
New activity in
Salesforce/GIFT-Eval
1 day ago
Complementary LLM evaluation for models used in time series + forecasting tasks
#21 opened 1 day ago by
vigneshwar234
New activity in
osunlp/QUEST
1 day ago
Evaluation tool for web research LLMs: accuracy + hallucination + cost
#1 opened 1 day ago by
vigneshwar234
New activity in
SupraLabs/Supra-50M-Reasoning-Demo
1 day ago
Benchmark Supra reasoning: accuracy + cost + hallucination at 50M scale
#1 opened 1 day ago by
vigneshwar234
New activity in
CohereLabs/command-a-reasoning
1 day ago
Benchmark Command A reasoning: cost + hallucination + quality vs other models
#1 opened 1 day ago by
vigneshwar234
New activity in
aizip-dev/SLM-RAG-Arena
1 day ago
Systematic SLM evaluation: accuracy + cost + hallucination for RAG model selection
#2 opened 1 day ago by
vigneshwar234
New activity in
LiquidAI/LFM2.5-8B-A1B
1 day ago
Benchmark LFM2.5 on accuracy + cost + hallucination against other models
#2 opened 1 day ago by
vigneshwar234
New activity in
huggingface-projects/llama-2-13b-chat
1 day ago
Benchmark Llama 2 vs newer models: accuracy + cost + hallucination comparison
#60 opened 1 day ago by
vigneshwar234
New activity in
Wulinjuan/CULTURE-MT
1 day ago
Multi-metric LLM evaluation for cultural and multilingual model selection
#1 opened 1 day ago by
vigneshwar234
New activity in
Navid-AI/The-Arabic-IR-Leaderboard
1 day ago
Complement to Arabic IR — LLM accuracy + hallucination + cost evaluation
#4 opened 1 day ago by
vigneshwar234