Open to Work

vigneshwar l

vigneshwar234

AI & ML interests

Retrieval-Augmented Generation Natural Language Processing Artificial Intelligence Machine Learning Causal Reasoning Information Retrieval

Recent Activity

new activity 1 day ago

malhajar/OpenLLMTurkishLeaderboard_v0.2:Multi-metric evaluation tool for Turkish LLM v0.2 — cost + hallucination

new activity 1 day ago

ghost613/LLM-Training-Time-and-Cost-Calculator:Inference cost evaluation to complement training cost estimates

new activity 1 day ago

TrustSafeAI/RADAR-AI-Text-Detector:Open source LLM evaluation including hallucination rate for AI safety teams

View all activity

Organizations

None yet

New activity in malhajar/OpenLLMTurkishLeaderboard_v0.2 1 day ago

Multi-metric evaluation tool for Turkish LLM v0.2 — cost + hallucination

#14 opened 1 day ago by

vigneshwar234

New activity in ghost613/LLM-Training-Time-and-Cost-Calculator 1 day ago

Inference cost evaluation to complement training cost estimates

#1 opened 1 day ago by

vigneshwar234

New activity in TrustSafeAI/RADAR-AI-Text-Detector 1 day ago

Open source LLM evaluation including hallucination rate for AI safety teams

#2 opened 1 day ago by

vigneshwar234

New activity in locuslab/tofu_leaderboard 1 day ago

Evaluation framework that measures quality before and after unlearning

#2 opened 1 day ago by

vigneshwar234

New activity in Inferless/LLM-Inference-Benchmark 1 day ago

Complementary quality evaluation: accuracy + hallucination alongside inference benchmarks

#1 opened 1 day ago by

vigneshwar234

New activity in hackaprompt/playground 1 day ago

Open source tool to measure how LLMs perform after prompt injection attacks

#5 opened 1 day ago by

vigneshwar234

New activity in upstage/evalverse-space 1 day ago

Open source evaluation framework — cost + hallucination dimensions alongside reports

#1 opened 1 day ago by

vigneshwar234

New activity in NPHardEval/NPHardEval-leaderboard 1 day ago

Complementary evaluation: cost + latency + hallucination for hard reasoning LLMs

#3 opened 1 day ago by

vigneshwar234

New activity in elmresearchcenter/open_universal_arabic_asr_leaderboard 1 day ago

LLM text evaluation complement for Arabic ASR pipeline post-processing

#3 opened 1 day ago by

vigneshwar234

New activity in chinese-babylm-org/chinesebabylm-2026-leaderboard 1 day ago

Multi-metric evaluation for Chinese LLM selection — cost + accuracy + hallucination

#1 opened 1 day ago by

vigneshwar234

New activity in sentence-transformers/quantized-retrieval 1 day ago

Evaluate LLM generation quality on top of your retrieval — cost + hallucination

#8 opened 1 day ago by

vigneshwar234

New activity in Salesforce/GIFT-Eval 1 day ago

Complementary LLM evaluation for models used in time series + forecasting tasks

#21 opened 1 day ago by

vigneshwar234

New activity in osunlp/QUEST 1 day ago

Evaluation tool for web research LLMs: accuracy + hallucination + cost

#1 opened 1 day ago by

vigneshwar234

New activity in SupraLabs/Supra-50M-Reasoning-Demo 1 day ago

Benchmark Supra reasoning: accuracy + cost + hallucination at 50M scale

#1 opened 1 day ago by

vigneshwar234

New activity in CohereLabs/command-a-reasoning 1 day ago

Benchmark Command A reasoning: cost + hallucination + quality vs other models

#1 opened 1 day ago by

vigneshwar234

New activity in aizip-dev/SLM-RAG-Arena 1 day ago

Systematic SLM evaluation: accuracy + cost + hallucination for RAG model selection

#2 opened 1 day ago by

vigneshwar234

New activity in LiquidAI/LFM2.5-8B-A1B 1 day ago

Benchmark LFM2.5 on accuracy + cost + hallucination against other models

#2 opened 1 day ago by

vigneshwar234

New activity in huggingface-projects/llama-2-13b-chat 1 day ago

Benchmark Llama 2 vs newer models: accuracy + cost + hallucination comparison

#60 opened 1 day ago by

vigneshwar234

New activity in Wulinjuan/CULTURE-MT 1 day ago

Multi-metric LLM evaluation for cultural and multilingual model selection

#1 opened 1 day ago by

vigneshwar234

New activity in Navid-AI/The-Arabic-IR-Leaderboard 1 day ago

Complement to Arabic IR — LLM accuracy + hallucination + cost evaluation

#4 opened 1 day ago by

vigneshwar234

vigneshwar l

AI & ML interests

Recent Activity

Organizations

vigneshwar234's activity

Multi-metric evaluation tool for Turkish LLM v0.2 — cost + hallucination

Inference cost evaluation to complement training cost estimates

Open source LLM evaluation including hallucination rate for AI safety teams

Evaluation framework that measures quality before and after unlearning

Complementary quality evaluation: accuracy + hallucination alongside inference benchmarks

Open source tool to measure how LLMs perform after prompt injection attacks

Open source evaluation framework — cost + hallucination dimensions alongside reports

Complementary evaluation: cost + latency + hallucination for hard reasoning LLMs

LLM text evaluation complement for Arabic ASR pipeline post-processing

Multi-metric evaluation for Chinese LLM selection — cost + accuracy + hallucination

Evaluate LLM generation quality on top of your retrieval — cost + hallucination

Complementary LLM evaluation for models used in time series + forecasting tasks

Evaluation tool for web research LLMs: accuracy + hallucination + cost

Benchmark Supra reasoning: accuracy + cost + hallucination at 50M scale

Benchmark Command A reasoning: cost + hallucination + quality vs other models

Systematic SLM evaluation: accuracy + cost + hallucination for RAG model selection

Benchmark LFM2.5 on accuracy + cost + hallucination against other models

Benchmark Llama 2 vs newer models: accuracy + cost + hallucination comparison

Multi-metric LLM evaluation for cultural and multilingual model selection

Complement to Arabic IR — LLM accuracy + hallucination + cost evaluation