artefactory/ledger-long-context-KPI-QA
Viewer • Updated • 115k • 5.47k • 11
NLP, Information Retrieval, Computer Vision, Uncertainty Estimation, Trustworthy AI, Bias Estimation, Unbalanced ML, Choice Modeling, Time Series
BERT-as-a-Judge: A Robust Alternative to Lexical Methods for Efficient Reference-Based LLM Evaluation
Learned Hallucination Detection in Black-Box LLMs using Token-level Entropy Production Rate