University of Toronto CSSLab

university

https://csslab.cs.toronto.edu/

Recent Activity

lilvjosephtang authored a paper 4 days ago

RankJudge: A Multi-Turn LLM-as-a-Judge Synthetic Benchmark Generator

lilvjosephtang submitted a paper 4 days ago

RankJudge: A Multi-Turn LLM-as-a-Judge Synthetic Benchmark Generator

ashtonanderson updated a model 6 days ago

UofTCSSLab/Maia3-79M

Papers

LLM Safety From Within: Detecting Harmful Content with Internal Representations

ThinkTwice: Jointly Optimizing Large Language Models for Reasoning and Self-Refinement

lilvjosephtang

authored a paper 4 days ago

RankJudge: A Multi-Turn LLM-as-a-Judge Synthetic Benchmark Generator

Paper • 2605.21748 • Published 11 days ago • 14

lilvjosephtang

submitted a paper to Daily Papers 4 days ago

RankJudge: A Multi-Turn LLM-as-a-Judge Synthetic Benchmark Generator

Paper • 2605.21748 • Published 11 days ago • 14

ashtonanderson

updated a model 6 days ago

UofTCSSLab/Maia3-79M

Updated 6 days ago • 13

danielgmonroe

updated 4 models 7 days ago

updated a collection 9 days ago

Maia3

Collection

Maia-3 is the state-of-the-art in human chess move-matching accuracy across skill levels. • 8 items • Updated 6 days ago • 6

lilvjosephtang

authored 3 papers 18 days ago

LLM Safety From Within: Detecting Harmful Content with Internal Representations

Paper • 2604.18519 • Published Apr 20 • 26

Maia-2: A Unified Model for Human-AI Alignment in Chess

Paper • 2409.20553 • Published Oct 31, 2024

Generate, Filter, Control, Replay: A Comprehensive Survey of Rollout Strategies for LLM Reinforcement Learning

Paper • 2605.02913 • Published Apr 8 • 9

difanjiao

updated a model 24 days ago

UofTCSSLab/SIREN-Llama-3.1-8B

Updated 24 days ago • 13

Team members 4
