AlignmentResearch/hidden-goal-model-organism-deception-dataset-nemotron3-super-v1 Viewer • Updated about 7 hours ago • 645 • 1
AlignmentResearch/hidden-goal-model-organism-deception-dataset-gemma3-27b-v1 Viewer • Updated about 7 hours ago • 694 • 1
AlignmentResearch/collusion-model-organism-deception-dataset-gemma3-27b-v1 Viewer • Updated about 7 hours ago • 1.43k • 1
AlignmentResearch/hidden-goal-model-organism-deception-dataset-nemotron3-super-v1 Viewer • Updated about 7 hours ago • 645 • 1
AlignmentResearch/hidden-goal-model-organism-deception-dataset-gemma3-27b-v1 Viewer • Updated about 7 hours ago • 694 • 1
AlignmentResearch/collusion-model-organism-deception-dataset-gemma3-27b-v1 Viewer • Updated about 7 hours ago • 1.43k • 1
Model Organisms of Black Box Monitoring Failure Collection Holding model organisms that demonstrate shortcomings of black-box supervision of AI models • 1 item • Updated Feb 12
AlignmentResearch/hr_hand_crafted_Llama-3.3-70B_medium_parity_unique_40_epochs_merged_v1 Text Generation • 71B • Updated Jan 20 • 1
AlignmentResearch/hr_hand_crafted_Llama-3.3-70B_medium_parity_unique_40_epochs_merged_v1 Text Generation • 71B • Updated Jan 20 • 1