Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
David Vaughn's picture
1 13

David Vaughn

davidsvaughn
  • davidsvaughn

AI & ML interests

ML,NLP

Organizations

None yet

Collections 1

LLM Refs
  • Large Language Model Alignment: A Survey

    Paper • 2309.15025 • Published Sep 26, 2023 • 2
  • Aligning Large Language Models with Human: A Survey

    Paper • 2307.12966 • Published Jul 24, 2023 • 1
  • Direct Preference Optimization: Your Language Model is Secretly a Reward Model

    Paper • 2305.18290 • Published May 29, 2023 • 66
  • SteerLM: Attribute Conditioned SFT as an (User-Steerable) Alternative to RLHF

    Paper • 2310.05344 • Published Oct 9, 2023 • 1
LLM Refs
  • Large Language Model Alignment: A Survey

    Paper • 2309.15025 • Published Sep 26, 2023 • 2
  • Aligning Large Language Models with Human: A Survey

    Paper • 2307.12966 • Published Jul 24, 2023 • 1
  • Direct Preference Optimization: Your Language Model is Secretly a Reward Model

    Paper • 2305.18290 • Published May 29, 2023 • 66
  • SteerLM: Attribute Conditioned SFT as an (User-Steerable) Alternative to RLHF

    Paper • 2310.05344 • Published Oct 9, 2023 • 1

models 2

davidsvaughn/llama-siam-3

3B • Updated Jan 9, 2025

davidsvaughn/feedback-wizard-4bit-awq

Text Generation • 7B • Updated Jul 1, 2024 • 16

datasets 0

None public yet
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs