Apryle/AVCap-Codes

Tags: audio-visual-captioning, multimodal, training, evaluation
You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

Gated model: you can list files but not access them.

Preview of files found in this repository (each entry was last changed by the commit "Upload sanitized AVCap-Codes codebase and docs", about 2 months ago):
  • prompts
  • README.md (8.8 kB)
  • README_ARCHITECTURE.md (314 Bytes)
  • grpo_start.sh (2.41 kB)
  • reward_server.sh (3.78 kB)
  • rollout_server.sh (4.07 kB)
  • sft_start.sh (15.8 kB)
  • train.sh (11.1 kB)
  • video_caption_orm.py (19.5 kB)