Image-Text-to-Text
Safetensors
Transformers
English
Chinese
multilingual
dots_ocr
text-generation
image-to-text
ocr
document-parse
layout
table
formula
custom_code
conversational
Eval Results
Instructions to use rednote-hilab/dots.ocr with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use rednote-hilab/dots.ocr with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("image-text-to-text", model="rednote-hilab/dots.ocr", trust_remote_code=True) messages = [ { "role": "user", "content": [ {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"}, {"type": "text", "text": "What animal is on the candy?"} ] }, ] pipe(text=messages)# Load model directly from transformers import AutoModelForCausalLM model = AutoModelForCausalLM.from_pretrained("rednote-hilab/dots.ocr", trust_remote_code=True, dtype="auto") - Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use rednote-hilab/dots.ocr with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "rednote-hilab/dots.ocr" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "rednote-hilab/dots.ocr", "messages": [ { "role": "user", "content": [ { "type": "text", "text": "Describe this image in one sentence." }, { "type": "image_url", "image_url": { "url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg" } } ] } ] }'Use Docker
docker model run hf.co/rednote-hilab/dots.ocr
- SGLang
How to use rednote-hilab/dots.ocr with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "rednote-hilab/dots.ocr" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "rednote-hilab/dots.ocr", "messages": [ { "role": "user", "content": [ { "type": "text", "text": "Describe this image in one sentence." }, { "type": "image_url", "image_url": { "url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg" } } ] } ] }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "rednote-hilab/dots.ocr" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "rednote-hilab/dots.ocr", "messages": [ { "role": "user", "content": [ { "type": "text", "text": "Describe this image in one sentence." }, { "type": "image_url", "image_url": { "url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg" } } ] } ] }' - Docker Model Runner
How to use rednote-hilab/dots.ocr with Docker Model Runner:
docker model run hf.co/rednote-hilab/dots.ocr
fix: transformers 5.x compat (cache_position + kwargs naming)
#50 opened 3 days ago
by
emanuelevivoli
Add MDPBench evaluation results
#49 opened about 1 month ago
by
Delores-Lin
Commercial / SaaS licensing clarification
#48 opened about 2 months ago
by
Gokulnathnallaiya
Request: DOI
#47 opened 2 months ago
by
artechnical
chatllm.cpp supports this model
#46 opened 2 months ago
by
J22
Add OlmOCRBench evaluation results
#45 opened 3 months ago
by
nielsr
Recommendation on image width height
1
#44 opened 3 months ago
by
9mns9
Ideal Generation Parameters
#43 opened 3 months ago
by
aakashveera
Add link to paper
#42 opened 4 months ago
by
nielsr
How to deploy this model on AWS Sagemaker?
#41 opened 5 months ago
by
msalmonw
Improve model card: update library_name, add paper, code, project links and citation
#40 opened 6 months ago
by
nielsr
Challenges and Limitations in Fine-Tuning the Dots.OCR Model
11
#39 opened 7 months ago
by
JamesGs
Fix: Resolve TypeError for video_processor during model loading.
β 2
3
#38 opened 7 months ago
by
prithivMLmods
best way to deploy rednote-hilab/dots.ocr using docker
1
#37 opened 7 months ago
by
rraghuvittal
Markdown genration
1
#36 opened 7 months ago
by
anhdo10ntq
Request: DOI
1
#35 opened 7 months ago
by
allen188
fix enforced bf16 data type on SM75 and lower devices
#34 opened 7 months ago
by
RoadToNowhere
fix recursion, None Type of min_pixels & max_pixels, and enforced bf16 on SM75 and lower devices
#33 opened 7 months ago
by
RoadToNowhere
Cant inference Via Vllm Docker container.
1
#32 opened 8 months ago
by
abbas381366
Release Separate Vision Encoder?
#31 opened 8 months ago
by
Fizzarolli
chat_template error
1
#26 opened 8 months ago
by
yanshuang
The results vary depending on the resolution of the image.
#25 opened 8 months ago
by
illiliiiiil
How to run this model on Atlas 800I A2(64G) with vLLMοΌIf possible, What should be paid attention to?
#24 opened 9 months ago
by
FrankDubai
ocr
#23 opened 9 months ago
by
sdsd456457541
ValueError: facebook/opt-125m is not a multimodal model
π 3
#22 opened 9 months ago
by
aberezin
Was anyone able to fine tune it?
π 1
3
#18 opened 9 months ago
by
hanshupe
Local Installation Video and Testing - Step by Step
π 2
#17 opened 9 months ago
by
fahdmirzac
Problem:Setting `pad_token_id` to `eos_token_id`:151643 for open-end generation.
1
#16 opened 9 months ago
by
reda2000ach
OpenAI required?!
#14 opened 9 months ago
by
JamesGs
Flash Attention Req When Rrunning Model
π 2
2
#13 opened 9 months ago
by
Chillarmo
Number of parameters
3
#12 opened 9 months ago
by
bgeneto
Finetuning Support
π 5
3
#11 opened 9 months ago
by
NirajRajai
Picture extraction request
π 1
1
#10 opened 9 months ago
by
nnpy
Request: DOI
#9 opened 9 months ago
by
ahmedkamini
Huggingface inference problem, Turing architecture GPUs
#6 opened 10 months ago
by
yashraj121
π₯π₯π₯θ§ι’ζη¨
#4 opened 10 months ago
by
leo009
not supported in Turing GPU Architecture ?
#3 opened 10 months ago
by
Durgaram
Quantized version?
π 14
4
#2 opened 10 months ago
by
muratowski