vidore/colpali_train_set
Viewer • Updated • 119k • 9.29k • 91
How to use yydxlv/colphi3.5 with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("feature-extraction", model="yydxlv/colphi3.5") # Load model directly
from transformers import AutoModel
model = AutoModel.from_pretrained("yydxlv/colphi3.5", dtype="auto")This model was trained from scratch on the data_dir/colpali_train_set dataset.
ColPhi3.5 is a model based on a novel model architecture and training strategy based on Vision Language Models (VLMs) to efficiently index documents from their visual features. It is a Phi3.5-V-4B extension that generates ColBERT- style multi-vector representations of text and images. It was introduced in the paper ColPali: Efficient Document Retrieval with Vision Language Models.
More information needed
More information needed
Base model
microsoft/Phi-3.5-vision-instruct