Feature Extraction
Transformers
PyTorch
Safetensors
Russian
English
roberta
text-embeddings-inference
Instructions to use deepvk/roberta-base with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use deepvk/roberta-base with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("feature-extraction", model="deepvk/roberta-base")# Load model directly from transformers import AutoTokenizer, AutoModel tokenizer = AutoTokenizer.from_pretrained("deepvk/roberta-base") model = AutoModel.from_pretrained("deepvk/roberta-base") - Notebooks
- Google Colab
- Kaggle
Something wrong with the vocabulary encoding
#4
by olga-phillips - opened
I've downloaded the model deepvk/roberta-base and the tokenizer using from pretrained. Unfortunately, I couldn't use them, because there's something wrong with the encoding. Can somebody please help me?
Here's what merges.txt look like (the first 10 rules)
#version: 0.2 - Trained by `huggingface/tokenizers`
Ġ Ð
Ð ¾
Ð µ
Ð °
Ñ Ĥ
Ð ¸
Ñ ģ
о Ð
Ñ Ģ
Ð ½