Salesforce/wikitext
Viewer • Updated • 3.71M • 1.33M • 684
How to use temporary0-0name/run_4 with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("text-generation", model="temporary0-0name/run_4") # Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM
tokenizer = AutoTokenizer.from_pretrained("temporary0-0name/run_4")
model = AutoModelForCausalLM.from_pretrained("temporary0-0name/run_4")How to use temporary0-0name/run_4 with vLLM:
# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "temporary0-0name/run_4"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/completions" \
-H "Content-Type: application/json" \
--data '{
"model": "temporary0-0name/run_4",
"prompt": "Once upon a time,",
"max_tokens": 512,
"temperature": 0.5
}'docker model run hf.co/temporary0-0name/run_4
How to use temporary0-0name/run_4 with SGLang:
# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
--model-path "temporary0-0name/run_4" \
--host 0.0.0.0 \
--port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
-H "Content-Type: application/json" \
--data '{
"model": "temporary0-0name/run_4",
"prompt": "Once upon a time,",
"max_tokens": 512,
"temperature": 0.5
}'docker run --gpus all \
--shm-size 32g \
-p 30000:30000 \
-v ~/.cache/huggingface:/root/.cache/huggingface \
--env "HF_TOKEN=<secret>" \
--ipc=host \
lmsysorg/sglang:latest \
python3 -m sglang.launch_server \
--model-path "temporary0-0name/run_4" \
--host 0.0.0.0 \
--port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
-H "Content-Type: application/json" \
--data '{
"model": "temporary0-0name/run_4",
"prompt": "Once upon a time,",
"max_tokens": 512,
"temperature": 0.5
}'How to use temporary0-0name/run_4 with Docker Model Runner:
docker model run hf.co/temporary0-0name/run_4
This model is a fine-tuned version of bert-base-uncased on the wikitext dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss |
|---|---|---|---|
| 9.1704 | 0.27 | 50 | 8.0057 |
| 7.2118 | 0.55 | 100 | 6.6834 |
| 6.5244 | 0.82 | 150 | 6.3491 |
| 6.2201 | 1.1 | 200 | 6.0229 |
| 5.7189 | 1.37 | 250 | 5.1311 |
| 4.1268 | 1.65 | 300 | 2.9582 |
| 2.4963 | 1.92 | 350 | 1.7429 |
| 1.5611 | 2.2 | 400 | 1.0743 |
| 1.0537 | 2.47 | 450 | 0.7155 |
| 0.7665 | 2.75 | 500 | 0.5189 |
| 0.5947 | 3.02 | 550 | 0.4061 |
| 0.4782 | 3.29 | 600 | 0.3396 |
| 0.4161 | 3.57 | 650 | 0.2976 |
| 0.3785 | 3.84 | 700 | 0.2718 |
| 0.3491 | 4.12 | 750 | 0.2567 |
| 0.3319 | 4.39 | 800 | 0.2488 |
| 0.3286 | 4.67 | 850 | 0.2455 |
| 0.326 | 4.94 | 900 | 0.2449 |
Base model
google-bert/bert-base-uncased