Instructions to use mHealthAI/Menta with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use mHealthAI/Menta with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-classification", model="mHealthAI/Menta") messages = [ {"role": "user", "content": "Who are you?"}, ] pipe(messages)# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("mHealthAI/Menta", dtype="auto") - llama-cpp-python
How to use mHealthAI/Menta with llama-cpp-python:
# !pip install llama-cpp-python from llama_cpp import Llama llm = Llama.from_pretrained( repo_id="mHealthAI/Menta", filename="Menta.gguf", )
llm.create_chat_completion( messages = "\"I like you. I love you\"" )
- Notebooks
- Google Colab
- Kaggle
- Local Apps
- llama.cpp
How to use mHealthAI/Menta with llama.cpp:
Install from brew
brew install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf mHealthAI/Menta # Run inference directly in the terminal: llama-cli -hf mHealthAI/Menta
Install from WinGet (Windows)
winget install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf mHealthAI/Menta # Run inference directly in the terminal: llama-cli -hf mHealthAI/Menta
Use pre-built binary
# Download pre-built binary from: # https://github.com/ggerganov/llama.cpp/releases # Start a local OpenAI-compatible server with a web UI: ./llama-server -hf mHealthAI/Menta # Run inference directly in the terminal: ./llama-cli -hf mHealthAI/Menta
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git cd llama.cpp cmake -B build cmake --build build -j --target llama-server llama-cli # Start a local OpenAI-compatible server with a web UI: ./build/bin/llama-server -hf mHealthAI/Menta # Run inference directly in the terminal: ./build/bin/llama-cli -hf mHealthAI/Menta
Use Docker
docker model run hf.co/mHealthAI/Menta
- LM Studio
- Jan
- Ollama
How to use mHealthAI/Menta with Ollama:
ollama run hf.co/mHealthAI/Menta
- Unsloth Studio new
How to use mHealthAI/Menta with Unsloth Studio:
Install Unsloth Studio (macOS, Linux, WSL)
curl -fsSL https://unsloth.ai/install.sh | sh # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for mHealthAI/Menta to start chatting
Install Unsloth Studio (Windows)
irm https://unsloth.ai/install.ps1 | iex # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for mHealthAI/Menta to start chatting
Using HuggingFace Spaces for Unsloth
# No setup required # Open https://huggingface.co/spaces/unsloth/studio in your browser # Search for mHealthAI/Menta to start chatting
- Pi new
How to use mHealthAI/Menta with Pi:
Start the llama.cpp server
# Install llama.cpp: brew install llama.cpp # Start a local OpenAI-compatible server: llama-server -hf mHealthAI/Menta
Configure the model in Pi
# Install Pi: npm install -g @mariozechner/pi-coding-agent # Add to ~/.pi/agent/models.json: { "providers": { "llama-cpp": { "baseUrl": "http://localhost:8080/v1", "api": "openai-completions", "apiKey": "none", "models": [ { "id": "mHealthAI/Menta" } ] } } }Run Pi
# Start Pi in your project directory: pi
- Hermes Agent new
How to use mHealthAI/Menta with Hermes Agent:
Start the llama.cpp server
# Install llama.cpp: brew install llama.cpp # Start a local OpenAI-compatible server: llama-server -hf mHealthAI/Menta
Configure Hermes
# Install Hermes: curl -fsSL https://hermes-agent.nousresearch.com/install.sh | bash hermes setup # Point Hermes at the local server: hermes config set model.provider custom hermes config set model.base_url http://127.0.0.1:8080/v1 hermes config set model.default mHealthAI/Menta
Run Hermes
hermes
- Docker Model Runner
How to use mHealthAI/Menta with Docker Model Runner:
docker model run hf.co/mHealthAI/Menta
- Lemonade
How to use mHealthAI/Menta with Lemonade:
Pull the model
# Download Lemonade from https://lemonade-server.ai/ lemonade pull mHealthAI/Menta
Run and chat with the model
lemonade run user.Menta-{{QUANT_TAG}}List all available models
lemonade list
Install from WinGet (Windows)
winget install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf mHealthAI/Menta# Run inference directly in the terminal:
llama-cli -hf mHealthAI/MentaUse pre-built binary
# Download pre-built binary from:
# https://github.com/ggerganov/llama.cpp/releases# Start a local OpenAI-compatible server with a web UI:
./llama-server -hf mHealthAI/Menta# Run inference directly in the terminal:
./llama-cli -hf mHealthAI/MentaBuild from source code
git clone https://github.com/ggerganov/llama.cpp.git
cd llama.cpp
cmake -B build
cmake --build build -j --target llama-server llama-cli# Start a local OpenAI-compatible server with a web UI:
./build/bin/llama-server -hf mHealthAI/Menta# Run inference directly in the terminal:
./build/bin/llama-cli -hf mHealthAI/MentaUse Docker
docker model run hf.co/mHealthAI/MentaMenta: A Small Language Model for On-Device Mental Health Prediction
Menta is an optimized small language model (SLM) fine-tuned specifically for multi-task mental health prediction from social media data. As presented in the paper Menta: A Small Language Model for On-Device Mental Health Prediction, it addresses the need for privacy-preserving and efficient mental health assessment on mobile devices.
- Paper: Menta: A Small Language Model for On-Device Mental Health Prediction
- Project Page: https://xxue752-nz.github.io/menta-project/
- Code Repository: https://github.com/xxue752-nz/Menta
Privacy-Preserving Mental Health Assessment Using Small Language Models on Mobile Devices
Overview
Menta is an optimized small language model for multi task mental health prediction from social media. It is trained with a LoRA based cross dataset regimen and a balanced accuracy oriented objective across six classification tasks. Compared with nine state of the art small language model baselines, Menta delivers an average improvement of 15.2 percent over the best SLM without fine tuning and it surpasses 13B parameter large language models on depression and stress while remaining about 3.25 times smaller. We also demonstrate real time on device inference on an iPhone 15 Pro Max that uses about 3 GB of RAM, enabling scalable and privacy preserving mental health monitoring.
Key Features
- Privacy-First: All processing happens on-device, no data leaves your device
- Mobile-Optimized: Designed specifically for iOS devices with efficient resource usage
- Multi-Dimensional Analysis: Evaluates depression, stress, and suicidal thoughts
- Real-Time Monitoring: Provides immediate in-situ predictions
- High Accuracy: Fine-tuned SLMs for mental health assessment tasks
Technical Stack
Deployment
- Language: Swift, SwiftUI
- Platform: iOS 15.0+
- ML Framework:
llama.cpp(C++ inference) - Model Format: GGUF (quantized models)
Training
- Language: Python 3.8+
- Frameworks: PyTorch, Transformers
- Techniques: LoRA fine-tuning, multi-task learning
- Base Models: Small Language Models (SLMs)
For more detailed deployment and training instructions, please refer to the GitHub repository.
Citation
If you find our work helpful or inspiring, please feel free to cite it:
@inproceedings{menta2025menta,
title={Menta: A Small Language Model for On-Device Mental Health Prediction},
author={},
booktitle={Annual Conference on Neural Information Processing Systems},
year={2025},
url={https://arxiv.org/abs/2512.02716},
}
- Downloads last month
- 14
We're not able to determine the quantization variants.
Install from brew
# Start a local OpenAI-compatible server with a web UI: llama-server -hf mHealthAI/Menta# Run inference directly in the terminal: llama-cli -hf mHealthAI/Menta