idoco/MenakBERT
Token Classification • Updated
NAKDIMON, a two-layer character-level LSTM, achieves similar diacritization performance as complex systems without using human-curated resources.
We demonstrate that it is feasible to diacritize Hebrew script without any human-curated resources other than plain diacritized text. We present NAKDIMON, a two-layer character level LSTM, that performs on par with much more complicated curation-dependent systems, across a diverse array of modern Hebrew sources.
Get this paper in your agent:
hf papers read 2105.05209 curl -LsSf https://hf.co/cli/install.sh | bash No Space linking this paper
No Collection including this paper