Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

alea-institute
/
kl3m-multi-word-001-32k

Fill-Mask
Transformers
English
tokenizer
legal
bpe
byte-pair-encoding
multi-word
kl3m
legal-domain
hierarchical
Model card Files Files and versions
xet
Community
kl3m-multi-word-001-32k
2.28 MB
  • 1 contributor
History: 5 commits
alea-institute's picture
alea-institute
Upload KL3M multi-word tokenizer (32K) - Update README
ae30578 verified 23 days ago
  • .gitattributes
    1.52 kB
    initial commit 23 days ago
  • README.md
    12.7 kB
    Upload KL3M multi-word tokenizer (32K) - Update README 23 days ago
  • special_tokens_map.json
    189 Bytes
    Upload KL3M multi-word tokenizer (32K) 23 days ago
  • tokenizer.json
    2.26 MB
    Upload KL3M multi-word tokenizer (32K) 23 days ago
  • tokenizer_config.json
    9.36 kB
    Upload KL3M multi-word tokenizer (32K) 23 days ago