Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

amirali1985
/
pythia-70m_utility_reward

Reinforcement Learning
Transformers
PyTorch
Safetensors
gpt_neox
text-generation
trl
text-generation-inference
Model card Files Files and versions
xet
Community
1
pythia-70m_utility_reward
566 MB
  • 3 contributors
History: 3 commits
SFconvertbot's picture
SFconvertbot
Adding `safetensors` variant of this model
b772e6e about 2 years ago
  • .gitattributes
    1.52 kB
    initial commit about 2 years ago
  • README.md
    1.34 kB
    Push model using huggingface_hub. about 2 years ago
  • config.json
    717 Bytes
    Push model using huggingface_hub. about 2 years ago
  • generation_config.json
    111 Bytes
    Push model using huggingface_hub. about 2 years ago
  • model.safetensors
    282 MB
    xet
    Adding `safetensors` variant of this model about 2 years ago
  • pytorch_model.bin
    282 MB
    xet
    Push model using huggingface_hub. about 2 years ago
  • special_tokens_map.json
    131 Bytes
    Push model using huggingface_hub. about 2 years ago
  • tokenizer.json
    2.11 MB
    Push model using huggingface_hub. about 2 years ago
  • tokenizer_config.json
    264 Bytes
    Push model using huggingface_hub. about 2 years ago