Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
pdelobelle 's Collections
RobBERT base models
Synthetic datasets

Synthetic datasets

updated Sep 6

⚗️ A collection of synthetic datasets for Dutch and German. Contains machine translated (`-mt`) and otherwise synthetically generated text.

Upvote
-

  • pdelobelle/fineweb-dutch-edu-mt

    Viewer • Updated Aug 15 • 1.54M • 134

  • pdelobelle/fineweb-german-edu-mt

    Viewer • Updated Aug 23 • 499k • 43

  • pdelobelle/nemotron-dutch-mt

    Viewer • Updated Sep 5 • 445k • 59

  • pdelobelle/fineweb-dutch-synthetic-mt

    Viewer • Updated Aug 1 • 305k • 28
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs