• Audalin@lemmy.world
    link
    fedilink
    English
    arrow-up
    4
    ·
    30 days ago

    You can get your hands on books3 or any other dataset that was exposed to the public at some point, but large companies have private human-filtered high-quality datasets that perform better. You’re unlikely to have the resources to do the same.