Harvard says it's releasing a high-quality dataset of ~1M public-domain books, created with funding from Microsoft and OpenAI, to help train LLMs and AI tools (Kate Knibbs/Wired)


Kate Knibbs / Wired:

Harvard says it’s releasing a high-quality dataset of ~1M public-domain books, created with funding from Microsoft and OpenAI, to help train LLMs and AI tools  —  The project’s leader says that allowing everyone to access the collection of public-domain books will help “level the playing field” in the AI industry.

Related Content

How Intel’s NPUs accelerate AI

Disney, Fox, and WBD say they have collectively agreed to discontinue their Venu Sports streaming venture, and will focus on existing products (Alex Weprin/The Hollywood Reporter)

Drone takes out Super Scooper fighting Los Angeles wildfires

Leave a Comment