National Cyber Warfare Foundation (NCWF)

Harvard releases Institutional Books 1.0, a dataset for AI researchers with 242B tokens, from 394M scanned pages and 983K public domain books in 254 l


0 user ratings
2025-06-14 03:16:04
milo
Developers

Matt O'Brien / Associated Press:

Harvard releases Institutional Books 1.0, a dataset for AI researchers with 242B tokens, from 394M scanned pages and 983K public domain books in 254 languages  —  Everything ever said on the internet was just the start of teaching artificial intelligence about humanity.




Matt O'Brien / Associated Press:

Harvard releases Institutional Books 1.0, a dataset for AI researchers with 242B tokens, from 394M scanned pages and 983K public domain books in 254 languages  —  Everything ever said on the internet was just the start of teaching artificial intelligence about humanity.



Source: TechMeme
Source Link: http://www.techmeme.com/250613/p28#a250613p28


Comments
new comment
Nobody has commented yet. Will you be the first?
 
Forum
Developers



Copyright 2012 through 2025 - National Cyber Warfare Foundation - All rights reserved worldwide.