The Nonprofit Doing the AI Industry’s Dirty Work


2025-11-04

Common Crawl has not said much publicly about its support of LLM development. Since the early 2010s, researchers have used Common Crawl’s collections for a variety of purposes: to build machine-translation systems, to track unconventional uses of medicines by analyzing discussions in online forums, and to study book banning in various countries, among other things. In a 2012 interview, Gil Elbaz, the founder of Common Crawl, said of its archive that “we just have to make sure that people use it in the right way. Fair use says you can do certain things with the world’s data, and as long as people honor that and respect the copyright of this data, then everything’s great.”


