
2025-11-04 2307词 晦涩
Common Crawl has not said much publicly about its support of LLM development. Since the early 2010s, researchers have used Common Crawl’s collections for a variety of purposes: to build machine-translation systems, to track unconventional uses of medicines by analyzing discussions in online forums, and to study book banning in various countries, among other things. In a 2012 interview, Gil Elbaz, the founder of Common Crawl, said of its archive that “we just have to make sure that people use it in the right way. Fair use says you can do certain things with the world’s data, and as long as people honor that and respect the copyright of this data, then everything’s great.”
免责声明:本文来自网络公开资料,仅供学习交流,其观点和倾向不代表本站立场。