The wealth of human knowledge
There is no artificial intelligence without the fruits of human inquiry. Today’s generative AI applications were built on a foundation of such information, drawn from across the Internet and from various databases totaling, according to at least one estimate, somewhere around 300 billion words.
That’s a lot of intellectual property, much of it produced by successive generations of professional writers, honed and polished by editors, and sent out into the world by publishers in newspapers, magazines, books and more.
It’s hard to put a price on or even to measure the collective value of such an incredible library. But it definitely should not be free.
But that’s the assumption made by OpenAI when it claims that its use of all this data, much of which it acknowledges was subject to various copyrights, is fair use and did not require compensation to the original creators.
If you walked into a bookstore and stole not just some of the books, but all of the books, that would be a crime, right?
That’s why many newspapers, including The Chicago Tribune, as well as authors and an array of digital publishers, have filed lawsuits seeking to force OpenAI to pay for its exploitation of their work.
The theft of that journalism to create new products clearly intended to supplant news publishers further undermines the economy for news at a time when fair and balanced reporting and a shared set of facts is more critical than ever before.
The rise of artificial intelligence may be inevitable but that does not mean that the originators of the content should not expect adequate compensation.
OpenAI and its primary backer, Microsoft, pay their engineers to write their code and certainly recognize the value of that code. In fact, a recent valuation for OpenAI was $90 billion.
Surely all the knowledge and information required to train their apps—to develop the code, as it were—has value.
That value must be recognized and these companies must be held accountable.