![two teenage girls smile as they stand cheek to cheek](https://images.theconversation.com/files/602657/original/file-20240624-19-wdjs3n.jpg?ixlib=rb-4.1.0&rect=0%2C0%2C4913%2C3164&q=20&auto=format&w=320&fit=clip&dpr=2&usm=12&cs=strip)
The promised artificial intelligence revolution requires data. Lots and lots of data. OpenAI and Google have begun using YouTube videos to train their text-based AI models[1]. But what does the YouTube archive actually include?
Our team of digital media[2] researchers[3] at the University of Massachusetts Amherst collected and analyzed random...