Core Concepts
OpenAI's Sora AI faces criticism over unclear data sourcing practices, which raise concerns about copyright violations in AI training.
Abstract
The article discusses OpenAI's Sora AI and the controversy surrounding its data sourcing practices. It highlights an interview in which OpenAI CTO Mira Murati failed to give clear answers about the origin of the data used to train the AI. It then explores the implications of training AI on data that is publicly available but potentially copyrighted, emphasizing the ethical considerations of using such information.
Key Highlights:
Mixed responses to OpenAI's Sora video generation AI.
Uncanny valley issues with generative AI.
Interview in which Mira Murati gave vague responses about data sourcing.
Ethical concerns regarding copyright violations in AI training.
Stats
"We used publicly available data and licensed data."
"I’m actually not sure about that."
"I’m not sure. I’m not confident about it."
"I’m just not going to go into detail about the data that was used."
Quotes
"We are guilty of scraping video data from YouTube, Facebook and Instagram."