they're likely to try training their large language models on AI-generated data instead. Outfits including OpenAI, Google, and Anthropic are already working on ways to generate "synthetic data ...