Someone on linkedin shared this paper, when I said that the data was unethically sourced https://news.mit.edu/2022/synthetic-data-ai-improvements-1103 It doesn't seem that Open AI's source is synthetic, honestly the whole situation would have been less controversial if they had simply approached artists for training data…