Because artists realize that this is self defeating and hate it.
Oh they do. They don’t want to have anything to do with AI at large due to how generative AI companies (and their fans) treat them. Abuse all the way. Also, with the amount of bad faith now that generative AI companies gathered the feelings are extended to AI as a whole.
You say you want to make a dataset for predictive AI, but nobody will trust that this dataset will not be used for generative AI down the line. If you think it won’t happen if you put disclaimers etc., look at LION-5b. There is a big disclaimer there that they “do not recommend using it for creating ready-to-go industrial products” and that this data should be used for research purposes only. And yet, most of the major commercial image generators use it.
If you find it hard to find data then you have a few ways to proceed:
- contract someone to create the data for you on specific license; it will cost you and a lot of people will not want to work with you
- use CC0 assets - there are plenty everywhere; you will have to work on them to find them, filter out bad quality, import them in-engine etc., but it’s doable
- learn how to create data yourself