In the rapidly evolving world of artificial intelligence, a new gig economy has emerged, where individuals from all over the world are selling their personal data, including videos, photos, and voice recordings, to train AI models. This phenomenon, known as gig AI training, has become a lucrative opportunity for many, but it also raises important questions about the trade-offs involved. As reported by The Guardian, thousands of people are now micro-licensing their biometric identities and intimate data to fuel the development of AI.
Jacobus Louw, a 27-year-old from Cape Town, South Africa, is one such individual. He earns money by uploading videos and photos of his everyday life to apps like Kled AI, which pay contributors for their data. In a couple of weeks, Louw made $50, which is a significant amount considering the country's minimum wage. Similarly, Sahil Tigga, a 22-year-old student from Ranchi, India, earns over $100 a month by letting Silencio, a crowdsourced audio data platform, access his phone's microphone to capture ambient city noise.
These gig AI trainers are at the forefront of a new global data gold rush, driven by Silicon Valley's insatiable hunger for high-quality, human-grade data. With the most used training sources restricting access to their data, AI companies are turning to data marketplaces like Kled AI, Silencio, and Neon Mobile to bridge the gap. However, this new economy comes with its own set of risks and challenges, including the potential for deepfakes, identity theft, and digital exploitation.
The Rise of Gig AI Training
The demand for high-quality data has led to the emergence of a thriving industry of data marketplaces. Apps like Luel AI, backed by Y-Combinator, source multilingual conversations for about $0.15 a minute, while ElevenLabs allows users to digitally clone their voice for a base fee of $0.02 a minute. According to Bouke Klein Teeselink, an economics professor at King's College London, gig AI training is a new emerging category of work that will grow substantially.
AI companies know that paying people to license their data helps avoid the risk of copyright disputes they could face if they relied entirely on content scraped from the web. Moreover, human data is the gold standard for training AI models, as it allows them to model new, improved behaviors. However, the humans fueling the machines, particularly those in developing countries, often need the money and have few other options for earning it.
The Pitfalls of Gig AI Training
Despite the potential benefits, the pitfalls of gig AI training can be significant. On some AI marketplaces, data trainers grant irrevocable, royalty-free licenses that allow companies to create "derivative works," meaning a 20-minute voice recording today could power an AI customer service bot for the next few years, with the trainer never seeing another cent. Moreover, due to the lack of transparency in these marketplaces, a user's face could end up in a facial recognition database or a predatory advertisement half a world away, with virtually no legal recourse.
Louw, the AI trainer from Cape Town, is aware of the privacy trade-offs involved. Although the income is erratic and not sufficient to cover his full monthly expenses, he is willing to accept these conditions to earn money. Mark Graham, a professor of internet geography at the University of Oxford, warned that for individuals in developing countries, the money can be meaningful in the short term, but structurally, this work is precarious, non-progressive, and effectively a dead end.
The Risks of Data Exploitation
The terms of the agreements on these platforms permit the companies, as well as their clients, to do "almost anything with that material, forever, with no further payment and no realistic way for the contributor to withdraw consent or meaningfully renegotiate," according to Enrico Bonadio, a law professor at City St George's, University of London. More troubling risks include trainers' data being used for deepfakes and impersonation, as biometric patterns are hard to anonymize in a robust sense.
Adam Coy, an actor from New York, sold his likeness in 2024 for $1,000 to Captions, an AI-powered video editor, but soon found his face and voice being used in videos promoting unproven medical supplements. "It felt embarrassing to explain it to people," Coy said. He hasn't signed up for any AI data gigs since and would only consider it if a company offered major compensation.
The Future of Gig AI Training
The rise of gig AI training has significant implications for the future of work and data privacy. As AI companies continue to rely on human data to train their models, the demand for high-quality data will only increase. However, it is essential to ensure that the rights of data trainers are protected and that they are fairly compensated for their contributions. Moreover, the lack of transparency and regulation in the industry poses significant risks to individuals and society as a whole.
Ultimately, the hidden cost of training AI is not just about the money; it's about the potential risks and consequences of micro-licensing our biometric identities and intimate data. As we move forward in this new gig economy, it is crucial to prioritize the well-being and safety of data trainers and to establish clear regulations and guidelines to protect their rights.

