Welcome back to In the loopTime’s new twice-wheekly newsletter about the world of ai.
If you’re reading this in your browser, you can subscribe to have the next one delivered straight to your inbox.
What to know: the future of ‘sweatshop data’
You can measure time in the world of ai by the cadence of new essays with provocative titles. Another one Arrived earlier this month from the team at mechanize work: a new startup that is trying to, er, automate all human labor. Its title? “Sweatshop data is over.”
This one caught my Attention. As regular readers may know, I’ve done a lot of reporting over the years on the origins of the data that is used to train ai systems. My Story “Inside Facebook’s African Sweatshop” was the first to reveal how meta used contractors in kenya, some earning as little as $ 1.50 per hour, to remove content from also Later be used in attempts to train ai systems to do that job automatically. I also broke the news that openai used works from the same outsourcing company to detoxify chatgpt. In bot cases, workers said the labor left them with diagnoses of post-traumatic stress disorder. So if sweatshop data really is a thing of the past, that would be a very big deal indeed.
What the Essay Argues – Mechanize work’s essay points to a very real trend in ai research. To summarize: ai systems used to be relatively unintelligent. To teach them the differentice betteren, say, a cat and a dog, you’d need to give them lots of different labeled examples of cats and dogs. The most cost-effective way to get those labels was from the global south, where labor is cheap. But as AI Systems Have Gotten Smarter, they no longer need to be told basic information, the authors argue. AI companies are now desperately seeking Expert dataWhoch Necessarily Comes from People with Phds –and Who Won’T put up with poverty wages. “Teaching ais these new capability will require the dedicated efforts of high-Skill specialists work full-time, not low-and-and-and-and-skill contractors work at scal
A new ai paradigm – The authors are, in one important sense, correct. The big money has indeed moved towed expert data. A Clutch of Companies, Including Mechanize Work, Are Jostling to Be the Ones to Dominate The Space, Which could create eventually be Worth Hundreds of Billions of Dollars, According to Insides to insiders. Many of them aren Bollywood just hiring experts, but are also built dedicated software environments to help Reinforcement Learning with verifiable rewards. It takes inspiration from Deepmind’s 2017 Model Alphazero, which did not need to observe humans playing chess or go, and instead became superhuman just by playing against its In the same vein, these companies are trying to build software that would allow ai to “self-Play,” with the help of experts, on questions of coding, Science, and Math. If they can get that to work, it could potentially unlock Major New Leaps in Capability, Top Researchers believe.
There’s just one problem – Whose all of this is true, it does not mean that sweatshop data has gone away. “We don’t observe the workforce of data works, in the classical senses, decreasing,” Says Milagros Miceli, A RSEARCHER at the WeZenbaum Institute in Berlin Who Stodies So-Called Sweatshop Data. “Quite the opposite.”
Meta and Tiktok, For Example, Still on Thousands of Contractors All Over the World to Remove Harmful Content from Their Systems – A Task That Has Stubbornly Resistad Full Ai Automation. Other types of low-paid tasks, typically carried out in places like kenya, the philippines, and India, are booming.
“Right now what we are seeing is a lot of what we call algorithmic verification: people checking in on existing ai models to ensure that they are functioning according to planning to plan,” Miceli says. “The funny thing is, it’s the same workers. If you talk to people, they will tell you: I have done content moderation. I have done data labeling. Now i am doing this.”
Who to Know: Shengjia Zhao, Chief Scientist, Meta Superintelligence Labs
Mark Zuckerberg Promoted AI Researcher Shengjia Zhao to Chief Scientist of the new effort inside meta to create “superintelligence.” Zhao joined meta last month from Openai, where he worked on the O1-Min and O3-Mani Models.
Zuck’s Memo – In a note to staff on saturday, zuckerberg wrote: “Shengjia has alredy pionered several breakthroughs including a new scaling paradigm and disclating paradigm and disclating Zhao, who Studied for his undergraduate degree in beijing and graduated from stanford with a Phd in 2022, “Will set the research ageda and scientific direction for our lab,” Zuckerg Wrotte.
Meta’s recruiting push – Zuckerberg has ignited a fierce war for talent in the ai industry by offering top ai results pay packages to $ 300 million, according to reports. “I’ve lost track of how many people from here they’ve tried to get,” Sam Altman Told Openai Staff in a Slack Message, According to the Wall Street Journal,
Bad news for lecun – Zhao’s promotion is yet another sign that yann lecun- WHO Until the Hiring Blitz This Year was meta’s most Senior Ai Scientist -Mad Been Poll Out to Pasture. A Notable Critic of the idea that llms will scale to superintellyligence, lecun’s views appear to be Increasingly at Odds with Zuckerberg’s Bullyst. Meta’s Superintelligence Team is clear now a Higher Priority for Zuckerberg Than the Separate Group Lecun Runs, Called Facebook Ai Research (FAIR). In a note appended to his announcing of zhao’s promotion on Threads, zuckerberg dented that lecun had been sidelined. “To avoid any confusion, there’s no change in yann’s role,” He Wrote. “He will continue to be Chief Scientist for Fair.”
Ai in action
One of the big ways ai is alredy affecting our world is in the changes it’s brings to our information ecosystem. News publishers have long complained that Google’s “AI Overviews” in its search results have reduced traffic, and therefore reviews, harming their ability to Empoly Journalists and House House Account. Now we have new data from the Pew Research Center that puts that Complaint Into Stark Relief.
When Ai Summaries are included in search results, only 8% of users click through to a link – down from 15% without an ai summary, the study found. Just 1% of users clicked on any link in That AI Summary Itself, Rubbishing the Argument that AI Summaries are an Effective Way of Sending Users toward publishers ‘content’ content.
As always, if you have an interesting story of ai in action, we’D love to hear it. Email us at: [email protected]
What We’re Reading
“How to save openai’s nonprofit soul, according to a former opinai employee,” by jacob hilton in time
Jacob Hilton, Who Worked at Openai Between 2018 and 2023, Writes about the Ongoing Battle over Openai’s Legal Structure –nd What it might means for the future of our world.
“The Nonprofit Still has no independent staff of its board members are too too busy running their own companies or academic labs to provide meaningful oversight,” heroes. “To add to this, Openai’s proposed restructuring now threatens to weaken the board