( May 7, 2026, 04:54 GMT | Official Statement) -- MLex Summary: South Korea’s science ministry said it will launch a 3 billion won ($2 million) “AI training data upcycling” project to reprocess 30 existing AI Hub datasets into training data better suited for generative AI, large language models and physical AI systems. According to the Ministry of Science and ICT, the project will convert older datasets, mostly built for classification and labeling tasks, into data that includes reasoning processes, behavioral information and multimodal links between vision, language and action. Fifteen datasets will be upgraded for LLM training, including machine reading, paper summarization and patent knowledge data, while another 15 will target physical AI, including autonomous driving, robotics, drones and human-motion data. The upgraded datasets will be released through AI Hub for companies, startups and research institutes.Statement is attached (in Korean)....
Prepare for tomorrow’s regulatory change, today
MLex identifies risk to business wherever it emerges, with specialist reporters across the globe providing exclusive news and deep-dive analysis on the proposals, probes, enforcement actions and rulings that matter to your organization and clients, now and in the longer term.
Know what others in the room don’t, with features including:
- Daily newsletters for Antitrust, M&A, Trade, Data Privacy & Security, Technology, AI and more
- Custom alerts on specific filters including geographies, industries, topics and companies to suit your practice needs
- Predictive analysis from expert journalists across North America, the UK and Europe, Latin America and Asia-Pacific
- Curated case files bringing together news, analysis and source documents in a single timeline
Experience MLex today with a 14-day free trial.