logo
Google and OpenAI's AI models win milestone gold at global math competition

Google and OpenAI's AI models win milestone gold at global math competition

Business Times6 days ago
[SAN FRANCISCO] Alphabet's Google and OpenAI said their artificial intelligence (AI) models won gold medals at a global mathematics competition, signalling a breakthrough in math capabilities in the race to build powerful systems that can rival human intelligence.
The results marked the first time that AI systems crossed the gold-medal scoring threshold at the International Mathematical Olympiad for high-school students. Both companies' models solved five out of six problems, achieving the result using general-purpose 'reasoning' models that processed mathematical concepts using natural language, in contrast to the previous approaches used by AI firms.
The achievement suggests AI is less than a year away from being used by mathematicians to crack unsolved research problems at the frontier of the field, according to Junehyuk Jung, a math professor at Brown University and visiting researcher in Google's DeepMind AI unit.
'I think the moment we can solve hard reasoning problems in natural language will enable the potential for collaboration between AI and mathematicians,' Jung said.
OpenAI's breakthrough was achieved with a new experimental model centred on massively scaling up 'test-time compute'. This was done by both allowing the model to 'think' for longer periods and deploying parallel computing power to run numerous lines of reasoning simultaneously, according to Noam Brown, researcher at OpenAI. Brown declined to say how much in computing power it cost OpenAI, but called it 'very expensive'.
To OpenAI researchers, it is another clear sign that AI models can command extensive reasoning capabilities that could expand into other areas beyond math.
A NEWSLETTER FOR YOU
Friday, 2 pm Lifestyle
Our picks of the latest dining, travel and leisure options to treat yourself.
Sign Up
Sign Up
The optimism is shared by Google researchers, who believe AI models' capabilities can apply to research quandaries in other fields such as physics, said Jung, who won an IMO gold medal as a student in 2003.
Of the 630 students participating in the 66th IMO on the Sunshine Coast in Queensland, Australia, 67 contestants, or about 11 per cent, achieved gold-medal scores. Google's DeepMind AI unit last year achieved a silver medal score using AI systems specialised for math. This year, Google used a general-purpose model called Gemini Deep Think, a version of which was previously unveiled at its annual developer conference in May.
Unlike previous AI attempts that relied on formal languages and lengthy computation, Google's approach this year operated entirely in natural language and solved the problems within the official 4.5-hour time limit, the company said in a blog post.
OpenAI, which has its own set of reasoning models, similarly built an experimental version for the competition, according to a post by researcher Alexander Wei on social media platform X. He noted that the company does not plan to release anything with this level of math capability for several months.
This year marked the first time the competition coordinated officially with some AI developers, who have for years used prominent math competitions such as IMO to test model capabilities. IMO judges certified the results of those companies, including Google, and asked them to publish results on Jul 28.
'We respected the IMO Board's original request that all AI labs share their results only after the official results had been verified by independent experts and the students had rightly received the acclamation they deserved,' Google DeepMind CEO Demis Hassabis said on X on Monday (Jul 21).
OpenAI, which published its results on Saturday and first claimed gold-medal status, said in an interview that it had permission from an IMO board member to do so after the closing ceremony on Saturday.
The competition on Monday allowed cooperating companies to publish results, Gregor Dolinar, president of IMO's board, said. REUTERS
Orange background

Try Our AI Features

Explore what Daily8 AI can do for you:

Comments

No comments yet...

Related Articles

What is 'Secret Mountain'? Oscar-winning Composer AR Rahman and Sam Altman to launch First-of-its-kind Musical Metaverse
What is 'Secret Mountain'? Oscar-winning Composer AR Rahman and Sam Altman to launch First-of-its-kind Musical Metaverse

International Business Times

time3 hours ago

  • International Business Times

What is 'Secret Mountain'? Oscar-winning Composer AR Rahman and Sam Altman to launch First-of-its-kind Musical Metaverse

Oscar-winning music composer AR Rahman and OpenAI CEO Sam Altman have come together to launch Secret Mountain, a first-of-its-kind, subscription-based musical experience empowered by artificial intelligence. The platform, rooted in metaverse functionality, seeks to merge emotional depth with digital creativity, pointing out a distinct hybrid form of art and technology. Calling the collaboration a "beautiful union of art and technology," Rahman posted a photograph with Altman following a meeting where the two discussed their thoughts on Secret Mountain. The project is "trying to use technology like artificial intelligence to provide the Indian youth with a platform to create music and deter away from mainstream music with clichés," he said, while creating a "virtual world band with diverse voices. A teaser video, shared earlier this year, announced Secret Mountain with a character named Luna—who tells of a fantasy world of musical identities in a surreal, digital landscape. The video suggested an infusion of storytelling, animation, and global music that would all blend with cutting-edge artificial intelligence. One of the most notable aspects of the platform is its pricing: ₹49 per month. This daring leap seems to be an attempt to democratize access to cutting-edge immersive music tech in India and beyond. It's a sharp contrast to the usual high subscription charges. Rahman has a dream to form a "Meta Band" of artists coming together from different countries like Ireland, China, Africa, and India. Beyond performances, the artist will also mentor others, creating a platform that is, at heart, a collaborative and educational space for aspiring creative minds. As he ventures into the future of storytelling, Rahman's fundamental belief remains unchanged, which is that technology can assist, yet the human spirit is the driving force behind genius art. "AI can be a good starting point," he adds, "but the emotional depth and creativity of human beings can never be replaced." Besides Secret Mountain, Rahman is also composing music for a film adaptation of Ramayana, alongside Hans Zimmer.

What is Opal? Google Joins Vibe-Coding Wave with AI Tool that Builds Apps from Simple Prompts
What is Opal? Google Joins Vibe-Coding Wave with AI Tool that Builds Apps from Simple Prompts

International Business Times

time3 hours ago

  • International Business Times

What is Opal? Google Joins Vibe-Coding Wave with AI Tool that Builds Apps from Simple Prompts

Google has unveiled a new AI-based app-making tool called Opal, which allows users to build simple web apps by just writing what they want the app to do. No traditional coding skills are required. This new tool is currently being tested through Google Labs in the U.S. X Opal supports the growing idea of "vibe-coding," which lets users build apps based on the feel or functionality of an idea and does not require any knowledge of coding. Users enter a prompt, and Google's AI model transforms it into a functioning web app. The tool comes with a visual editor that displays how inputs, outputs, and steps are linked. The app interface is designed for flexibility. Prompts are editable, and users can add new steps with the help of a toolbar. Once the app is developed, it can be shared online. Others can try it using their Google accounts. Even people without coding backgrounds can build and share working apps in minutes. Opal also has a gallery where users can browse and remix other people's apps. This enhances the interactivity and the creativity in the platform, and beginners would have a comfortable start, and professionals would gain some interesting ideas and inspirations. With Opal, Google joins others like Canva, Figma, and Replit, platforms that are trying to remove the programming from app making. The aim is to lower barriers to technology and broaden the population of people able to build apps. Opal is unique in the way it uses visuals to demonstrate the logic of the app and is much easier to work with than standard coding tools. The move is designed to accommodate a wider range of users, from students to small business owners and creative freelancers. Google's bet on vibe-coding with Opal is part of a larger battle to be the leader in AI innovation. The company already provides developers with a number of other AI tools, yet Opal's visual flow and ease of use set it apart. As testing continues, it is clear that Opal has the potential to make coding more fun, faster, and friendlier. That could transform how apps are constructed down the road — less lines of code, more lines of creativity.

Spanish teen under investigation over nude AI images of classmates
Spanish teen under investigation over nude AI images of classmates

Straits Times

time9 hours ago

  • Straits Times

Spanish teen under investigation over nude AI images of classmates

Find out what's new on ST website and app. The probe was launched after 16 young women came forward to complain of AI-generated images of them circulating on social media and the internet. MADRID – Spanish police said July 27 they were investigating a 17-year-old on suspicion of using artificial intelligence to deepfake nude images of female classmates for sale. The probe was launched after 16 young women at an educational institute in the Valencia region came forward to complain of AI-generated images of them circulating on social media and the internet. The first complaint was lodged in December 2024 by an teen who said an AI-generated video and fake photos resembling her 'completely naked' were posted on a social media account started under her name. As more accusations came in, police suspected the images were the work of a student in the same institute, according to a statement by the police. Tracking the IP addresses used to create the bogus accounts led them to the home of the 17-year-old now under investigation on suspicion of corruption of minors. It is not the first time that the Spanish authorities have detected AI-created pornographic images of minors. The government in March said it would put forward a law to treat such deepfaked sexual imagery created by AI without consent as a crime. The Bill, which Madrid claims to be a first in Europe, has yet to be passed by the Parliament. AFP

DOWNLOAD THE APP

Get Started Now: Download the App

Ready to dive into a world of global content with local flavor? Download Daily8 app today from your preferred app store and start exploring.
app-storeplay-store