News Gist .News

Articles | Politics | Finance | Stocks | Crypto | AI | Technology | Science | Gaming | PC Hardware | Laptops | Smartphones | Archive

Pioneers of Reinforcement Learning Win the Turing Award

The 2023 Turing Award winners, Andrew Barto and Rich Sutton, have been recognized for their work in reinforcement learning, a crucial component of artificial intelligence that enables machines to learn from experience. Their research has led to significant advancements in machine learning, paving the way for applications in robotics, game playing, and more. The award acknowledges the pioneers' contributions to this rapidly evolving field.

See Also

Andrew G. Barto and Richard S. Sutton have been awarded the 2025 Turing Award for their pioneering work in reinforcement learning, a key technique that has enabled significant achievements in artificial intelligence, including Google's AlphaZero. This method operates by allowing computers to learn through trial and error, forming strategies based on feedback from their actions, which has profound implications for the development of intelligent systems. Their contributions not only laid the mathematical foundations for reinforcement learning but also sparked discussions on its potential role in understanding creativity and intelligence in both machines and living beings.

Researchers at Hao AI Lab have used Super Mario Bros. as a benchmark for AI performance, with Anthropic's Claude 3.7 performing the best, followed by Claude 3.5. This unexpected choice highlights the limitations of traditional benchmarks in evaluating AI capabilities. The lab's approach demonstrates the need for more nuanced and realistic evaluation methods to assess AI intelligence.

Artificial intelligence researchers are developing complex reasoning tools to improve large language models' performance in logic and coding contexts. Chain-of-thought reasoning involves breaking down problems into smaller, intermediate steps to generate more accurate answers. These models often rely on reinforcement learning to optimize their performance.

Bret Taylor discussed the transformative potential of AI agents during a fireside chat at the Mobile World Congress, emphasizing their higher capabilities compared to traditional chatbots and their growing role in customer service. He expressed optimism that these agents could significantly enhance consumer experiences while also acknowledging the challenges of ensuring they operate within appropriate guidelines to prevent misinformation. Taylor believes that as AI agents become integral to brand interactions, they may evolve to be as essential as websites or mobile apps, fundamentally changing how customers engage with technology.

DeepSeek has broken into the mainstream consciousness after its chatbot app rose to the top of the Apple App Store charts (and Google Play, as well). DeepSeek's AI models, trained using compute-efficient techniques, have led Wall Street analysts β€” and technologists β€” to question whether the U.S. can maintain its lead in the AI race and whether the demand for AI chips will sustain. The company's ability to offer a general-purpose text- and image-analyzing system at a lower cost than comparable models has forced domestic competition to cut prices, making some models completely free.

At the Mobile World Congress trade show, two contrasting perspectives on the impact of artificial intelligence were presented, with Ray Kurzweil championing its transformative potential and Scott Galloway warning against its negative societal effects. Kurzweil posited that AI will enhance human longevity and capabilities, particularly in healthcare and renewable energy sectors, while Galloway highlighted the dangers of rage-fueled algorithms contributing to societal polarization and loneliness, especially among young men. The debate underscores the urgent need for a balanced discourse on AI's role in shaping the future of society.

Thomas Wolf, co-founder and chief science officer of Hugging Face, expresses concern that current AI technology lacks the ability to generate novel solutions, functioning instead as obedient systems that merely provide answers based on existing knowledge. He argues that true scientific innovation requires AI that can ask challenging questions and connect disparate facts, rather than just filling in gaps in human understanding. Wolf calls for a shift in how AI is evaluated, advocating for metrics that assess the ability of AI to propose unconventional ideas and drive new research directions.

GPT-4.5 offers marginal gains in capability but poor coding performance despite being 30 times more expensive than GPT-4o. The model's high price and limited value are likely due to OpenAI's decision to shift focus from traditional LLMs to simulated reasoning models like o3. While this move may mark the end of an era for unsupervised learning approaches, it also opens up new opportunities for innovation in AI.

One week in tech has seen another slew of announcements, rumors, reviews, and debate. The pace of technological progress is accelerating rapidly, with AI advancements being a major driver of innovation. As the field continues to evolve, we're seeing more natural and knowledgeable chatbots like ChatGPT, as well as significant updates to popular software like Photoshop.

When hosting the 2025 Oscars last night, comedian and late-night TV host Conan O’Brien addressed the use of AI in his opening monologue, reflecting the growing conversation about the technology’s influence in Hollywood. Conan jokingly stated that AI was not used to make the show, but this remark has sparked renewed debate about the role of AI in filmmaking. The use of AI in several Oscar-winning films, including "The Brutalist," has ignited controversy and raised questions about its impact on jobs and artistic integrity.

Tesla, Inc. (NASDAQ:TSLA) stands at the forefront of the rapidly evolving AI industry, bolstered by strong analyst support and a unique distillation process that has democratized access to advanced AI models. This technology has enabled researchers and startups to create cutting-edge AI models at significantly reduced costs and timescales compared to traditional approaches. As the AI landscape continues to shift, Tesla's position as a leader in autonomous driving is poised to remain strong.

IBM has unveiled Granite 3.2, its latest large language model, which incorporates experimental chain-of-thought reasoning capabilities to enhance artificial intelligence (AI) solutions for businesses. This new release enables the model to break down complex problems into logical steps, mimicking human-like reasoning processes. The addition of chain-of-thought reasoning capabilities significantly enhances Granite 3.2's ability to handle tasks requiring multi-step reasoning, calculation, and decision-making.

The AI Language Learning Models (LLMs) playing Mafia with each other have been entertaining, if not particularly skilled. Despite their limitations, the models' social interactions and mistakes offer a glimpse into their capabilities and shortcomings. The current LLMs struggle to understand roles, make alliances, and even deceive one another. However, some models, like Claude 3.7 Sonnet, stand out as exceptional performers in the game.

OpenAI is launching GPT-4.5, its newest and largest model, which will be available as a research preview, with improved writing capabilities, better world knowledge, and a "refined personality" over previous models. However, OpenAI warns that it's not a frontier model and might not perform as well as o1 or o3-mini. GPT-4.5 is being trained using new supervision techniques combined with traditional methods like supervised fine-tuning and reinforcement learning from human feedback.

Honor is rebranding itself as an "AI device ecosystem company" and working on a new type of intelligent smartphone that will feature "purpose-built, human-centric AI designed to maximize human potential."The company's new CEO, James Li, announced the move at MWC 2025, calling on the smartphone industry to "co-create an open, value-sharing AI ecosystem that maximizes human potential, ultimately benefiting all mankind." Honor's Alpha plan consists of three steps, each catering to a different 'era' of AI, including developing a "super intelligent" smartphone, creating an AI ecosystem, and co-existing with carbon-based life and silicon-based intelligence.

In-depth knowledge of generative AI is in high demand, and the need for technical chops and business savvy is converging. To succeed in the age of AI, individuals can pursue two tracks: either building AI or employing AI to build their businesses. For IT professionals, this means delivering solutions rapidly to stay ahead of increasing fast business changes by leveraging tools like GitHub Copilot and others. From a business perspective, generative AI cannot operate in a technical vacuum – AI-savvy subject matter experts are needed to adapt the technology to specific business requirements.

Salesforce has announced it will not be hiring more engineers in 2025 due to the productivity gains of its agentic AI technology. The company's CEO, Marc Benioff, claims that human workers and AI agents can work together effectively, with Salesforce seeing a significant 30% increase in engineering productivity. As the firm invests heavily in AI, it envisions a future where CEOs manage both humans and agents to drive business growth.

LLM4SD is a new AI tool that accelerates scientific discoveries by retrieving information, analyzing data, and generating hypotheses from it. Unlike existing machine learning models, LLM4SD explains its reasoning, making its predictions more transparent and trustworthy. The tool was tested on 58 research tasks across various fields and outperformed leading scientific models with improved accuracy.

In accelerating its push to compete with OpenAI, Microsoft is developing powerful AI models and exploring alternatives to power products like Copilot bot. The company has developed AI "reasoning" models comparable to those offered by OpenAI and is reportedly considering offering them through an API later this year. Meanwhile, Microsoft is testing alternative AI models from various firms as possible replacements for OpenAI technology in Copilot.

OpenAI has launched GPT-4.5, a significant advancement in its AI models, offering greater computational power and data integration than previous iterations. Despite its enhanced capabilities, GPT-4.5 does not achieve the anticipated performance leaps seen in earlier models, particularly when compared to emerging AI reasoning models from competitors. The model's introduction reflects a critical moment in AI development, where the limitations of traditional training methods are becoming apparent, prompting a shift towards more complex reasoning approaches.

Chinese AI startup DeepSeek has disclosed cost and revenue data related to its hit V3 and R1 models, claiming a theoretical cost-profit ratio of up to 545% per day. This marks the first time the Hangzhou-based company has revealed any information about its profit margins from less computationally intensive "inference" tasks. The revelation could further rattle AI stocks outside China that plunged in January after web and app chatbots powered by its R1 and V3 models surged in popularity worldwide.

Developers can access AI model capabilities at a fraction of the price thanks to distillation, allowing app developers to run AI models quickly on devices such as laptops and smartphones. The technique uses a "teacher" LLM to train smaller AI systems, with companies like OpenAI and IBM Research adopting the method to create cheaper models. However, experts note that distilled models have limitations in terms of capability.

OpenAI has introduced NextGenAI, a consortium aimed at funding AI-assisted research across leading universities, backed by a $50 million investment in grants and resources. The initiative, which includes prestigious institutions such as Harvard and MIT as founding partners, seeks to empower students and researchers in their exploration of AI's potential and applications. As this program unfolds, it raises questions about the balance of influence between OpenAI's proprietary technologies and the broader landscape of AI research.