News Gist .News

Foxconn Unveils First Large Language Model

Foxconn has launched its first large language model, named "FoxBrain," which was trained on 120 Nvidia GPUs and is based on Meta's Llama 3.1 architecture. It is designed to analyze data, support decision-making, and generate code. The model, trained in about four weeks, is said to perform close to world-class standards, with a slight gap behind China's DeepSeek distillation model. Foxconn plans to collaborate with technology partners to expand the model's applications and promote AI in manufacturing and supply chain management.

See Also

Foxconn Unveils 'FoxBrain,' Built on Nvidia GPUs to Boost AI Efforts Δ1.96

Foxconn has launched its first large language model, "FoxBrain," built on top of Nvidia's H100 GPUs, with the goal of enhancing manufacturing and supply chain management. The model was trained on 120 GPUs in about four weeks, and its performance trails that of DeepSeek's distillation model. Foxconn plans to collaborate with technology partners to expand the model's applications and promote AI in various industries.

Foxconn’s Mega-AI Plant Ready in a Year Despite Trump Tariffs Δ1.81

Foxconn's ambitious mega-AI server plant in Guadalajara, Mexico, is set to be completed within a year, despite looming tariffs proposed by former President Trump. With a planned investment of approximately $900 million, this facility will become the world's largest assembly plant for Nvidia's GB200 AI chips, signaling a robust commitment to expanding server-related operations in Mexico amidst ongoing U.S.-China trade tensions. Local government officials have expressed strong support for the project, emphasizing that investment in Jalisco's semiconductor industry continues to thrive, countering potential tariff impacts.

OpenAI Launching GPT-4.5, Its Next General-Purpose Large Language Model Δ1.79

GPT-4.5 represents a significant milestone in the development of large language models, offering improved accuracy and natural interaction with users. The new model's broader knowledge base and enhanced ability to follow user intent are expected to make it more useful for tasks such as improving writing, programming, and solving practical problems. As OpenAI continues to push the boundaries of AI research, GPT-4.5 marks a crucial step towards creating more sophisticated language models.

OpenAI Launches GPT-4.5, Its Largest Model to Date Δ1.78

GPT-4.5 is OpenAI's latest AI model, trained using more computing power and data than any of the company's previous releases, marking a significant advancement in natural language processing capabilities. The model is currently available to subscribers of ChatGPT Pro as part of a research preview, with plans for wider release in the coming weeks. As OpenAI's largest model to date, GPT-4.5 has sparked intense discussion and debate among AI researchers and enthusiasts.

The AI Arms Race Heats Up: Tencent Unveils Model that Outdoes DeepSeek Δ1.77

Tencent Holdings Ltd. has unveiled its Hunyuan Turbo S artificial intelligence model, which the company claims outperforms DeepSeek's R1 in response speed and deployment cost. This latest move joins a series of rapid rollouts from major industry players on both sides of the Pacific since DeepSeek stunned Silicon Valley with a model that matched the best from OpenAI and Meta Platforms Inc. The Hunyuan Turbo S model is designed to respond as instantly as possible, distinguishing itself from the deep reasoning approach of DeepSeek's eponymous chatbot.

New AI Text Diffusion Models Break Speed Barriers by Pulling Words From Noise Δ1.77

These diffusion models run as fast as or faster than similarly sized conventional models while maintaining comparable performance. LLaDA's researchers report their 8 billion parameter model performs similarly to LLaMA3 8B across various benchmarks, with competitive results on tasks like MMLU, ARC, and GSM8K. Mercury claims dramatic speed improvements, operating at 1,109 tokens per second compared to GPT-4o Mini's 59 tokens per second.

AI Takes Center Stage as Alibaba Drives Shares Higher Δ1.76

Alibaba Group's Hong Kong-listed shares rose more than 8% on Thursday after the company released an artificial intelligence (AI) reasoning model that it says can rival global hit DeepSeek's R1. The company's AI unit claims that its QwQ-32B model can achieve performance comparable to top models like OpenAI's o1-mini and DeepSeek's R1. Alibaba's new model is accessible via its chatbot service, Qwen Chat, where users can choose among various Qwen models.

OpenAI Unveils GPT-4.5 'Orion,' Its Largest AI Model Yet Δ1.76

OpenAI has launched GPT-4.5, a significant advancement in its AI models, trained with more computing power and data than previous iterations. Despite its enhanced capabilities, GPT-4.5 does not achieve the anticipated performance leaps seen in earlier models, particularly when compared to emerging AI reasoning models from competitors. The model's introduction reflects a critical moment in AI development, where the limitations of traditional training methods are becoming apparent, prompting a shift towards more complex reasoning approaches.

Tencent Releases New AI Model, Says Replies Faster than DeepSeek-R1 Δ1.76

Tencent has released a new AI model called Hunyuan Turbo S that it claims can answer queries faster than global hit DeepSeek's R1. The Hunyuan Turbo S is able to reply to queries within a second, distinguishing itself from slow-thinking models that reason before answering. The release comes after Tencent's competitors, including Alibaba with its Qwen 2.5-Max model, launched similar products in an effort to keep pace with DeepSeek's rapid growth.

OpenAI’s Largest AI Model Ever Arrives to Mixed Reviews Δ1.76

GPT-4.5 offers marginal gains in capability but poor coding performance despite being 30 times more expensive than GPT-4o. The model's high price and limited value are likely due to OpenAI's decision to shift focus from traditional LLMs to simulated reasoning models like o3. While this move may mark the end of an era for unsupervised learning approaches, it also opens up new opportunities for innovation in AI.

IBM Granite 3.2 Adds Enhanced Reasoning to Its AI Mix Δ1.76

IBM has unveiled Granite 3.2, its latest large language model, which incorporates experimental chain-of-thought reasoning capabilities to enhance artificial intelligence (AI) solutions for businesses. This new release enables the model to break down complex problems into logical steps, mimicking human-like reasoning processes. The addition of chain-of-thought reasoning capabilities significantly enhances Granite 3.2's ability to handle tasks requiring multi-step reasoning, calculation, and decision-making.

AI Model Evolution: Increased Size Brings Greater Capabilities but Higher Costs Δ1.75

OpenAI has begun rolling out its newest AI model, GPT-4.5, to users on its ChatGPT Plus tier, promising a more advanced experience with its increased size and capabilities. However, the new model's high costs are raising concerns about its long-term viability. The rollout comes after GPT-4.5 launched for subscribers to OpenAI’s $200-a-month ChatGPT Pro plan last week.

'Actual Intelligence': Franken-PC Debuts in Melbourne with a $35,000 Price Tag and Claims of Exceptional Performance Δ1.75

The CL1, Cortical Labs' first deployable biological computer, integrates living neurons with silicon for real-time computation, promising to revolutionize the field of artificial intelligence. By harnessing the power of real neurons grown across a silicon chip, the CL1 claims to solve complex challenges in ways that digital AI models cannot match. The technology could make cutting-edge innovation accessible to researchers who lack specialized hardware and software.

GPT-4.5 Launch Raises Compute-Intensive Concerns over AI Model Δ1.75

GPT-4.5, OpenAI's latest generative AI model, has sparked concerns over its massive size and computational requirements. The new model, internally dubbed Orion, promises improved performance in understanding user prompts but may also pose challenges for widespread adoption due to its resource-intensive nature. As users flock to try GPT-4.5, the implications of this significant advancement on AI's role in everyday life are starting to emerge.

The AI Industry Is Set for an Explosive Growth Spurt Δ1.75

The Stargate Project, a massive AI initiative led by OpenAI, Oracle, and SoftBank and backed by Microsoft and Arm, is expected to require 64,000 Nvidia GPUs by 2026. The project's initial batch of 16,000 GPUs will be delivered this summer, with the remaining GPUs arriving next year. The GPU demand for just one data center and a single customer highlights the scale of the initiative.

The Impact of OpenAI's GPT-4.5 on AI Development Revealed Δ1.74

OpenAI is launching GPT-4.5, its newest and largest model, which will be available as a research preview, with improved writing capabilities, better world knowledge, and a "refined personality" over previous models. However, OpenAI warns that it's not a frontier model and might not perform as well as o1 or o3-mini. GPT-4.5 was trained using new supervision techniques combined with traditional methods like supervised fine-tuning and reinforcement learning from human feedback.

OpenAI Rolls Out GPT-4.5 for Some Paying Users, to Expand Access Next Week Δ1.74

OpenAI has released a research preview of its latest GPT-4.5 model, which offers improved pattern recognition, creative insights without reasoning, and greater emotional intelligence. The company plans to expand access to the model in the coming weeks, starting with Pro users and developers worldwide. With features such as file and image uploads, writing, and coding capabilities, GPT-4.5 has the potential to revolutionize language processing.

The AI Chatbot App Gains Global Momentum as DeepSeek Surpasses U.S. Competition Δ1.74

DeepSeek has broken into the mainstream consciousness after its chatbot app rose to the top of the Apple App Store and Google Play charts. DeepSeek's AI models, trained using compute-efficient techniques, have led Wall Street analysts and technologists alike to question whether the U.S. can maintain its lead in the AI race and whether demand for AI chips will be sustained. The company's ability to offer a general-purpose text- and image-analyzing system at a lower cost than comparable models has forced domestic competitors to cut prices, making some models completely free.

Ceramic.ai Looks to Help Enterprises Build AI Models Faster and More Efficiently Δ1.74

Anna Patterson's new startup, Ceramic.ai, aims to revolutionize how large language models are trained by providing foundational AI training infrastructure that enables enterprises to scale their models 100x faster. By reducing the reliance on GPUs and utilizing long contexts, Ceramic claims to have created a more efficient approach to building LLMs. This infrastructure can be used with any cluster, allowing for greater flexibility and scalability.

Distilling AI Models Costs Less, Raises Revenue Questions Δ1.74

Distillation lets developers access AI model capabilities at a fraction of the price and run AI models quickly on devices such as laptops and smartphones. The technique uses a "teacher" LLM to train smaller AI systems, with companies like OpenAI and IBM Research adopting the method to create cheaper models. However, experts note that distilled models have limitations in terms of capability.

The AI Bubble Bursts: How DeepSeek's R1 Model Is Freeing Artificial Intelligence From the Grip of Elites Δ1.74

DeepSeek R1 has shattered the monopoly on large language models, making AI accessible to all without financial barriers. The release of this open-source model is a direct challenge to the business model of companies that rely on selling expensive AI services and tools. By democratizing access to AI capabilities, DeepSeek's R1 model threatens the lucrative industry built around artificial intelligence.

Foxconn Says February Revenue Rose 56.43% Year Over Year Δ1.73

Foxconn, the world's largest contract electronics maker and Apple's biggest iPhone assembler, reported on Wednesday that its February revenue jumped 56.43% year on year. The company has seen significant growth in recent months due to increased demand for electronic components. This surge is largely attributed to the ongoing global semiconductor shortage, which has driven up prices of essential materials.

Alibaba Invests Heavily in Artificial Intelligence Infrastructure Δ1.73

Alibaba Group Holding Ltd.'s latest deep learning model has generated significant excitement among investors and analysts, with the company claiming it performs similarly to DeepSeek while using a fraction of the data. The company's growing prowess in AI is being driven by China's push to support technological innovation and consumption. Alibaba's commitment to investing over 380 billion yuan ($52 billion) in AI infrastructure over the next three years has been hailed as a major step forward.

The AI Company Behind China's Most Popular Chatbots Reveals Record Profit Margins Δ1.73

DeepSeek, the Chinese AI startup behind the hit V3 and R1 models, has disclosed cost and revenue data pointing to a theoretical cost-profit ratio of up to 545% per day. The disclosure came after web and app chatbots powered by its R1 and V3 models surged in popularity worldwide, causing AI stocks outside China to plummet in January. DeepSeek's actual profit margins are likely to be lower than claimed because of the low price it charges for its V3 model.

GPT-4.5 Model Release: Enhanced Conversational Capabilities and Reduced Hallucinations Δ1.73

OpenAI's latest model, GPT-4.5, has launched with enhanced conversational capabilities and reduced hallucinations compared to its predecessor, GPT-4o. The new model boasts a deeper knowledge base and improved contextual understanding, leading to more intuitive and natural interactions. GPT-4.5 is designed for everyday tasks across various topics, including writing and solving practical problems.