News Gist .News

Articles | Politics | Finance | Stocks | Crypto | AI | Technology | Science | Gaming | PC Hardware | Laptops | Smartphones | Archive

Speech-to-Text Pioneer Elevenlabs Launches Standalone Model | Techcrunch

ElevenLabs, a leading AI startup, has taken a significant step in the field of speech-to-text technology by launching its first standalone model called Scribe. The company's groundbreaking achievement marks a major milestone in the development of robust and accurate language processing capabilities. With Scribe, ElevenLabs aims to revolutionize the way people interact with audio content, enabling seamless transcription and captioning.

See Also

Podcasting Platform Podcastle Launches Text-to-Speech Model with Over 450 AI Voices Δ1.77

Podcast recording and editing platform Podcastle is now joining other companies in the AI-powered, text-to-speech race by releasing its own AI model called Asyncflow v1.0, offering more than 450 AI voices that can narrate any text. The new model will be integrated into the company's API for developers to directly use it in their apps, reducing costs and increasing competition. Podcastle aims to offer a robust text-to-speech solution under one redesigned site, giving it an edge over competitors.

The Future of Writing: AI-Powered Smart Pens Takes Center Stage Δ1.70

The One Smart AI Pen, launched on Kickstarter, promises a futuristic writing experience with its battery, microphone, and Bluetooth capabilities. The device can convert handwritten notes into digital text, translate languages in real-time, and even converse with ChatGPT-4.0-Mini. With its ambitious feature set and optional AI functionality, the One Smart AI Pen is poised to revolutionize the way we interact with writing.

Openai Launches gpt-4.5, Its Largest Model to Date Δ1.69

GPT-4.5 is OpenAI's latest AI model, trained using more computing power and data than any of the company's previous releases, marking a significant advancement in natural language processing capabilities. The model is currently available to subscribers of ChatGPT Pro as part of a research preview, with plans for wider release in the coming weeks. As the largest model to date, GPT-4.5 has sparked intense discussion and debate among AI researchers and enthusiasts.

The Future of Reading Displays - TCL Nxtpaper Technology Δ1.69

The TCL Nxtpaper 11 Plus's innovative display seamlessly shifts from full color to an ink-like paper display, providing a unique reading experience that challenges traditional tablets like Kindles. The device's AI-powered features, such as Text Assist and Smart Translator, enhance the overall user experience with features like transcription and real-time translations. With its advanced eye comfort modes and impressive tech specs, the TCL Nxtpaper 11 Plus has the potential to revolutionize the tablet industry.

Revolutionizing Writing with AI: The One Smart AI Pen Δ1.68

The One Smart AI Pen integrates ChatGPT AI into a ball point pen, offering instant writing suggestions, generating ideas, or drafting emails. It can translate in real-time across more than 52 languages, take dictations, summarize meetings, transcribe handwritten notes, set reminders, and make to-do lists. The smart pen's ability to record meetings and transcribe them could be particularly useful in industries such as law, medicine, and academia.

Distilling AI Models Costs Less, Raises Revenue Questions Δ1.68

Developers can access AI model capabilities at a fraction of the price thanks to distillation, allowing app developers to run AI models quickly on devices such as laptops and smartphones. The technique uses a "teacher" LLM to train smaller AI systems, with companies like OpenAI and IBM Research adopting the method to create cheaper models. However, experts note that distilled models have limitations in terms of capability.

Ceramic.ai Looks to Help Enterprises Build AI Models Faster and More Efficiently Δ1.68

Anna Patterson's new startup, Ceramic.ai, aims to revolutionize how large language models are trained by providing foundational AI training infrastructure that enables enterprises to scale their models 100x faster. By reducing the reliance on GPUs and utilizing long contexts, Ceramic claims to have created a more efficient approach to building LLMs. This infrastructure can be used with any cluster, allowing for greater flexibility and scalability.

A Year Later, OpenAI Still Hasn't Released Its Voice Cloning Tool Δ1.68

OpenAI's anticipated voice cloning tool, Voice Engine, remains in limited preview a year after its announcement, with no timeline for a broader launch. The company’s cautious approach may stem from concerns over potential misuse and a desire to navigate regulatory scrutiny, reflecting a tension between innovation and safety in AI technology. As OpenAI continues testing with a select group of partners, the future of Voice Engine remains uncertain, highlighting the challenges of deploying advanced AI responsibly.

Flora Builds an AI-Powered ‘Infinite Canvas’ for Creative Professionals Δ1.67

Flora, a startup led by Weber Wong, aims to revolutionize creative work by providing an "infinite canvas" that integrates existing AI models, allowing professionals to collaborate and generate diverse creative outputs seamlessly. The platform differentiates itself from traditional AI tools by focusing on user interface rather than the models themselves, seeking to enhance the creative process rather than replace it. Wong's vision is to empower artists and designers, making it possible for them to produce significantly more work while maintaining creative control.

The Rise of AI-Powered Ad Startup Creatopy Under Tammy Nam's Leadership Δ1.67

Creatopy, an AI-powered ad startup, has appointed Tammy Nam as its new CEO, bringing a wealth of experience from her previous roles at PicsArt and Viki. Nam is well-versed in scaling early-stage startups and understands marketing tech, making her an ideal fit for the company. Creatopy has already achieved significant growth, with mid-market and enterprise revenue increasing by 400% between February 2024 and February 2025.

Foxconn Unveils First Large Language Model Δ1.67

Foxconn has launched its first large language model, named "FoxBrain," which uses 120 Nvidia GPUs and is based on Meta's Llama 3.1 architecture to analyze data, support decision-making, and generate code. The model, trained in about four weeks, boasts performance comparable to world-class standards despite a slight gap compared to China's DeepSeek distillation model. Foxconn plans to collaborate with technology partners to expand the model's applications and promote AI in manufacturing and supply chain management.

Google Unveils Gemini Screenshare at MWC 2025. Δ1.67

Google has updated its AI assistant Gemini with two significant features that enhance its capabilities and bring it closer to rival ChatGPT. The "Screenshare" feature allows Gemini to do live screen analysis and answer questions in the context of what it sees, while the new "Gemini Live" feature enables real-time video analysis through the phone's camera. These updates demonstrate Google's commitment to innovation and its quest to remain competitive in the AI assistant market.

I Told Windows Notepad's New AI to Turn Nvidia's Fail Into Poetry Δ1.67

Microsoft has introduced an AI-powered Rewrite feature in Windows 11's Notepad, allowing users to edit text in various styles and tones, including poetry. This new functionality, which is part of the Microsoft 365 subscription, enables users to transform existing text into different formats, such as casual or formal, while also tapping into creative expressions. The feature reflects Microsoft's ongoing integration of AI into its productivity tools, showcasing a shift towards enhancing user experience through innovative editing options.

Google Debuts Gemini-Based Text Embedding Model Δ1.67

Google has added a new, experimental 'embedding' model for text, Gemini Embedding, to its Gemini developer API. Embedding models translate text inputs like words and phrases into numerical representations, known as embeddings, that capture the semantic meaning of the text. This innovation could lead to improved performance across diverse domains, including finance, science, legal, search, and more.

T-Mobile's Parent Company Unveils AI-Powered 'App-Less' Phone with Perplexity Assistant Δ1.67

Deutsche Telekom is building a new Perplexity chatbot-powered "AI Phone," the companies announced at Mobile World Congress (MWC) in Barcelona today. The new device will be revealed later this year and run “Magenta AI,” which gives users access to Perplexity Assistant, Google Cloud AI, ElevenLabs, Picsart, and a suite of AI tools. The AI phone concept was first revealed at MWC 2024 by Deutsche Telekom (T-Mobile's parent company) as an "app-less" device primarily controlled by voice that can do things like book flights and make restaurant reservations.

Detecting Deception in Digital Content Δ1.67

SurgeGraph has introduced its AI Detector tool to differentiate between human-written and AI-generated content, providing a clear breakdown of results at no cost. The AI Detector leverages advanced technologies like NLP, deep learning, neural networks, and large language models to assess linguistic patterns with reported accuracy rates of 95%. This innovation has significant implications for the content creation industry, where authenticity and quality are increasingly crucial.

Develop AI Device Ecosystem with Google and Qualcomm Δ1.67

Honor is rebranding itself as an "AI device ecosystem company" and working on a new type of intelligent smartphone that will feature "purpose-built, human-centric AI designed to maximize human potential."The company's new CEO, James Li, announced the move at MWC 2025, calling on the smartphone industry to "co-create an open, value-sharing AI ecosystem that maximizes human potential, ultimately benefiting all mankind." Honor's Alpha plan consists of three steps, each catering to a different 'era' of AI, including developing a "super intelligent" smartphone, creating an AI ecosystem, and co-existing with carbon-based life and silicon-based intelligence.

Stability AI Optimizes Audio Generation Model for Arm Chips Δ1.67

Stability AI has optimized its audio generation model, Stable Audio Open, to run on Arm chips, allowing for faster generation times and enabling offline use of AI-powered audio apps. The company claims that the training set is entirely royalty-free and poses no IP risk, making it a unique offering in the market. By partnering with Arm, Stability aims to bring its models to consumer apps and devices, expanding its reach in the creative industry.

AI Takes Center Stage as Alibaba Drives Shares Higher Δ1.67

Alibaba Group's release of an artificial intelligence (AI) reasoning model has driven its Hong Kong-listed shares more than 8% higher on Thursday, outperforming global hit DeepSeek's R1. The company's AI unit claims that its QwQ-32B model can achieve performance comparable to top models like OpenAI's o1 mini and DeepSeek's R1. Alibaba's new model is accessible via its chatbot service, Qwen Chat, allowing users to choose various Qwen models.

Eerily Realistic AI Voice Demo Sparks Amazement and Discomfort Online Δ1.67

The new AI voice model from Sesame has left many users both fascinated and unnerved, featuring uncanny imperfections that can lead to emotional connections. The company's goal is to achieve "voice presence" by creating conversational partners that engage in genuine dialogue, building confidence and trust over time. However, the model's ability to mimic human emotions and speech patterns raises questions about its potential impact on user behavior.

Microsoft's New Dragon Copilot Is an AI Assistant for Healthcare Δ1.67

Microsoft has announced Microsoft Dragon Copilot, an AI system for healthcare that can listen to and create notes based on clinical visits. The system combines voice-dictating and ambient listening tech created by AI voice company Nuance, which Microsoft bought in 2021. According to Microsoft's announcement, the new system can help its users streamline their documentation through features like "multilanguage ambient note creation" and natural language dictation.

All the Smart Home News From the Matter Launch Event Δ1.67

Matter has officially launched, marking a significant advancement in smart home interoperability with over 190 certified products from major companies like Amazon, Apple, Google, and Samsung. The event showcased various innovative devices, including the first Matter-enabled fridge from Bosch and Thread-compatible sensors from Aqara, highlighting the potential for a more seamless integration of smart home technology. Despite the excitement, industry experts emphasize that achieving a fully interoperable smart home remains a work in progress, underscoring that Matter is just the beginning of a long journey.

Apple's Voice-Activated Fails with Scottish Accent Δ1.67

Apple's voice-to-text service has failed to accurately transcribe a voicemail message left by a garage worker, mistakenly inserting a reference to sex and an apparent insult into the message. The incident highlights the challenges faced by speech-to-text engines in dealing with difficult accents, background noise, and prepared scripts. The Apple AI system may have struggled due to the caller's Scottish accent and poor audio quality.

Shure Launches MoveMic 88+ Wireless Microphone for Smartphones, Cameras, and Computers Δ1.67

The Shure MoveMic 88+ wireless stereo microphone provides content creators with unmatched audio versatility, featuring four selectable polar patterns and adjustable EQ. It can be placed closer to the audio source for higher-quality audio, allowing creators to capture professional audio in any environment. The device pairs directly with a mobile phone via the Shure MOTIV apps, streamlining workflow and providing a lightweight and portable rig.

Sam Altman Tweets Delay to ChatGPT-4.5 Launch While Also Proposing a Shocking New Payment Structure Δ1.67

OpenAI CEO Sam Altman has announced a staggered rollout for the highly anticipated ChatGPT-4.5, delaying the full launch to manage server demand effectively. In conjunction with this, Altman proposed a controversial credit-based payment system that would allow subscribers to allocate tokens for accessing various features instead of providing unlimited access for a fixed fee. The mixed reactions from users highlight the potential challenges OpenAI faces in balancing innovation with user satisfaction.