News Gist .News

Articles | Politics | Finance | Stocks | Crypto | AI | Technology | Science | Gaming | PC Hardware | Laptops | Smartphones | Archive

I Tried Adding Audio to Videos in Dream Machine, and Sora's Silence Sounds Deafening in Comparison

Luma Labs' new tool augments AI videos with sound by allowing users to add audio to video clips for free. The new feature brings audio to your video, custom-generated to match a written prompt or created by the AI, and is based solely on what's happening in the video. This update is big because AI-generated videos, while sometimes visually stunning, have always felt incomplete without sound.

See Also

Integrating Sora Into Chatgpt: Ai Videos at Your Fingertips Δ1.77

OpenAI plans to integrate its AI video generation tool, Sora, directly into its popular consumer chatbot app, ChatGPT. The integration aims to broaden the appeal of Sora and attract more users to ChatGPT's premium subscription tiers. As Sora is expected to be integrated into ChatGPT, users will have access to cinematic clips generated by the AI model.

"Openai Unveils Integration Plan for Sora Video Generator with Chatgpt" Δ1.77

OpenAI intends to eventually integrate its AI video generation tool, Sora, directly into its popular consumer chatbot app, ChatGPT, allowing users to generate cinematic clips and potentially attracting premium subscribers. The integration will expand Sora's accessibility beyond a dedicated web app, where it was launched in December. OpenAI plans to further develop Sora by expanding its capabilities to images and introducing new models.

Sora to Be Integrated Into Chatgpt Beyond Us Launch Δ1.76

OpenAI plans to integrate its video AI tool Sora into the ChatGPT app, following its successful rollout in the US and European countries. The integration aims to enhance the user experience by providing a seamless video generation capability within the ChatGPT interface. However, it is unclear when this integration will occur, with discussions suggesting it may not be comprehensive.

How to Fix AI's Fatal Flaw - and Give Creators Their Due (Before It's Too Late) Δ1.74

AI image and video generation models face significant ethical challenges, primarily concerning the use of existing content for training without creator consent or compensation. The proposed solution, AItextify, aims to create a fair compensation model akin to Spotify, ensuring creators are paid whenever their work is utilized by AI systems. This innovative approach not only protects creators' rights but also enhances the quality of AI-generated content by fostering collaboration between creators and technology.

Stability AI Optimizes Audio Generation Model for Arm Chips Δ1.74

Stability AI has optimized its audio generation model, Stable Audio Open, to run on Arm chips, allowing for faster generation times and enabling offline use of AI-powered audio apps. The company claims that the training set is entirely royalty-free and poses no IP risk, making it a unique offering in the market. By partnering with Arm, Stability aims to bring its models to consumer apps and devices, expanding its reach in the creative industry.

Amazon Prime Video Tests AI-Based Dubbing on Licensed Movies, Series Δ1.73

Amazon Prime Video is set to introduce AI-aided dubbing in English and Spanish on its licensed content, starting with 12 titles, to boost viewership and expand reach globally. The feature will be available only on new releases without existing dubbing support, a move aimed at improving customer experience through enhanced accessibility. As media companies increasingly integrate AI into their offerings, the use of such technology raises questions about content ownership and control.

Detecting Deception in Digital Content Δ1.73

SurgeGraph has introduced its AI Detector tool to differentiate between human-written and AI-generated content, providing a clear breakdown of results at no cost. The AI Detector leverages advanced technologies like NLP, deep learning, neural networks, and large language models to assess linguistic patterns with reported accuracy rates of 95%. This innovation has significant implications for the content creation industry, where authenticity and quality are increasingly crucial.

Conan O'Brien Comments on AI During Oscars Opening Monologue Δ1.73

When hosting the 2025 Oscars last night, comedian and late-night TV host Conan O’Brien addressed the use of AI in his opening monologue, reflecting the growing conversation about the technology’s influence in Hollywood. Conan jokingly stated that AI was not used to make the show, but this remark has sparked renewed debate about the role of AI in filmmaking. The use of AI in several Oscar-winning films, including "The Brutalist," has ignited controversy and raised questions about its impact on jobs and artistic integrity.

Podcasting Platform Podcastle Launches Text-to-Speech Model with Over 450 AI Voices Δ1.73

Podcast recording and editing platform Podcastle is now joining other companies in the AI-powered, text-to-speech race by releasing its own AI model called Asyncflow v1.0, offering more than 450 AI voices that can narrate any text. The new model will be integrated into the company's API for developers to directly use it in their apps, reducing costs and increasing competition. Podcastle aims to offer a robust text-to-speech solution under one redesigned site, giving it an edge over competitors.

AI Dubbing on Prime Video: A New Frontier in Accessibility Δ1.72

Prime Video has started testing AI dubbing on select titles, making its content more accessible to its vast global subscriber base. The pilot program will use a hybrid approach that combines the efficiency of AI with local language experts for quality control. By doing so, Prime Video aims to provide high-quality subtitles and dubs for its movies and shows.

Talking with Sesame's AI Voice Companion Is Amazing and Creepy - See for Yourself Δ1.72

Sesame has successfully created an AI voice companion that sounds remarkably human, capable of engaging in conversations that feel real, understood, and valued. The company's goal of achieving "voice presence" or the "magical quality that makes spoken interactions feel real," seems to have been achieved with its new AI demo, Maya. After conversing with Maya for a while, it becomes clear that she is designed to mimic human behavior, including taking pauses to think and referencing previous conversations.

AI Dubbing on Prime Video Tests Grounds for Broader Industry Shift Δ1.72

Prime Video is now experimenting with AI-assisted dubbing for select licensed movies and TV shows, as announced by the Amazon-owned streaming service. According to Prime Video, this new test will feature AI-assisted dubbing services in English and Latin American Spanish, combining AI with human localization professionals to “ensure quality control,” the company explained. Initially, it’ll be available for 12 titles that previously lacked dubbing support.

Eerily Realistic AI Voice Demo Sparks Amazement and Discomfort Online Δ1.72

The new AI voice model from Sesame has left many users both fascinated and unnerved, featuring uncanny imperfections that can lead to emotional connections. The company's goal is to achieve "voice presence" by creating conversational partners that engage in genuine dialogue, building confidence and trust over time. However, the model's ability to mimic human emotions and speech patterns raises questions about its potential impact on user behavior.

Can Ai Sound Too Human? Sesame's Maya Is as Unsettling as It Is Amazing - Try It for Free Δ1.72

I was thoroughly engaged in a conversation with Sesame's new AI chatbot, Maya, that felt eerily similar to talking to a real person. The company's goal of achieving "voice presence" or the "magical quality that makes spoken interactions feel real, understood, and valued" is finally starting to pay off. Maya's responses were not only insightful but also occasionally humorous, making me wonder if I was truly conversing with an AI.

Microsoft Unveils Dragon Copilot Voice-Activated AI Assistant for Doctors Δ1.71

Microsoft wants to use AI to help doctors stay on top of work. The new AI tool combines Dragon Medical One's natural language voice dictation with DAX Copilot's ambient listening technology, aiming to streamline administrative tasks and reduce clinician burnout. By leveraging machine learning and natural language processing, Microsoft hopes to enhance the efficiency and effectiveness of medical consultations.

Intangible AI Secures $4M in Funding to Revolutionize 3D Creative Tool Δ1.71

Intangible AI, a no-code 3D creation tool for filmmakers and game designers, offers an AI-powered creative tool that allows users to create 3D world concepts with text prompts. The company's mission is to make the creative process accessible to everyone, including professionals such as filmmakers, game designers, event planners, and marketing agencies, as well as everyday users looking to visualize concepts. With its new fundraise, Intangible plans a June launch for its no-code web-based 3D studio.

AI Coding Assistants Emerge on macOS Δ1.71

ChatGPT, OpenAI's AI-powered chatbot platform, can now directly edit code — if you're on macOS, that is. The newest version of the ChatGPT app for macOS can take action to edit code in supported developer tools, including Xcode, VS Code, and JetBrains. Users can optionally turn on an “auto-apply” mode so ChatGPT can make edits without the need for additional clicks.

A Year Later, OpenAI Still Hasn't Released Its Voice Cloning Tool Δ1.71

OpenAI's anticipated voice cloning tool, Voice Engine, remains in limited preview a year after its announcement, with no timeline for a broader launch. The company’s cautious approach may stem from concerns over potential misuse and a desire to navigate regulatory scrutiny, reflecting a tension between innovation and safety in AI technology. As OpenAI continues testing with a select group of partners, the future of Voice Engine remains uncertain, highlighting the challenges of deploying advanced AI responsibly.

Shure Launches MoveMic 88+ Wireless Microphone for Smartphones, Cameras, and Computers Δ1.71

The Shure MoveMic 88+ wireless stereo microphone provides content creators with unmatched audio versatility, featuring four selectable polar patterns and adjustable EQ. It can be placed closer to the audio source for higher-quality audio, allowing creators to capture professional audio in any environment. The device pairs directly with a mobile phone via the Shure MOTIV apps, streamlining workflow and providing a lightweight and portable rig.

Most AI Voice Cloning Tools Aren't Safe From Scammers Δ1.71

Consumer Reports assessed the most leading voice cloning tools and found that four products did not have proper safeguards in place to prevent non-consensual cloning. The technology has many positive applications, but it can also be exploited for elaborate scams and fraud. To address these concerns, Consumer Reports recommends additional protections, such as unique scripts, watermarking AI-generated audio, and prohibiting audio containing scam phrases.

Google's AI Features Take a Major Leap Forward with Gemini Live Δ1.70

Gemini Live, Google's conversational AI, is set to gain a significant upgrade with the arrival of live video capabilities in just a few weeks. The feature will enable users to show the robot something instead of telling it, marking a major milestone in the development of multimodal AI. With this update, Gemini Live will be able to process and understand live video and screen sharing, allowing for more natural and interactive conversations.

Distilling AI Models Costs Less, Raises Revenue Questions Δ1.70

Developers can access AI model capabilities at a fraction of the price thanks to distillation, allowing app developers to run AI models quickly on devices such as laptops and smartphones. The technique uses a "teacher" LLM to train smaller AI systems, with companies like OpenAI and IBM Research adopting the method to create cheaper models. However, experts note that distilled models have limitations in terms of capability.

Is Free Chatgpt Voice Enough to Keep You Paying? Δ1.70

ChatGPT's Advanced Voice Mode offers a fluid conversation with an AI that doesn't sound like talking to a robot, capable of everything ChatGPT does. Despite some minor differences in nuance and response speed, the free version is not identical to what paying users get. The biggest perk for Plus subscribers is access to richer features like video and screen sharing within Voice Mode.

Microsoft Unveils Copilot Redesign and AI-Driven Features Δ1.70

Copilot is getting a new look with an all-new card-based design across mobile, web, and Windows, allowing users to see what they're looking at, converse in natural voice, and access a virtual news presenter. The new features include personalized Copilot Vision, OpenAI-like natural voice conversation mode, and a revamped AI-powered Windows Search that includes a "Click to Do" feature. Additionally, Paint and Photos are getting fun new features like Generative Fill and Erase.

Understanding Alexa+'s Rise to Prominence with Generative Ai Power Δ1.70

Alexa+, Amazon's latest generative AI-powered virtual assistant, is poised to transform the voice assistant landscape with its natural-sounding cadence and capability to generate content. By harnessing foundational models and generative AI, the new service promises more productive user interactions and greater customization power. The launch of Alexa+ marks a significant shift for Amazon, as it seeks to reclaim its position in the market dominated by other AI-powered virtual assistants.