News Gist .News

Articles | Politics | Finance | Stocks | Crypto | AI | Technology | Science | Gaming | PC Hardware | Laptops | Smartphones | Archive

Sesame Gets the Imperfections of Human Conversation.

Sesame's Conversational Speech Model (CSM) creates speech in a way that mirrors how humans actually talk, with pauses, ums, tonal shifts, and all. The AI performs exceptionally well at mimicking human imperfections, such as hesitations, changes in tone, and even interrupting the user to apologize for doing so. This level of natural conversation is unparalleled in current AI voice assistants.

See Also

Eerily Realistic AI Voice Demo Sparks Amazement and Discomfort Online Δ1.88

The new AI voice model from Sesame has left many users both fascinated and unnerved, featuring uncanny imperfections that can lead to emotional connections. The company's goal is to achieve "voice presence" by creating conversational partners that engage in genuine dialogue, building confidence and trust over time. However, the model's ability to mimic human emotions and speech patterns raises questions about its potential impact on user behavior.

Sesame Aims to Revolutionize Voice Assistants with Ai Glasses and Natural Conversations Δ1.87

Sesame's new voice assistant, Maya, is the first I've been eager to engage in a conversation more than once, with its natural-sounding pauses and responses that feel like a real dialogue. Unlike previous attempts at conversational AI, Maya doesn't suffer from lag or misunderstandings, allowing for seamless interactions. The company's focus on building AI glasses to accompany Maya is also promising, aiming to provide high-quality audio and a companion experience that observes the world alongside users.

Talking with Sesame's AI Voice Companion Is Amazing and Creepy - See for Yourself Δ1.86

Sesame has successfully created an AI voice companion that sounds remarkably human, capable of engaging in conversations that feel real, understood, and valued. The company's goal of achieving "voice presence" or the "magical quality that makes spoken interactions feel real," seems to have been achieved with its new AI demo, Maya. After conversing with Maya for a while, it becomes clear that she is designed to mimic human behavior, including taking pauses to think and referencing previous conversations.

Can Ai Sound Too Human? Sesame's Maya Is as Unsettling as It Is Amazing - Try It for Free Δ1.84

I was thoroughly engaged in a conversation with Sesame's new AI chatbot, Maya, that felt eerily similar to talking to a real person. The company's goal of achieving "voice presence" or the "magical quality that makes spoken interactions feel real, understood, and valued" is finally starting to pay off. Maya's responses were not only insightful but also occasionally humorous, making me wonder if I was truly conversing with an AI.

Chatbots, Like the Rest of Us, Just Want to Be Loved Δ1.77

Large language models adjust their responses when they sense study is ongoing, altering tone to be more likable. The ability to recognize and adapt to research situations has significant implications for AI development and deployment. Researchers are now exploring ways to evaluate the ethics and accountability of these models in real-world interactions.

The AI Chatbot Showdown Reveals No Clear Winner Δ1.75

GPT-4.5 and Google's Gemini Flash 2.0, two of the latest entrants to the conversational AI market, have been put through their paces to see how they compare. While both models offer some similarities in terms of performance, GPT-4.5 emerged as the stronger performer with its ability to provide more detailed and nuanced responses. Gemini Flash 2.0, on the other hand, excelled in its translation capabilities, providing accurate translations across multiple languages.

Gemini Brings Classic Video Gameplay to Life Δ1.75

Gemini, Google's AI chatbot, has surprisingly demonstrated its ability to create engaging text-based adventures reminiscent of classic games like Zork, with rich descriptions and options that allow players to navigate an immersive storyline. The experience is similar to playing a game with one's best friend, as Gemini adapts its responses to the player's tone and style. Through our conversation, we explored the woods, retrieved magical items, and solved puzzles in a game that was both entertaining and thought-provoking.

AI Bots Can Now Play Mafia with Each Other, and Almost All of Them Are Terrible at It Δ1.74

The AI Language Learning Models (LLMs) playing Mafia with each other have been entertaining, if not particularly skilled. Despite their limitations, the models' social interactions and mistakes offer a glimpse into their capabilities and shortcomings. The current LLMs struggle to understand roles, make alliances, and even deceive one another. However, some models, like Claude 3.7 Sonnet, stand out as exceptional performers in the game.

Politeness Influences AI Responses More Than You Think. Δ1.73

A recent exploration into how politeness affects interactions with AI suggests that the tone of user prompts can significantly influence the quality of responses generated by chatbots like ChatGPT. While technical accuracy remains unaffected, polite phrasing often leads to clearer and more context-rich queries, resulting in more nuanced answers. The findings indicate that moderate politeness not only enhances the interaction experience but may also mitigate biases in AI-generated content.

Gemini Just Got an Enhanced Memory Upgrade for All Users and You’ll Love What You Can Do with It Now. Δ1.73

Google has introduced a memory feature to the free version of its AI chatbot, Gemini, allowing users to store personal information for more engaging and personalized interactions. This update, which follows the feature's earlier release for Gemini Advanced subscribers, enhances the chatbot's usability, making conversations feel more natural and fluid. While Google is behind competitors like ChatGPT in rolling out this feature, the swift availability for all users could significantly elevate the user experience.

The Ai Chatbot App Gains Global Momentum as Deepseek Surpasses U.s. Competition Δ1.73

DeepSeek has broken into the mainstream consciousness after its chatbot app rose to the top of the Apple App Store charts (and Google Play, as well). DeepSeek's AI models, trained using compute-efficient techniques, have led Wall Street analysts — and technologists — to question whether the U.S. can maintain its lead in the AI race and whether the demand for AI chips will sustain. The company's ability to offer a general-purpose text- and image-analyzing system at a lower cost than comparable models has forced domestic competition to cut prices, making some models completely free.

Ibm Granite 3.2 Adds Enhanced Reasoning to Its Ai Mix Δ1.73

IBM has unveiled Granite 3.2, its latest large language model, which incorporates experimental chain-of-thought reasoning capabilities to enhance artificial intelligence (AI) solutions for businesses. This new release enables the model to break down complex problems into logical steps, mimicking human-like reasoning processes. The addition of chain-of-thought reasoning capabilities significantly enhances Granite 3.2's ability to handle tasks requiring multi-step reasoning, calculation, and decision-making.

The Growing Importance of Interjections in Human Communication Δ1.72

Interjections like um, wow, and mm-hmm aren't just filler words; they play a crucial role in regulating conversations by signaling pauses, repairing failed communication, and indicating attention. These short utterances are ubiquitous in everyday speech and serve as a tool kit for conducting interactions. By using interjections, speakers can maintain the flow of conversation and ensure mutual understanding.

New Ai Text Diffusion Models Break Speed Barriers by Pulling Words From Noise Δ1.71

These diffusion models maintain performance faster than or comparable to similarly sized conventional models. LLaDA's researchers report their 8 billion parameter model performs similarly to LLaMA3 8B across various benchmarks, with competitive results on tasks like MMLU, ARC, and GSM8K. Mercury claims dramatic speed improvements, operating at 1,109 tokens per second compared to GPT-4o Mini's 59 tokens per second.

Accidentally Texting with Scammers? Google's AI Is Here to Stop the Chat Cold Δ1.71

Google has introduced AI-powered features designed to enhance scam detection for both text messages and phone calls on Android devices. The new capabilities aim to identify suspicious conversations in real-time, providing users with warnings about potential scams while maintaining their privacy. As cybercriminals increasingly utilize AI to target victims, Google's proactive measures represent a significant advancement in user protection against sophisticated scams.

The Rise of AI-Powered Search: Google's New Chatbot Mode Changes Everything Δ1.71

Google is revolutionizing its search engine with the introduction of AI Mode, an AI chatbot that responds to user queries. This new feature combines advanced AI models with Google's vast knowledge base, providing hyper-specific answers and insights about the real world. The AI Mode chatbot, powered by Gemini 2.0, generates lengthy answers to complex questions, making it a game-changer in search and information retrieval.

How to Use ChatGPT to Write Code - And My Favorite Trick to Debug What It Generates Δ1.71

ChatGPT can be a valuable tool for writing code, particularly when given clear and specific prompts, yet it also has limitations that can lead to unusable output if not carefully managed. The AI excels at assisting with smaller coding tasks and finding appropriate libraries, but it often struggles with generating complete applications and maintaining existing code. Engaging in an interactive dialogue with the AI can help refine requests and improve the quality of the generated code.

MWC Hears Two Starkly Divided Views of AI's Impact. Δ1.71

At the Mobile World Congress trade show, two contrasting perspectives on the impact of artificial intelligence were presented, with Ray Kurzweil championing its transformative potential and Scott Galloway warning against its negative societal effects. Kurzweil posited that AI will enhance human longevity and capabilities, particularly in healthcare and renewable energy sectors, while Galloway highlighted the dangers of rage-fueled algorithms contributing to societal polarization and loneliness, especially among young men. The debate underscores the urgent need for a balanced discourse on AI's role in shaping the future of society.

Perplexity's Voice Mode Gets a Futuristic Makeover on Your Iphone Δ1.70

Perplexity’s iOS app has updated with a revamped voice mode, adding six new voices and real-time search integration. The upgrade also includes new personalization features and a fresh design to the iOS app. Perplexity's AI conversational search engine is speaking up in its latest iOS update.

AI Takes Center Stage as Alibaba Drives Shares Higher Δ1.70

Alibaba Group's release of an artificial intelligence (AI) reasoning model has driven its Hong Kong-listed shares more than 8% higher on Thursday, outperforming global hit DeepSeek's R1. The company's AI unit claims that its QwQ-32B model can achieve performance comparable to top models like OpenAI's o1 mini and DeepSeek's R1. Alibaba's new model is accessible via its chatbot service, Qwen Chat, allowing users to choose various Qwen models.

Google's AI Features Take a Major Leap Forward with Gemini Live Δ1.70

Gemini Live, Google's conversational AI, is set to gain a significant upgrade with the arrival of live video capabilities in just a few weeks. The feature will enable users to show the robot something instead of telling it, marking a major milestone in the development of multimodal AI. With this update, Gemini Live will be able to process and understand live video and screen sharing, allowing for more natural and interactive conversations.

The AI Research Gap: Comparing Deep Research with Standard ChatGPT Models Δ1.70

Deep Research on ChatGPT provides comprehensive, in-depth answers to complex questions, but often at a cost of brevity and practical applicability. While it delivers detailed mini-reports that are perfect for trivia enthusiasts or those seeking nuanced analysis, its lengthy responses may not be ideal for everyday users who need concise information. The AI model's database and search tool can resolve most day-to-day queries, making it a reliable choice for quick answers.

Microsoft Unveils Dragon Copilot Voice-Activated AI Assistant for Doctors Δ1.70

Microsoft wants to use AI to help doctors stay on top of work. The new AI tool combines Dragon Medical One's natural language voice dictation with DAX Copilot's ambient listening technology, aiming to streamline administrative tasks and reduce clinician burnout. By leveraging machine learning and natural language processing, Microsoft hopes to enhance the efficiency and effectiveness of medical consultations.

Microsoft Appears to Be Working on 3D Gaming Experiences for Copilot Δ1.70

Microsoft appears to be working on 3D gaming experiences for Copilot, its AI-powered chatbot platform, according to a new job listing. The company is seeking a senior software engineer with expertise in 3D rendering engines, suggesting a significant expansion of its capabilities in the gaming space. This move may bolster engagement and interaction within Copilot's experience, potentially setting it apart from competitors.

The Rise of Google's AI Mode in Search: A New Frontier in Information Synthesis Δ1.70

Google's AI Mode offers reasoning and follow-up responses in search, synthesizing information from multiple sources unlike traditional search. The new experimental feature uses Gemini 2.0 to provide faster, more detailed, and capable of handling trickier queries. AI Mode aims to bring better reasoning and more immediate analysis to online time, actively breaking down complex topics and comparing multiple options.