Things I love:
1. Start and stop button. I love this explicit control over who is talking when.
2. Ability to upload files while the voice chat is going. Great idea. Often times I use gpt voice chat for studying, and it's annoying when I need to add another PDF to the context, since I need to stop the chat, upload, and then restart the voice session.
3. Real-time text display during voice chat. I asked you to take the derivative of a function I described, and it outlined its steps, but it wasn't just the transcription of what it was saying.
Things I hate:
1. The transcription is terrible. It took me 10 tries during the conversation to describe f(x) = x^2. Looking back on the transcriptions, it's literally nonsense.
2. There was a buggy moment when the voice conversation started but it was still demoing all the voice options simultaneously. Need some polishing.
There was a seemingly odd quick sequence of announcements from elevenlabs the last 24 hours, makes me think it's them - notably, I believe they launched 2.0 of their conversational AI today.
Does it say "y'all"?
I really wish Anthropic would focus all of their developer resources on implementing “download all files”.
I know it’s a massive challenge and might take years to get right but the endless copy and paste is wearing me down.
Hn people are too poor to pay for max?
Meh, Anthropic are dead to me until they have structured output.
I really want to like Claude, but I hit their limit WAY too early when I PAID for it, 9 months ago, WAY before I hit any type of limit on gippity. (gippity - gpt , gimminy - gemini).
I like it, but giving Claude a "Deep Research" mode would be better.
From that article:
> According to the report, Anthropic was holding talks with Amazon, the company’s major investor and partner, and voice-focused AI startup ElevenLabs, to possibly drive future voice features for Claude.
> It’s unclear which of those partnerships, if any, came to fruition.
Here's an easy way to confirm that: check Anthropic's "Trust Center" and review any recent updates. https://trust.anthropic.com/updates
Sure enough, on May 29th they have a subprocessor change:
> As of May 29th, 2025, we have added ElevenLabs, which supports text to speech functionality in Claude for Work mobile apps.
I wonder what they're using for speech-to-text?