Here’s all the AI news you missed over the last few days!
Join My Newsletter for Regular AI Updates 👇🏼
https://www.matthewberman.com
My Links 🔗
👉🏻 Main Channel: https://www.youtube.com/@matthew_berman
👉🏻 Clips Channel: https://www.youtube.com/@MatthewBermanClips
👉🏻 Twitter: https://twitter.com/matthewberman
👉🏻 Discord: https://discord.gg/xxysSXBxFW
👉🏻 Patreon: https://patreon.com/MatthewBerman
👉🏻 Instagram: https://www.instagram.com/matthewberman_ai
👉🏻 Threads: https://www.threads.net/@matthewberman_ai
👉🏻 LinkedIn: https://www.linkedin.com/company/forward-future-ai
Media/Sponsorship Inquiries ✅
https://bit.ly/44TC45V
source
Advanced Voice Mode W or L?
Hate when demos are faked. Btw, back then I subscribed and felt betrayed about the advanced voice feature not actually being out yet, cancelled the subscription after the first month.
Test molmo
Wow, these news are MASSIVE
i like the fact that you used EU accents and not American only like other small youtubers that only showcase American accents.
Wow you have become a marketing shill….what a waste….
molmo test now
I didn't expect to have witness this on my lifetime.
AI is another step where people will focus more on the outside and not listen their own inner feelings. This will in my opinion lead to bigger dependency on media and other content they will consume.
Ask malmo give you a JSON schema on a bill picture 🙂 It creates it in chinese
Advanced Voice mode banned by the EU and for some reason the UK not getting it even though the UK is not in the EU 🤨
HAHA love it when an American tells Europeans to get their politicians in order. 🤣😂
Matthew You Look Little FAT
I am in the EU but it and advanced voice works normally, though my phone is on VPN in Israel :X
I know Moshi Voice-to-Voice Chat is a simple and small model with lots of problems, but I find it unfortunate that when it was demoed, everyone dumped on it harshly, especially for a fledgling, truly open source voice model.
Everyone was even more hyperfixated on the "Open" AI demo at that time that wasn't even released. I'm not nearly as impressed with the reality of it. It's not nearly the image they built up. Buggy, laggy, jittery, no video or stills, interruptions don't hold over long enough, lots of refusal for things like singing, voices feel less natural, etc.
I personally think the gap in the voice-to-voice experience isn't that big. The model supporting Moshi is really dumb, sure, but it's also actually open source and they were aiming on trying to get it up and running locally on consumer grade hardware in months from scratch. There has to be more potential in this project.
To me, it's a marval that a tiny not-for-profit research lab's team pulled it together so fast. A lot of people are talking about how we need to support open source, so here we have a chance to keep Moshi or the possibility of other open source models in our awareness.
I'm following this project, and interested to learn of any open source projects that are working on voice-to-voice.
Bearing in mind that Meta's voice mode source code can likely be reviewed for research purposes, but may have strict restrictions on redistribution.
The Lab for Moshi:
https://kyutai.org/
[Source Code (GitHub)](https://github.com/kyutai-labs/moshi)
This guy followed up with a good overview when the source code got released: [YouTube](https://youtu.be/JKA_v5Bb_tI?si=MgfAYMA8QFLOSekC)
We don't have to get anything in order, you have to get your data processing in order.
02:00 | "Whiskers, he get'a hungry!" -LOL
Please trial Molmo
"I can't even get it to sing to me, which is a basic use case"…
Great video discussing the latest AI news! Informative and engaging content. Keep up the excellent work, Matthew Berman!
Its cool but sadly is from meta
You gotta realize that hobbyist TTS folk have nailed this stuff long time ago, uncensored. Their voicemode really isnt a "thing". Aside from the nifty accents, but that's it.
You began with a mistake: NO, advanced voice mode is NOT available to everyone with a paid ChatGPT account. And there is still no clear information whether it will be (EU countries+ a few others)
Frontier model company valuations will look silly in a few short years when they're commoditized by open source.
The real value is, as Satya said, in how you leverage LLM's as tools and not in building LLM's.
Anyone who works with these models every day, will be very reluctant to claim that openai is so "ahead" of Anthropic. I still prefer claude for most of my work. Some times I even get better results than o1
EU . grrr. So annoying. Long live the vpn. And open source llama.
Meta also has an advance voice, It dropped for me today, is in Facebook messenger.
Old news.
Imagine living in EU. All those politicians afraid of technology. Sucks to sucks
I can run 1B, 3B and 7B models with my GTX 1650, no need for cloud gpu what is nice.
It's not available on either phone or laptop here in the SF Bay Area. Is this all BS?
It's not available on either phone or laptop in SF Bay Area.
Is this the guy from Snuff Box?