Moshi, an audio-first chatbot released in 2024, enables conversations up to 5 minutes with a chatbot. At the time of this review, the tool appears to only offer one accent and one voice, with a somewhat calm and a little bit sarcastic use of intonation. Because Moshi is trained on “synthetic audio,” it may sound a little synthetic, but it does do immediate responses and interruptions well. Currently in beta testing, it will be interesting to see how this tool develops over the coming months.
