Deepgram - Detailed Review

Deepgram - Product Overview

Deepgram is a sophisticated speech recognition and transcription tool that leverages artificial intelligence to convert spoken language into written text. Here’s a brief overview of its primary function, target audience, and key features:

Primary Function

Deepgram’s primary function is to provide accurate and efficient speech-to-text transcription. It uses advanced algorithms to transcribe audio and video content into written text, both in real time and for pre-recorded files.

Target Audience

Deepgram’s services are targeted at a variety of industries and users, including but not limited to:

Social media platforms for adding closed captions and improving ad targeting
Customer service and contact centers for analyzing and transcribing customer interactions
Media companies for transcribing audio and video content
Developers and businesses looking to integrate speech recognition into their applications

Key Features

Accurate Speech Recognition: Deepgram uses advanced AI models to accurately transcribe spoken language, even in complex audio environments and with diverse accents.
Real-time Processing: It offers real-time speech recognition capabilities, allowing for immediate transcription and analysis of live audio streams or recordings.
Customizable Models: Users can customize speech recognition models to specific use cases and industries, ensuring optimal performance and accuracy.
Language Support: Deepgram supports a wide range of languages, enabling transcription and analysis of audio content in multiple languages.
Speaker Diarization: It can identify and differentiate between multiple speakers in an audio recording, providing valuable insights into who is speaking and when.
Noise Reduction: The tool includes noise reduction capabilities to enhance the accuracy of speech recognition by minimizing the impact of background noise.
Text-to-Speech: With the introduction of Deepgram Aura, the platform also supports text-to-speech functionalities, enabling apps to ‘speak’ back to users.
Low Latency and Multiple Integrations: Deepgram ensures minimal delay in real-time transcription and integrates seamlessly with various programming environments such as Python, JavaScript, and Node.js.

Overall, Deepgram is a versatile tool that offers a comprehensive set of APIs and features to meet various speech recognition and transcription needs across different industries.

Deepgram - User Interface and Experience

User Interface Overview

The user interface of Deepgram, particularly in its speech tools and AI-driven products, is designed with a focus on ease of use, clarity, and efficiency.

Sign-up and Onboarding

To get started, users can sign up for Deepgram’s services through their website. The sign-up process is straightforward, and once registered, users can access the Deepgram dashboard, where they can create new speech recognition models or integrate text-to-speech capabilities like Deepgram Aura.

Dashboard and Model Creation

The dashboard is user-friendly, allowing users to easily create new speech recognition models by selecting the “Create Model” option. Here, users can upload their audio or video content for transcription. The interface guides users through the process of customizing their models to fit specific use cases and industries, ensuring optimal performance and accuracy.

Real-time Transcription and Analysis

Deepgram offers real-time speech recognition capabilities, enabling immediate transcription and analysis of live audio streams. This feature is particularly valuable for applications such as live captioning, real-time communication aids, or immediate transcription needs during meetings and conferences. The interface displays the transcription in real time, making it easy to monitor and analyze the audio content.

Customization and Integration

Users can customize their speech recognition models by training them on specific audio or video content. This customization is facilitated through an intuitive interface that allows users to fine-tune their models for better accuracy in various fields such as medical, legal, or technical industries. The API documentation and examples provided make it easy to integrate Deepgram’s speech recognition technology into existing workflows and applications.

Text-to-Speech (TTS) Interface

For the text-to-speech component, Deepgram Aura, the interface is equally user-friendly. Users can convert text into natural-sounding audio using a diverse set of male and female voices. The TTS interface supports batch processing and real-time text-to-speech with low latency, making it suitable for real-time voicebots and conversational AI applications. Users can try out the voices and see how their text is converted into audio through interactive demos available on the product page.

Accessibility and Documentation

Deepgram’s platform is built with accessibility in mind, transforming text to speech to aid users with visual impairments or reading challenges. The documentation is comprehensive and easy to follow, with many examples and tutorials that help developers get started quickly. The site is structured to cater to developers and tech decision-makers, ensuring that all necessary information is easily accessible.

Overall User Experience

The overall user experience with Deepgram is positive, with users appreciating the ease of integration, configurability, and the availability of helpful examples and tutorials. The platform’s ability to handle multiple languages, real-time transcription, and customizable models enhances user engagement and streamlines processes. While there may be minor issues such as occasional out-of-date documentation, the overall feedback suggests that Deepgram provides a practical and versatile tool for converting speech to text and generating high-quality audio.

Deepgram - Key Features and Functionality

Deepgram Overview

Deepgram is a sophisticated speech recognition and transcription service that leverages advanced AI technologies to convert spoken language into written text. Here are the main features and how they work:

Speech-to-Text Transcription

Deepgram’s core feature is its speech-to-text technology, which accurately transcribes spoken language into written text. This is achieved using end-to-end deep learning models that can handle diverse accents, dialects, and noisy environments, ensuring high accuracy rates even in real-world scenarios.

Real-Time and Pre-Recorded Transcription

Deepgram supports both real-time and pre-recorded transcription. For real-time transcription, it can convert live audio streams into text with minimal latency, making it ideal for applications such as live captioning, real-time customer support, and interactive voice response (IVR) systems. It also handles pre-recorded audio files with equal accuracy.
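
For pre-recorded files, a request might look like the following minimal sketch. The endpoint URL, the `Token` authorization scheme, the `model` and `smart_format` parameters, and the response layout are assumptions based on Deepgram's public REST API at the time of writing and should be checked against the current API reference. The function names are hypothetical.

```python
import json
import urllib.parse
import urllib.request

# Assumed endpoint for pre-recorded transcription; verify in the API reference.
DEEPGRAM_LISTEN_URL = "https://api.deepgram.com/v1/listen"


def build_transcription_request(api_key, audio_url, model="nova-2"):
    """Build (but do not send) a request to transcribe a publicly hosted file."""
    query = urllib.parse.urlencode({"model": model, "smart_format": "true"})
    body = json.dumps({"url": audio_url}).encode("utf-8")
    return urllib.request.Request(
        f"{DEEPGRAM_LISTEN_URL}?{query}",
        data=body,
        headers={
            "Authorization": f"Token {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def extract_transcript(response_json):
    """Read the first transcript from the assumed response layout."""
    channels = response_json["results"]["channels"]
    return channels[0]["alternatives"][0]["transcript"]


def transcribe(api_key, audio_url):
    """Send the request and return the transcript text (needs network access)."""
    request = build_transcription_request(api_key, audio_url)
    with urllib.request.urlopen(request) as response:
        return extract_transcript(json.load(response))
```

Keeping request construction separate from sending makes the sketch easy to inspect without an API key. In practice the official SDKs wrap these details.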

Text-to-Speech

In addition to speech-to-text, Deepgram offers text-to-speech capabilities, allowing applications to generate high-quality audio responses. This feature is particularly useful for building voice AI agents that can engage in natural-sounding conversations with users.
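
A text-to-speech call could be sketched along the same lines. The `/v1/speak` endpoint, the `model` query parameter, and the voice name `aura-asteria-en` are assumptions drawn from Deepgram's Aura documentation at launch. The helper names are hypothetical, so this is an illustration rather than a reference implementation.

```python
import json
import urllib.parse
import urllib.request

# Assumed text-to-speech endpoint; verify in the API reference.
DEEPGRAM_SPEAK_URL = "https://api.deepgram.com/v1/speak"


def build_speech_request(api_key, text, voice="aura-asteria-en"):
    """Build (but do not send) a request asking for spoken audio of `text`."""
    query = urllib.parse.urlencode({"model": voice})
    return urllib.request.Request(
        f"{DEEPGRAM_SPEAK_URL}?{query}",
        data=json.dumps({"text": text}).encode("utf-8"),
        headers={
            "Authorization": f"Token {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def synthesize_to_file(api_key, text, path):
    """Send the request and write the returned audio bytes to `path`."""
    request = build_speech_request(api_key, text)
    with urllib.request.urlopen(request) as response, open(path, "wb") as out:
        out.write(response.read())
```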

Audio Intelligence

Deepgram’s audio intelligence features go beyond simple transcription. The technology can analyze audio to detect sentiment, intent, and topics within conversations. This allows businesses to gain valuable insights into customer behavior and preferences, which can be used to improve customer service and tailor marketing strategies.

Customizable Workflows and Models

Users can customize transcription workflows to fit specific needs. This includes the ability to filter, summarize, and perform sentiment analysis on the transcribed text. Deepgram also allows for the customization of speech recognition models to specific use cases and industries, ensuring optimal performance and accuracy.

Language Support

Deepgram supports transcription and analysis of audio content in multiple languages, making it a versatile tool for global applications.

Speaker Diarization

Deepgram can identify and differentiate between multiple speakers in an audio recording, providing valuable insights into who is speaking and when. This feature is particularly useful for transcribing meetings, interviews, and other multi-speaker conversations.
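
To illustrate what diarized output enables, the sketch below collapses a word list into speaker turns. It assumes each word arrives as a dictionary with `word` and `speaker` keys, which matches the general shape of diarized transcripts but is not guaranteed to be Deepgram's exact schema. The sample data is invented.

```python
from itertools import groupby


def speaker_turns(words):
    """Collapse consecutive words from the same speaker into (speaker, text) turns."""
    turns = []
    for speaker, group in groupby(words, key=lambda w: w["speaker"]):
        turns.append((speaker, " ".join(w["word"] for w in group)))
    return turns


# Invented sample in the assumed word format.
sample_words = [
    {"word": "hello", "speaker": 0},
    {"word": "there", "speaker": 0},
    {"word": "hi", "speaker": 1},
    {"word": "thanks", "speaker": 0},
]

print(speaker_turns(sample_words))
# [(0, 'hello there'), (1, 'hi'), (0, 'thanks')]
```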

Noise Reduction

The platform includes noise reduction capabilities, which enhance the accuracy of speech recognition by minimizing the impact of background noise and improving overall transcription quality.

Integration and Scalability

Deepgram’s API integrates seamlessly with various programming environments, including Python, JavaScript, and Node.js, thanks to SDKs available on GitHub. This makes it easy to incorporate Deepgram’s speech recognition technology into existing workflows and applications. The service is also scalable, making it suitable for a wide range of applications from media transcription to customer service.

Conclusion

By integrating these features, Deepgram provides a comprehensive suite of AI-driven speech recognition tools that can significantly enhance user experience, operational efficiency, and business intelligence across various industries.

Deepgram - Performance and Accuracy

Performance Evaluation of Deepgram in Speech-to-Text

Accuracy

Deepgram is renowned for its high accuracy rates. The platform utilizes advanced machine learning techniques to enhance transcription quality, with users reporting accuracy levels exceeding 95% in optimal conditions. Deepgram’s models, such as the Nova model, have been benchmarked to be 22% more accurate than the nearest competitors and even outperform Amazon’s models by a significant margin.

Speed

Deepgram’s API is optimized for real-time transcription, making it highly suitable for applications that require immediate feedback, such as live captioning. The platform can transcribe an hour of pre-recorded audio in about 12 seconds and boasts latency as low as 300 milliseconds, which is crucial for human-like conversational AI experiences.

Features and Capabilities

Deepgram offers a range of features that enhance the transcription experience. These include speaker identification, punctuation restoration, and word-level timestamps. Additionally, Deepgram supports over 30 languages and dialects, making it versatile for global applications.

Integration and Usability

Deepgram provides a straightforward API that can be easily integrated into existing applications. The comprehensive documentation includes clear examples for developers, facilitating a smooth integration process.

Limitations and Areas for Improvement

One of the notable limitations is the processing time cap for certain models. Deepgram’s own models (Nova, Enhanced, Base, etc.) are capped at 10 minutes of processing time, while the managed Whisper model is capped at 20 minutes. This can be a constraint for transcribing longer audio or video files. To mitigate this, users can pre-process video files by stripping the audio, which allows for more efficient processing within the time limits.
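
The audio-stripping workaround can be sketched with ffmpeg, assuming it is installed and on the PATH. The `-vn` flag drops the video stream. The mono, 16 kHz WAV output is an illustrative choice to keep files small, not a Deepgram requirement, and would be unsuitable for multichannel recordings.

```python
import subprocess
from pathlib import Path


def audio_extraction_command(video_path, audio_path=None):
    """Return an ffmpeg command that writes only the audio track of a video."""
    video = Path(video_path)
    audio = Path(audio_path) if audio_path else video.with_suffix(".wav")
    return [
        "ffmpeg",
        "-i", str(video),  # input video
        "-vn",             # drop the video stream
        "-ac", "1",        # downmix to mono (illustrative choice)
        "-ar", "16000",    # resample to 16 kHz (illustrative choice)
        str(audio),
    ]


def strip_audio(video_path, audio_path=None):
    """Run ffmpeg and return the path of the extracted audio file."""
    command = audio_extraction_command(video_path, audio_path)
    subprocess.run(command, check=True)
    return command[-1]
```

The extracted file is typically much smaller than the source video, so less of the time limit is spent on data that is not transcribed.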

Latency Challenges

While Deepgram excels in speed, latency remains a general challenge in the speech-to-text and text-to-speech space. Reducing the latency to make machine responses feel more human-like is an ongoing effort, with the goal of bringing response times closer to human conversation standards.

Conclusion

In summary, Deepgram stands out for its high accuracy, real-time transcription capabilities, and ease of integration. However, users need to be aware of the processing time limits and the ongoing efforts to reduce latency for even more seamless interactions.

Deepgram - Pricing and Plans

Deepgram Pricing Overview

Deepgram offers a flexible and scalable pricing structure for its AI-driven speech products, catering to a wide range of business needs. Here’s a breakdown of the different tiers and the features available in each plan:

Pay As You Go Plan

This plan starts with a free tier that includes $200 in credit, which translates to up to 45,000 free minutes of transcription.
Users have access to all endpoints and public models.
Key features include:

Up to 100 concurrent requests for Deepgram speech-to-text models.
Up to 5 concurrent requests for Deepgram Whisper Cloud.
Up to 2 concurrent requests and up to 480 requests/min for Deepgram Aura text-to-speech.
Up to 10 concurrent requests for Deepgram Audio Intelligence.
Discord and community support.
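
Concurrency limits like those above are usually respected on the client side by capping the number of requests in flight. The following is a generic asyncio sketch, not Deepgram-specific code. `run_limited` and `fake_transcription` are hypothetical names, and the fake job stands in for a real API call.

```python
import asyncio


async def run_limited(jobs, limit):
    """Run coroutine functions with at most `limit` of them in flight at once."""
    semaphore = asyncio.Semaphore(limit)

    async def guarded(job):
        # Wait for a free slot before starting the job.
        async with semaphore:
            return await job()

    # gather() returns results in the same order as the jobs were given.
    return await asyncio.gather(*(guarded(job) for job in jobs))


async def fake_transcription():
    """Placeholder for a real request; sleeps briefly instead of calling an API."""
    await asyncio.sleep(0.01)
    return "transcript"
```

A caller on a plan with a limit of 100 concurrent speech-to-text requests could run `asyncio.run(run_limited(jobs, limit=100))`, where `jobs` is a list of coroutine functions.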

Growth Plan

Priced between $4,000 and $10,000 per year, this plan comes with pre-paid credits that are redeemed against actual usage.
Users get access to all endpoints and public models at favorable discounts.
Features include:

Up to 100 concurrent requests for Deepgram speech-to-text models.
Up to 5 concurrent requests for Deepgram Whisper Cloud.
Up to 2 concurrent requests and up to 480 requests/min for Deepgram Aura text-to-speech.
Up to 10 concurrent requests for Deepgram Audio Intelligence.
Discord and community support.

Enterprise Plan

This plan is customized for businesses with large volumes of data, deployment requirements, or specific support needs.
Features include:

Access to all endpoints and public models with the best discounts.
Access to custom-trained speech-to-text models.
Priority access to new endpoints and models.
Highest concurrency support.
Private cloud or on-prem deployments.
Premium SLAs.
Dedicated support teams and email support.
Discord and community support.

Text-to-Speech (TTS) Pricing

For Deepgram’s TTS services, the pricing is based on character usage:

Pay-As-You-Go: $0.0150 per 1,000 characters, suitable for developers or businesses with occasional or small-scale usage.
Growth: $0.0135 per 1,000 characters, suitable for organizations with consistent and mid-range TTS requirements.
Enterprise: Custom pricing for large companies requiring scalable solutions and additional features.
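
As a worked example of the per-character rates above, the sketch below estimates text-to-speech cost for a given volume. The rates are the ones quoted in this review and may have changed. The function and dictionary names are hypothetical.

```python
from decimal import Decimal

# Rates per 1,000 characters as quoted in this review.
TTS_RATE_PER_1K_CHARS = {
    "pay_as_you_go": Decimal("0.0150"),
    "growth": Decimal("0.0135"),
}


def tts_cost(characters, plan):
    """Estimated cost in dollars for synthesizing `characters` characters."""
    return TTS_RATE_PER_1K_CHARS[plan] * characters / 1000


# One million characters on each plan.
print(tts_cost(1_000_000, "pay_as_you_go"))
print(tts_cost(1_000_000, "growth"))
```

At these rates, one million characters costs $15.00 on Pay-As-You-Go and $13.50 on Growth.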

Free Options

Deepgram offers a free tier within the Pay As You Go plan, which includes $200 in credit. This allows users to start with up to 45,000 free minutes of transcription without requiring a credit card.

Conclusion

In summary, Deepgram’s pricing structure is usage-based and highly scalable, allowing businesses to choose a plan that aligns with their specific needs and budget. The transparent pricing model ensures that users are aware of their costs upfront, eliminating surprise fees and hidden charges.

Deepgram - Integration and Compatibility

Integrations via Zapier

Deepgram can be seamlessly integrated with over 7,000 apps through Zapier, a popular automation tool. This integration allows users to automate workflows without needing any coding. For example, you can create transcriptions of new audio files added to Dropbox folders, convert and transcribe audio files in Amazon S3 using CloudConvert and Deepgram, or transform Chatfuel triggers into Deepgram speech-to-text API requests.

Integration with Glide

For users of Glide, a no-code app development platform, integrating Deepgram involves using Glide’s Call API to send audio files to Deepgram for transcription. This process requires obtaining the necessary API keys from Deepgram and setting up the connection through Glide’s API call functionality. This integration helps in transcribing audio from a column in Glide and storing the transcription in another column.

Integration with Discourse

Deepgram can be integrated with Discourse, a community forum software, to enable real-time transcription of audio contributions. This integration, facilitated through platforms like Latenode, allows for the transcription of spoken discussions into text within Discourse threads, enhancing accessibility and user convenience. It also supports automated voice responses, where voice messages are converted into text posts.

Integration with AudioCodes

Deepgram has an integration with AudioCodes’ VoiceAI Connect platform, which is used in contact centers to provide speech-to-text (STT) services. This integration enables the use of Deepgram’s streaming STT services within AudioCodes’ voicebot connectivity platform, improving the speed, accuracy, and ROI of customer interactions. Users can set up this integration by creating a Deepgram account and providing the necessary API keys and WebSocket URL to AudioCodes.
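
The WebSocket URL mentioned above generally carries the streaming options as query parameters. The sketch below builds such a URL. The `wss://api.deepgram.com/v1/listen` endpoint and the `model`, `encoding`, `sample_rate`, and `interim_results` parameters are assumptions based on Deepgram's streaming documentation. The 8 kHz default reflects typical telephony audio, not a requirement stated by AudioCodes.

```python
import urllib.parse

# Assumed streaming endpoint; verify in the API reference.
DEEPGRAM_STREAM_URL = "wss://api.deepgram.com/v1/listen"


def streaming_url(model="nova-2", encoding="linear16", sample_rate=8000):
    """Build a WebSocket URL for live transcription with the given audio settings."""
    query = urllib.parse.urlencode(
        {
            "model": model,
            "encoding": encoding,        # raw audio encoding of the stream
            "sample_rate": sample_rate,  # in Hz
            "interim_results": "true",   # ask for partial transcripts
        }
    )
    return f"{DEEPGRAM_STREAM_URL}?{query}"


print(streaming_url())
```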

General API Compatibility

Deepgram provides APIs for speech-to-text, text-to-speech, and full speech-to-speech voice agents, making it compatible with a wide range of applications. These APIs can be used to transcribe speech with high accuracy, speed, and cost-effectiveness. The platform supports deployment on-premises as well as in public and private cloud environments.

Conclusion

In summary, Deepgram’s flexibility and extensive API capabilities make it highly compatible with various platforms and tools, allowing for seamless integration to enhance user experiences across different applications.

Deepgram - Customer Support and Resources

Support Options

Deepgram offers a comprehensive set of customer support options and additional resources to ensure users can effectively utilize their Speech to Text and other AI-driven tools.

Support Plans

Deepgram provides various support levels depending on the user’s plan:

Pay As You Go: This plan includes Discord and community support, allowing users to interact with a community of over 2,000 members and access answers to over 1,300 questions.
Growth Plan: Like Pay As You Go, this plan includes Discord and community support.
Enterprise Plan: For businesses with larger needs, Deepgram offers dedicated support teams, email support, and premium SLAs. The Enterprise plan also includes priority access to new endpoints and models, as well as private cloud or on-prem deployments.

Resources

Documentation and Guides: Deepgram provides detailed documentation and guides to help users integrate their services seamlessly, reducing the time and effort required to get started.
Community: The Deepgram community is active, with over 2,000 members and more than 1,300 questions answered. This community support can be invaluable for troubleshooting and learning best practices.
Playground: Deepgram offers a free playground environment where users can try out their APIs without committing to a purchase, including $200 in free credits for transcription or text-to-speech services.
API Access: Users have access to all endpoints and public models, depending on their plan, including speech-to-text, text-to-speech, and audio intelligence models.

Additional Tools and Features

Free Transcription Tool: Deepgram offers a free transcription tool that allows users to transcribe audio files quickly and accurately.
AI Voice Generator: They also provide a free AI voice generator, which can be used to create human-like voice outputs for various applications.
Advanced Features: Deepgram’s tools include advanced features such as sentiment analysis, summarization, classification of audio content, and language detection and translation, which can be particularly useful for customer support applications, such as analyzing call transcripts and improving customer service interactions.

By providing these support options and resources, Deepgram ensures that users can effectively integrate and utilize their AI-driven speech tools to enhance their operations.

Deepgram - Pros and Cons

Advantages of Deepgram

Deepgram offers several significant advantages that make it a standout in the AI-driven speech tools category:

High Accuracy

Deepgram is renowned for its highly accurate speech-to-text conversion, with its latest model, Deepgram Nova-2, achieving a 30% reduction in word error rate (WER) compared to competitors.

Low Latency

The platform provides real-time transcription with latency times of under 300 milliseconds, making it ideal for live applications and immediate feedback.

Multi-Language Support

Deepgram supports transcription and analysis in over 30 languages and dialects, catering to a wide range of global users.

Customizable Models

Users can train custom speech recognition models on their specific data, enhancing accuracy for unique vocabularies and use cases.

Advanced Features

Deepgram includes features like speaker diarization, sentiment analysis, and topic detection, which are invaluable for tasks such as content analysis and customer service.

Real-Time and Pre-Recorded Transcription

The API can handle both real-time audio streams and pre-recorded files, making it versatile for various applications.

Cost-Effective

Deepgram offers competitive pricing, starting at $0.0043 per minute, which is 3 to 5 times lower than many competitors.

Easy Integration

The API integrates seamlessly with various programming environments, including Python, JavaScript, and Node.js, using available SDKs.

Disadvantages of Deepgram

While Deepgram is highly regarded, there are some limitations and considerations:

Technical Expertise

Setting up and using Deepgram may require some technical expertise, which could be a barrier for non-technical users.

Limited User Feedback

There is limited user feedback available online, which might make it harder for new users to gauge the full range of user experiences.

Pricing Structure

The pricing structure may not suit all budgets, although it is generally cost-effective for many use cases.

Text-to-Speech Accuracy

While Deepgram’s speech-to-text is highly accurate, the text-to-speech functionality could be improved in terms of accuracy.

Language Support Limitations

Although Deepgram supports many languages, it supports fewer than some other providers, particularly among less widely used languages.

Overall, Deepgram’s strengths in accuracy, speed, and customization make it a powerful tool for speech recognition and transcription, despite some minor drawbacks.

Deepgram - Comparison with Competitors

When comparing Deepgram to other speech-to-text tools in the AI-driven product category, several key features and differences stand out.

Accuracy and Speed

Deepgram is notable for its high accuracy and speed. It is nearly 30% more accurate and over 30 times faster than Speechmatics, and nearly 40% more accurate and up to 5 times faster than AssemblyAI.

Deepgram’s advanced algorithms and deep learning models, such as the Nova neural network, enable it to achieve a lower Word Error Rate (WER) compared to its competitors, making it a preferred choice for applications requiring precise transcription.
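
Word Error Rate is conventionally the word-level edit distance between a reference transcript and the system output, divided by the number of reference words. The sketch below computes it with a standard dynamic-programming edit distance. It illustrates the metric only and is not the methodology behind the benchmark figures quoted in this review. The example sentences and error rates are invented.

```python
def word_error_rate(reference, hypothesis):
    """WER = (substitutions + deletions + insertions) / reference word count."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference transcript must contain at least one word")

    # previous[j] holds the edit distance between ref[:i-1] and hyp[:j].
    previous = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        current = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = previous[j - 1] + (ref_word != hyp_word)
            deletion = previous[j] + 1
            insertion = current[j - 1] + 1
            current.append(min(substitution, deletion, insertion))
        previous = current
    return previous[-1] / len(ref)


def relative_wer_reduction(baseline_wer, new_wer):
    """Fraction by which `new_wer` improves on `baseline_wer`."""
    return (baseline_wer - new_wer) / baseline_wer


print(word_error_rate("the quick brown fox", "the quick brown box"))  # 0.25
```

Under this definition, a "30% reduction in WER" is a relative figure: a hypothetical drop from a 10% error rate to 7% would qualify, even though the absolute difference is three percentage points.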

Customization and Flexibility

Deepgram offers customizable speech recognition models that can be trained on customer-specific data. This is particularly useful for industries with specialized jargon, accents, or unique speech patterns. This feature is not as prominently highlighted in competitors like Speechmatics and AssemblyAI, although they may offer some level of customization.

Real-Time Processing and Low Latency

Deepgram provides real-time speech recognition capabilities with low latency, processing audio in less than 300 milliseconds. This real-time processing is crucial for applications such as live captioning, contact centers, and voice AI agents.

Language Support and File Formats

Deepgram supports over 30 languages and more than 40 file formats, making it versatile for a wide range of applications. This extensive language and file format support is a significant advantage over some competitors.

Additional Features

Speaker Diarization: Deepgram can identify and differentiate between multiple speakers in an audio recording, a feature that is highly valuable for transcription and analysis tasks.
Noise Reduction: Deepgram includes noise reduction capabilities, which enhance the accuracy of speech recognition by minimizing the impact of background noise.
Audio Intelligence: Deepgram offers advanced audio intelligence features such as summarization, sentiment analysis, and topic detection, which go beyond basic transcription services.

Integration and Deployment

Deepgram provides flexible deployment options, including self-hosted, on-premise, and cloud-based solutions, which can be integrated seamlessly into various workflows using its API. This flexibility is similar to what is offered by AssemblyAI but may differ from Speechmatics in terms of ease and range of deployment options.

Cost and Value

Deepgram is generally more affordable than its competitors, being 3 times cheaper than Speechmatics and 2.5 times cheaper than AssemblyAI. This cost-effectiveness, combined with its superior accuracy and speed, makes it an attractive option for many users.

Potential Alternatives

Speechmatics: While less accurate and slower than Deepgram, Speechmatics might still be considered for specific use cases where Deepgram’s features are not necessary. However, Deepgram’s significant advantages in accuracy, speed, and cost make it a more compelling choice.
AssemblyAI: AssemblyAI offers some similar features but is outperformed by Deepgram in terms of accuracy, speed, and cost. It could still be an alternative for use cases where the additional accuracy and speed of Deepgram are not critical.

Conclusion

In summary, Deepgram stands out due to its high accuracy, speed, customization options, and comprehensive feature set, making it a top choice in the speech-to-text AI-driven product category. However, other tools like Speechmatics and AssemblyAI may still be viable alternatives depending on specific needs and budget constraints.

Deepgram - Frequently Asked Questions

What is Deepgram?

Deepgram is a speech recognition and transcription tool that uses artificial intelligence to convert spoken language into written text. It utilizes advanced algorithms to provide accurate and efficient transcription services.

How accurate is Deepgram’s speech recognition?

Deepgram’s speech recognition is highly accurate due to its advanced algorithms and deep learning models. These models are trained to handle background noise, cross-talk, unique dialects, and accents, ensuring high accuracy in transcription.

What features does Deepgram offer?

Accurate Speech Recognition: Transcribes spoken language into written text with high accuracy.
Real-time Processing: Provides real-time transcription for live audio streams or recordings.
Customizable Models: Allows users to customize speech recognition models for specific use cases and industries.
Language Support: Supports transcription in multiple languages.
Speaker Diarization: Identifies and differentiates between multiple speakers in an audio recording.
Noise Reduction: Enhances transcription quality by minimizing the impact of background noise.

How do I use Deepgram?

Sign up: Register for Deepgram’s services on their website.
Create a new model: Select the “Create Model” option in the Deepgram dashboard.
Upload audio or video content: Upload the audio or video files you want to transcribe.
Transcribe content: Use Deepgram’s real-time or pre-recorded transcription services.
Customize your model: Train your speech recognition model on your specific audio or video content.
Integrate with your workflow: Integrate Deepgram’s API into your existing workflows and applications.

What pricing plans does Deepgram offer?

Pay As You Go: Based on the amount of audio data processed, starting with $200 in free credit.
Growth: Suitable for growing businesses with increasing usage, priced between $4,000 and $10,000 per year, with pre-paid credits redeemed against actual usage at discounted rates.
Enterprise: Custom pricing for large organizations, with additional features such as the highest concurrency limits, custom-trained models, and dedicated support.

How does Deepgram’s pricing model work?

Deepgram’s pricing model is usage-based, meaning you pay based on the duration of the audio data processed. Rates vary between pre-recorded and real-time transcription, with specific rates such as $0.0043/min for pre-recorded and $0.0059/min for real-time transcription.
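
The per-minute rates above make cost estimates a matter of simple arithmetic. The sketch below uses the rates quoted in this review, which may have changed, together with hypothetical function names.

```python
from decimal import Decimal

# Per-minute rates as quoted in this review.
RATE_PER_MINUTE = {
    "pre_recorded": Decimal("0.0043"),
    "real_time": Decimal("0.0059"),
}


def transcription_cost(minutes, mode):
    """Estimated cost in dollars for transcribing `minutes` of audio."""
    return RATE_PER_MINUTE[mode] * minutes


def minutes_for_budget(budget_dollars, mode):
    """Whole minutes of audio that fit within `budget_dollars`."""
    return int(Decimal(budget_dollars) / RATE_PER_MINUTE[mode])


print(transcription_cost(600, "pre_recorded"))   # ten hours of recordings
print(minutes_for_budget(200, "pre_recorded"))   # 46511
print(minutes_for_budget(200, "real_time"))      # 33898
```

At the pre-recorded rate, the $200 free credit covers roughly 46,500 minutes, which is in line with the "up to 45,000 free minutes" figure cited for the free tier. At the real-time rate it covers about 33,900 minutes.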

Can Deepgram handle multiple languages?

Yes, Deepgram supports transcription in a wide range of languages, making it versatile for various international applications.

Does Deepgram offer real-time transcription?

Yes, Deepgram provides real-time speech recognition capabilities, allowing for immediate transcription and analysis of live audio streams or recordings.

How does Deepgram handle background noise and speaker diarization?

Deepgram includes noise reduction capabilities to enhance transcription accuracy by minimizing background noise. It also offers speaker diarization, which identifies and differentiates between multiple speakers in an audio recording.

Can I integrate Deepgram with my existing workflows and applications?

Yes, Deepgram provides APIs that integrate seamlessly with various programming environments, including Python, JavaScript, and Node.js. This allows you to incorporate Deepgram’s speech recognition technology into your existing workflows and applications.

Does Deepgram offer any trial or testing options?

Yes, Deepgram provides an API playground where developers can test and experiment with the API’s features before deciding on a full-scale implementation.

Deepgram - Conclusion and Recommendation

Final Assessment of Deepgram

Deepgram is a highly advanced speech recognition and transcription tool that leverages artificial intelligence to convert spoken language into written text with remarkable accuracy and speed. Here’s a comprehensive overview of its features and who would benefit most from using it.

Key Features

Accurate Speech Recognition: Deepgram uses advanced algorithms to transcribe spoken language into text, ensuring high accuracy even in the presence of background noise.
Real-time Processing: It offers real-time speech recognition, allowing for immediate transcription and analysis of live audio streams or recordings.
Customizable Models: Users can customize speech recognition models to fit specific use cases and industries, enhancing performance and accuracy.
Multi-Language Support: Deepgram supports over 30 languages and various accents, making it versatile for global applications.
Speaker Diarization: It can identify and differentiate between multiple speakers in an audio recording, which is valuable for meetings, interviews, and other multi-speaker scenarios.
Noise Reduction: The platform includes noise reduction capabilities to improve transcription quality.
Text-to-Speech and Speech-to-Text: Deepgram offers both text-to-speech and speech-to-text services, enabling natural voice generation and accurate transcription.

Who Would Benefit Most

Businesses: Companies, especially those in customer service, can benefit from Deepgram’s automated transcription and analysis of customer interactions, helping to monitor employee performance and improve service quality.
Media and Content Creators: Journalists, bloggers, and media professionals can use Deepgram to automate the transcription of podcasts and interviews and to generate video subtitles, streamlining content creation.
Developers: Over 200,000 developers use Deepgram to build voice AI applications, including voice assistants, IVR systems, and accessibility solutions, due to its high accuracy, low latency, and flexible API integration.
Researchers and Innovators: Scientists and researchers can leverage Deepgram’s customizable deep learning models to train on specific data, aiding in various research and innovation projects.

Overall Recommendation

Deepgram is an excellent choice for anyone needing accurate and efficient speech recognition and transcription services. Its real-time processing, customizable models, and support for multiple languages make it highly versatile. The platform’s ability to integrate with various applications through its API and its additional features such as sentiment analysis, keyword extraction, and intent recognition add significant value.

For businesses looking to enhance customer experience, automate documentation processes, or improve content creation efficiency, Deepgram’s solutions are highly recommended. Developers seeking to build advanced voice AI applications will also find Deepgram’s tools and APIs highly beneficial.

In summary, Deepgram’s advanced features, high accuracy, and flexibility make it a top choice among AI-driven speech tools.