Best speech-to-text app of 2024

Free, paid and online voice recognition apps and services

Best overall

Best for business, best for mobile, best text service, best speech recognition, best virtual assistant, best for cloud, best for azure, best for batch conversion, best free speech to text apps, best mobile speech to text apps, how we test.

The best speech-to-text apps make it simple and easy to convert speech into text, for both desktop and mobile devices.

A person using dictation with a smartphone.

1. Best overall 2. Best for business 3. Best for mobile 4. Best text service 5. Best speech recognition 6. Best virtual assistant 7. Best for cloud 8. Best for Azure 9. Best for batch conversion 10. Best free speech to text apps 11. Best mobile speech to text apps 12. FAQs 13. How we test

Speech-to-text used to be regarded as very niche, specifically serving either people with accessibility needs or for  dictation . However, speech-to-text is moving more and more into the mainstream as office work can now routinely be completed more simply and easily by using voce-recognition software, rather than having to type through members, and speaking aloud for text to be recorded is now quite common.

While the best speech to text software used to be specifically only for desktops, the development of mobile devices and the explosion of easily accessible apps means that transcription can now also be carried out on a  smartphone  or  tablet . 

This has made the best voice to text applications increasingly valuable to users in a range of different environments, from education to business. This is not least because the technology has matured to the level where mistakes in transcriptions are relatively rare, with some services rightly boasting a 99.9% success rate from clear audio.

Even still, this applies mainly to ordinary situations and circumstances, and precludes the use of technical terminology such as required in legal or medical professions. Despite this, digital transcription can still service needs such as basic  note-taking  which can still be easily done using a phone app, simplifying the dictation process.

However, different speech-to-text programs have different levels of ability and complexity, with some using advanced machine learning to constantly correct errors flagged up by users so that they are not repeated. Others are downloadable software which is only as good as its latest update.

Here then are the best in speech-to-text recognition programs, which should be more than capable for most situations and circumstances.

We've also featured the best voice recognition software .

The best paid for speech to text apps of 2024 in full:

Why you can trust TechRadar We spend hours testing every product or service we review, so you can be sure you’re buying the best. Find out more about how we test.

Website screenshot for Dragon Anywhere

1. Dragon Anywhere

Our expert review:

Reasons to buy

Reasons to avoid.

Dragon Anywhere is the Nuance mobile product for Android and iOS devices, however this is no ‘lite’ app, but rather offers fully-formed dictation capabilities powered via the cloud. 

So essentially you get the same excellent speech recognition as seen on the desktop software – the only meaningful difference we noticed was a very slight delay in our spoken words appearing on the screen (doubtless due to processing in the cloud). However, note that the app was still responsive enough overall.

It also boasts support for boilerplate chunks of text which can be set up and inserted into a document with a simple command, and these, along with custom vocabularies, are synced across the mobile app and desktop Dragon software. Furthermore, you can share documents across devices via Evernote or cloud services (such as Dropbox).

This isn’t as flexible as the desktop application, however, as dictation is limited to within Dragon Anywhere – you can’t dictate directly in another app (although you can copy over text from the Dragon Anywhere dictation pad to a third-party app). The other caveats are the need for an internet connection for the app to work (due to its cloud-powered nature), and the fact that it’s a subscription offering with no one-off purchase option, which might not be to everyone’s tastes.

Even bearing in mind these limitations, though, it’s a definite boon to have fully-fledged, powerful voice recognition of the same sterling quality as the desktop software, nestling on your phone or tablet for when you’re away from the office.

Nuance Communications offers a 7-day free trial to give the app a try before you commit to a subscription. 

Read our full Dragon Anywhere review .

  • ^ Back to the top

Website screenshot for Dragon Professional

2. Dragon Professional

Should you be looking for a business-grade dictation application, your best bet is Dragon Professional. Aimed at pro users, the software provides you with the tools to dictate and edit documents, create spreadsheets, and browse the web using your voice.   

According to Nuance, the solution is capable of taking dictation at an equivalent typing speed of 160 words per minute, with a 99% accuracy rate – and that’s out-of-the-box, before any training is done (whereby the app adapts to your voice and words you commonly use).

As well as creating documents using your voice, you can also import custom word lists. There’s also an additional mobile app that lets you transcribe audio files and send them back to your computer.   

This is a powerful, flexible, and hugely useful tool that is especially good for individuals, such as professionals and freelancers, allowing for typing and document management to be done much more flexibly and easily.

Overall, the interface is easy to use, and if you get stuck at all, you can access a series of help tutorials. And while the software can seem expensive, it's just a one-time fee and compares very favorably with paid-for subscription transcription services.

Also note that Nuance are currently offering 12-months' access to Dragon Anywhere at no extra cost with any purchase of Dragon Home or Dragon Professional Individual.

Read our full Dragon Professional review .

Website screenshot for Otter

Otter is a cloud-based speech to text program especially aimed for mobile use, such as on a laptop or smartphone. The app provides real-time transcription, allowing you to search, edit, play, and organize as required.

Otter is marketed as an app specifically for meetings, interviews, and lectures, to make it easier to take rich notes. However, it is also built to work with collaboration between teams, and different speakers are assigned different speaker IDs to make it easier to understand transcriptions.

There are three different payment plans, with the basic one being free to use and aside from the features mentioned above also includes keyword summaries and a wordcloud to make it easier to find specific topic mentions. You can also organize and share, import audio and video for transcription, and provides 600 minutes of free service.

The Premium plan also includes advanced and bulk export options, the ability to sync audio from Dropbox, additional playback speeds including the ability to skip silent pauses. The Premium plan also allows for up to 6,000 minutes of speech to text.

The Teams plan also adds two-factor authentication, user management and centralized billing, as well as user statistics, voiceprints, and live captioning.

Read our full Otter review .

Website screenshot for Verbit

Verbit aims to offer a smarter speech to text service, using AI for transcription and captioning. The service is specifically targeted at enterprise and educational establishments.

Verbit uses a mix of speech models, using neural networks and algorithms to reduce background noise, focus on terms as well as differentiate between speakers regardless of accent, as well as incorporate contextual events such as news and company information into recordings.

Although Verbit does offer a live version for transcription and captioning, aiming for a high degree of accuracy, other plans offer human editors to ensure transcriptions are fully accurate, and advertise a four hour turnaround time.

Altogether, while Verbit does offer a direct speech to text service, it’s possibly better thought of as a transcription service, but the focus on enterprise and education, as well as team use, means it earns a place here as an option to consider.

Read our full Verbit review .

Website screenshot for Speechmatics

5. Speechmatics

Speechmatics offers a machine learning solution to converting speech to text, with its automatic speech recognition solution available to use on existing audio and video files as well as for live use.

Unlike some automated transcription software which can struggle with accents or charge more for them, Speechmatics advertises itself as being able to support all major British accents, regardless of nationality. That way it aims to cope with not just different American and British English accents, but also South African and Jamaican accents.

Speechmatics offers a wider number of speech to text transcription uses than many other providers. Examples include taking call center phone recordings and converting them into searchable text or Word documents. The software also works with video and other media for captioning as well as using keyword triggers for management.

Overall, Speechmatics aims to offer a more flexible and comprehensive speech to text service than a lot of other providers, and the use of automation should keep them price competitive.

Read our full Speechmatics review .

Website screenshot for Braina Pro

6. Braina Pro

Braina Pro is speech recognition software which is built not just for dictation, but also as an all-round digital assistant to help you achieve various tasks on your PC. It supports dictation to third-party software in not just English but almost 90 different languages, with impressive voice recognition chops.

Beyond that, it’s a virtual assistant that can be instructed to set alarms, search your PC for a file, or search the internet, play an MP3 file, read an ebook aloud, plus you can implement various custom commands.

The Windows program also has a companion Android app which can remotely control your PC, and use the local Wi-Fi network to deliver commands to your computer, so you can spark up a music playlist, for example, wherever you happen to be in the house. Nifty.

There’s a free version of Braina which comes with limited functionality, but includes all the basic PC commands, along with a 7-day trial of the speech recognition which allows you to test out its powers for yourself before you commit to a subscription. Yes, this is another subscription-only product with no option to purchase for a one-off fee. Also note that you need to be online and have Google ’s Chrome browser installed for speech recognition functionality to work.

Read our full Braina Pro review .

Website screenshot for Amazon Transcribe

7. Amazon Transcribe

Amazon Transcribe is as big cloud-based automatic speech recognition platform developed specifically to convert audio to text for apps. It especially aims to provide a more accurate and comprehensive service than traditional providers, such as being able to cope with low-fi and noisy recordings, such as you might get in a contact center .

Amazon Transcribe uses a deep learning process that automatically adds punctuation and formatting, as well as process with a secure livestream or otherwise transcribe speech to text with batch processing.

As well as offering time stamping for individual words for easy search, it can also identify different speaks and different channels and annotate documents accordingly to account for this.

There are also some nice features for editing and managing transcribed texts, such as vocabulary filtering and replacement words which can be used to keep product names consistent and therefore any following transcription easier to analyze.

Overall, Amazon Transcribe is one of the most powerful platforms out there, though it’s aimed more for the business and enterprise user rather than the individual.

Website screenshot for Microsoft Azure Speech to Text

8. Microsoft Azure Speech to Text

Microsoft 's Azure cloud service offers advanced speech recognition as part of the platform's speech services to deliver the Microsoft Azure Speech to Text functionality. 

This feature allows you to simply and easily create text from a variety of audio sources. There are also customization options available to work better with different speech patterns, registers, and even background sounds. You can also modify settings to handle different specialist vocabularies, such as product names, technical information, and place names.

The Microsoft's Azure Speech to Text feature is powered by deep neural network models and allows for real-time audio transcription that can be set up to handle multiple speakers.

As part of the Azure cloud service, you can run Azure Speech to Text in the cloud, on premises, or in edge computing. In terms of pricing, you can run the feature in a free container with a single concurrent request for up to 5 hours of free audio per month.

Read our full Microsoft Azure Speech to Text review .

Website screenshot for IBM Watson Speech to Text

9. IBM Watson Speech to Text

IBM's Watson Speech to Text works is the third cloud-native solution on this list, with the feature being powered by AI and machine learning as part of IBM's cloud services.

While there is the option to transcribe speech to text in real-time, there is also the option to batch convert audio files and process them through a range of language, audio frequency, and other output options.

You can also tag transcriptions with speaker labels, smart formatting, and timestamps, as well as apply global editing for technical words or phrases, acronyms, and for number use.

As with other cloud services Watson Speech to Text allows for easy deployment both in the cloud and on-premises behind your own firewall to ensure security is maintained.

Read our full Watson Speech to Text review .

Website screenshot for Google Gboard

1. Google Gboard

If you already have an Android mobile device, then if it's not already installed then download Google Keyboard from the Google Play store and you'll have an instant text-to-speech app. Although it's primarily designed as a keyboard for physical input, it also has a speech input option which is directly available. And because all the power of Google's hardware is behind it, it's a powerful and responsive tool.

If that's not enough then there are additional features. Aside from physical input ones such as swiping, you can also trigger images in your text using voice commands. Additionally, it can also work with Google Translate, and is advertised as providing support for over 60 languages.

Even though Google Keyboard isn't a dedicated transcription tool, as there are no shortcut commands or text editing directly integrated, it does everything you need from a basic transcription tool. And as it's a keyboard, it means should be able to work with any software you can run on your Android smartphone, so you can text edit, save, and export using that. Even better, it's free and there are no adverts to get in the way of you using it.

Website screenshot for Just Press Record

2. Just Press Record

If you want a dedicated dictation app, it’s worth checking out Just Press Record. It’s a mobile audio recorder that comes with features such as one tap recording, transcription and iCloud syncing across devices. The great thing is that it’s aimed at pretty much anyone and is extremely easy to use. 

When it comes to recording notes, all you have to do is press one button, and you get unlimited recording time. However, the really great thing about this app is that it also offers a powerful transcription service. 

Through it, you can quickly and easily turn speech into searchable text. Once you’ve transcribed a file, you can then edit it from within the app. There’s support for more than 30 languages as well, making it the perfect app if you’re working abroad or with an international team. Another nice feature is punctuation command recognition, ensuring that your transcriptions are free from typos.   

This app is underpinned by cloud technology, meaning you can access notes from any device (which is online). You’re able to share audio and text files to other iOS apps too, and when it comes to organizing them, you can view recordings in a comprehensive file. 

Website screenshot for Speechnotes

3. Speechnotes

Speechnotes is yet another easy to use dictation app. A useful touch here is that you don’t need to create an account or anything like that; you just open up the app and press on the microphone icon, and you’re off.   

The app is powered by Google voice recognition tech. When you’re recording a note, you can easily dictate punctuation marks through voice commands, or by using the built-in punctuation keyboard. 

To make things even easier, you can quickly add names, signatures, greetings and other frequently used text by using a set of custom keys on the built-in keyboard. There’s automatic capitalization as well, and every change made to a note is saved to the cloud.

When it comes to customizing notes, you can access a plethora of fonts and text sizes. The app is free to download from the Google Play Store , but you can make in-app purchases to access premium features (there's also a browser version for Chrome).   

Read our full Speechnotes review .

Website screenshot for Transcribe

4. Transcribe

Marketed as a personal assistant for turning videos and voice memos into text files, Transcribe is a popular dictation app that’s powered by AI. It lets you make high quality transcriptions by just hitting a button.   

The app can transcribe any video or voice memo automatically, while supporting over 80 languages from across the world. While you can easily create notes with Transcribe, you can also import files from services such as Dropbox.

Once you’ve transcribed a file, you can export the raw text to a word processor to edit. The app is free to download, but you’ll have to make an in-app purchase if you want to make the most of these features in the long-term. There is a trial available, but it’s basically just 15 minutes of free transcription time. Transcribe is only available on iOS, though.   

Website screenshot for Windows Speech Recognition

5. Windows Speech Recognition

If you don’t want to pay for speech recognition software, and you’re running Microsoft’s latest desktop OS, then you might be pleased to hear that speech-to-text is built into Windows.

Windows Speech Recognition, as it’s imaginatively named – and note that this is something different to Cortana, which offers basic commands and assistant capabilities – lets you not only execute commands via voice control, but also offers the ability to dictate into documents.

The sort of accuracy you get isn’t comparable with that offered by the likes of Dragon, but then again, you’re paying nothing to use it. It’s also possible to improve the accuracy by training the system by reading text, and giving it access to your documents to better learn your vocabulary. It’s definitely worth indulging in some training, particularly if you intend to use the voice recognition feature a fair bit.

The company has been busy boasting about its advances in terms of voice recognition powered by deep neural networks, especially since windows 10 and now for Windows 11 , and Microsoft is certainly priming us to expect impressive things in the future. The likely end-goal aim is for Cortana to do everything eventually, from voice commands to taking dictation.

Turn on Windows Speech Recognition by heading to the Control Panel (search for it, or right click the Start button and select it), then click on Ease of Access, and you will see the option to ‘start speech recognition’ (you’ll also spot the option to set up a microphone here, if you haven’t already done that).

Best speech to text software

Aside from what has already been covered above, there are an increasing number of apps available across all mobile devices for working with speech to text, not least because Google's speech recognition technology is available for use. 

iTranslate Translator  is a speech-to-text app for iOS with a difference, in that it focuses on translating voice languages. Not only does it aim to translate different languages you hear into text for your own language, it also works to translate images such as photos you might take of signs in a foreign country and get a translation for them. In that way, iTranslate is a very different app, that takes the idea of speech-to-text in a novel direction, and by all accounts, does it well. 

ListNote Speech-to-Text Notes  is another speech-to-text app that uses Google's speech recognition software, but this time does a more comprehensive job of integrating it with a note-taking program than many other apps. The text notes you record are searchable, and you can import/export with other text applications. Additionally there is a password protection option, which encrypts notes after the first 20 characters so that the beginning of the notes are searchable by you. There's also an organizer feature for your notes, using category or assigned color. The app is free on Android, but includes ads.

Voice Notes  is a simple app that aims to convert speech to text for making notes. This is refreshing, as it mixes Google's speech recognition technology with a simple note-taking app, so there are more features to play with here. You can categorize notes, set reminders, and import/export text accordingly.

SpeechTexter  is another speech-to-text app that aims to do more than just record your voice to a text file. This app is built specifically to work with social media, so that rather than sending messages, emails, Tweets, and similar, you can record your voice directly to the social media sites and send. There are also a number of language packs you can download for offline working if you want to use more than just English, which is handy.

Also consider reading these related software and app guides:

  • Best text-to-speech software
  • Best transcription services
  • Best Bluetooth headsets

Which speech-to-text app is best for you?

When deciding which speech-to-text app to use, first consider what your actual needs are, as free and budget  options may only provide basic features, so if you need to use advanced tools you may find a paid-for platform is better suited to you. Additionally, higher-end software can usually cater for every need, so do ensure you have a good idea of which features you think you may require from your speech-to-text app.

To test for the best speech-to-text apps we first set up an account with the relevant platform, then we tested the service to see how the software could be used for different purposes and in different situations. The aim was to push each speech-to-text platform to see how useful its basic tools were and also how easy it was to get to grips with any more advanced tools.

Read more on how we test, rate, and review products on TechRadar .

Get in touch

  • Want to find out about commercial or marketing opportunities? Click here
  • Out of date info, errors, complaints or broken links? Give us a nudge
  • Got a suggestion for a product or service provider? Message us directly
  • You've reached the end of the page. Jump back up to the top ^

Are you a pro? Subscribe to our newsletter

Sign up to the TechRadar Pro newsletter to get all the top news, opinion, features and guidance your business needs to succeed!

Brian Turner

Brian has over 30 years publishing experience as a writer and editor across a range of computing, technology, and marketing titles. He has been interviewed multiple times for the BBC and been a speaker at international conferences. His specialty on techradar is Software as a Service (SaaS) applications, covering everything from office suites to IT service tools. He is also a science fiction and fantasy author, published as Brian G Turner.

Webflow announces acquisition of Intellimize - expanding beyond visual development to become an integrated Website Experience Platform

Square Online review 2024: Top ecommerce platform pros, cons, and features tested

Scientists design super-battery made with cheap, readily affordable chemical element, Na — Salt-based cell has surprisingly good energy density and charges in seconds

Most Popular

  • 2 NYT Strands today — hints, answers and spangram for Friday, April 26 (game #54)
  • 3 I listened to Taylor Swift’s new songs on a 22-year-old Sony Walkman and it was a tortured experience I won’t try again
  • 4 Independent auditors confirm top VPN doesn't log your data
  • 5 Microsoft strips Windows 11's Control Panel of another tool - is the writing on the wall?
  • 2 This Android phone for audiophiles offers a hi-res DAC, balanced output and 3.5mm jack – plus a cool cyberpunk look that puts Google and OnePlus to shame
  • 3 Exclusive: inside Honda's new concept EV that has an infinitely recyclable design – and looks like a Honda e
  • 4 Microsoft might have cracked the most important part of video calls — new update could potentially hide all your embarassing apps and pop-ups
  • 5 Nanoleaf Skylight starter kit review: Nanoleaf is looking up

speech to text app cost

  • Is Hulu Worth It?
  • iPad Pro vs. Surface Pro: What's the Difference?

The 8 Best Voice-to-Text Apps of 2024

Dragon Anywhere is the best overall voice-to-text app

Stacey has worn many hats throughout her writing career, working in content marketing, nonprofit communications, and journalism at different points in her life.

We independently evaluate all recommended products and services. If you click on links we provide, we may receive compensation. Learn more .

Getty Images / RapidEye-izabell

Voice-to-text apps can be helpful for accessibility needs and busy professionals alike. If you’re always on the go, transcribing interview notes, or you can think faster than you can write, these special programs can increase your efficiency and store the recordings safely and sound via the cloud. Depending on your needs, you can choose an app with customizable language for commonly used words or industry terms.

The main features to consider when looking at voice-to-text apps include accuracy, shortcuts, and available languages. Accuracy is one of the most critical factors, and some options perform much better than others in this area. These apps are becoming more mainstream, from basic software to advanced technology. Whether you want to take notes , send quick messages, or translate on the fly, the best voice-to-text apps below are ready to help.

Best Voice-to-Text Apps of 2024

Best overall: dragon anywhere, best assistant: google assistant.

  • Best Transcription: Transcribe
  • Best for Long Recordings: Speechnotes

Best for Notes: Voice Notes

  • Best for Messages: SpeechTexter  

Best for Translation: iTranslate Converse

Best for niche industry terms: braina.

Dragon Anywhere

  • Price: $15 per month or $150 per year
  • Free Trial: One week
  • Accuracy Rate: 99 percent

Why We Chose It

We chose Dragon Anywhere because of its 99 percent accuracy rating and options for voice editing and formatting.

Pros & Cons

No word limits

99 percent accuracy

Multiple ways to share documents

Expensive compared to some other apps

May take time to learn the built-in commands

Available for Android and iOS devices, Dragon Anywhere is a premium professional tool that’s a big deal in the world of dictation apps. It’s 99 percent accurate and comes with voice editing and formatting. You can use the app for as long as you need—there are no word limits.

Dragon Anywhere allows you to customize industry lingo for even more accuracy. After transcription, share your notes by email, Dropbox, Evernote, and more. For supported versions, you can synchronize Dragon Anywhere with your desktop and do voice work on your computer as well. However, to do this, you will need to purchase a desktop version of Dragon as well.

Its accuracy and rich features come with a cost, but the bill could be a worthy business investment if you often think of ideas on the fly or need to record meetings. The application costs $15 per month or $150 per year.

Google Assistant

  • Price: Free
  • Free Trial: N/A
  • Accuracy Rate: Not disclosed

We chose Google Assistant because it can help you accomplish a variety of tasks.

Integrated into services you already use, such as email and messaging

Free to use

Not specifically designed for note-taking

Must use applets to boost note-taking abilities

Google Assistant does a lot, including playing music and opening maps. One of its best features? Voice recognition. You can use voice command to look up information and tell Google Assistant to perform certain functions, but it can also convert speech to text.

The app sends messages, manages tasks, and sets reminders. While it’s not a speech-to-text app in the purest sense, it will still help organize your ideas and notes with voice recognition.

Use IFTTT (If This Then That) to maximize your Google Assistant note-taking abilities. In one applet , Google Assistant can log all of your notes into a spreadsheet. You can also search IFTTT for other productivity-boosting applets or create your own as you see fit.  

Best for Transcription: Transcribe - Speech to Text

Transcribe - Speech to Text

  • Price: $5 per hour of transcription, subscription options also available
  • Free Trial: 15 minutes of transcription

Transcribe - Speech to Text offers you the opportunity to transcribe any voice or video file using the help of artificial intelligence.

Transcription available for over 120 languages and dialects

Easy-to-use software

Only available for Apple products

Journalists or executive assistants who have a lot of conversations to track may find this app useful. Using A.I., Transcribe can turn any voice or video memo into a transcription in over 120 different languages and dialects. After recording, you can drop your file in this app and export your raw text into another app such as DropBox.

Keep in mind that Transcribe is only available for Apple products with Voice Memo and video since there’s no direct in-app dictation. Transcribe can also get pricey. Users receive a free trial for 15 minutes of transcription. Every extra hour costs $5 and 10 hours costs $30, but there are also subscriptions available for frequent users.

Best for Long Recordings: Speechnotes - Speech to Text

Speechnotes - Speech to Text

  • Accuracy Rate: 90 percent or better

We chose Speechnotes because it allows for extremely long recordings.

Long recordings allowed

Can add in punctuation where needed

In-app advertisements as a free app

Only available in browser and on Android

Writers who think faster than they can type will appreciate this app. Speechnotes is excellent for organizing long notes thanks to two special features. First of all, it doesn't stop recording—even if you pause to think or breathe—so you can keep the recording open for as long as needed. Second, you can tap a button or use a verbal command to insert punctuation marks into your work so they won't become too unwieldy.

The free app has a small ad banner, but you can upgrade to a premium version to get rid of it. Other perks: It won't clog up your phone space at 4 MB, plus it saves all your recordings as TXT files. Plus, you won’t need to open the app to use it either; you can tap on a widget to access Speechnotes. Keep in mind that Speechnotes is only available on your browser and Android. 

Voice Notes

We chose Voice Notes for its efficient layout to help you store notes.

Recognizes 120 languages

Only available on Android phones

Voice Notes has speech recognition that allows you to create notes efficiently. You can then organize your notes into categories and create reminders by customizing alerts synced with your phone calendar. The interface is intuitive and easy to use; simply press the microphone button and speak to record. You’ll even be able to make your notes with the phone screen turned off.

The app can recognize up to 120 languages, just in case you need to record notes in something other than English. The app is free, though you can subscribe to a premium plan to support the developer.

Of course, there are a few caveats. Voice Notes is a popular app, but the one major limitation is that it's only available on Android phones. Plus, you need to have Google voice search installed to use it.

Best for Messages: SpeechTexter - Speech to Text

SpeechTexter - Speech to Text

  • Accuracy Rate: Better than 90 percent

SpeechTexter is a useful tool to help you draft texts, notes, emails, reports, and more with your voice. 

Desktop and android versions available

Over 70 languages supported

Customizable commands

Offline mode is less accurate

Need to send a quick message but find your hands occupied with other tasks? Here’s a quick solution. Using Google’s backend, SpeechTexter allows you to create text notes, emails, and reports with your own voice. The easy-to-use app supports over 70 languages with an accuracy rate higher than 90 percent. You can customize your own commands for punctuation as well.

It's possible to use the app when you're not connected to the Internet, though keep in mind that the accuracy lowers in offline mode and the recognition speed depends on your Internet connectivity. To use the app offline, make sure that you install language packs of your preference.

iTranslate Converse

  • Price: $6 per month or $50 per year
  • Free Trial: Yes

We chose iTranslate Converse because it is designed to help you translate languages on the go in noisy environments.

Works well in noisy environments

Enables real-time communication with someone in another language

38 languages recognized

Subscription fee

Unknown accuracy rate

Brought to you by the same developers behind the popular iTranslate app, iTranslate Converse is as close to real-time translation as you’ll get, which is convenient if you need to communicate with clients who don’t speak the same language as you or if you’re traveling abroad. All you have to do is set the two languages. Then tap, hold, and speak into your phone.

The app will pick up on the language that you’re speaking, then issue out a translation—yes, even in noisy environments. The app is capable of recognizing 38 languages. After your conversation is done, you can download full transcriptions. It’s not always perfect, of course, but it’s faster than going through a personal assistant app to look up translations for you.

While it has a subscription fee, iTranslate won't stretch your budget significantly. When you download it, you'll receive a free trial. After that runs out, you'll be upgraded to the pro version for $6 per month or $50 per year. You must cancel at least 24 hours before the end of the trial to avoid being put on a paid membership.

  • Price: $0-$399
  • Free Trial: No
  • Accuracy Rate: 99%

Briana can help you utilize voice-to-text in a jargon-filled industry.

Personal A.I. builds to recognize your industry jargon

Over 100 languages recognized

May take some time to customize

Braina is a personal A.I. for Windows P.C.s with companion Android and IOS apps. The program can convert your voice into text for any website or software program, including a word processor. It recognizes most medical, legal, and scientific terms, which makes it ideal if you work in a niche industry with technical jargon. You can also teach Braina uncommon names and vocabulary with ease.

Braina has other helpful voice recognition features besides learning niche industry terms. For example, it can recognize over 100 languages to serve non-English users. The program also includes convenient dictation commands for deleting, tabbing, and casing.

The app has a few price tiers; there is a free version with limited access to features, while the pro version costs $79 per year or $399 for lifetime access (which often goes on sale for $199).

Final Verdict

Dragon Anywhere is our pick for the best overall voice-to-text app thanks to its streamlined tools, high accuracy rating, and accessible computer synchronization. The app costs a bit more than other popular options, but discounts are available on annual subscriptions, and it has no limit on words.

As a bonus, Dragon Anywhere also allows users to customize their experience for specific industry lingo and other terms. This app is also accessible for Android and iOS devices and features simple sharing options to multiple apps or email accounts.

Compare the Best Voice to Text Apps

Guide to choosing a voice-to-text app.

Not sure how to choose a voice-to-text app? Consider the following factors to select the best option for your needs:

  • Accuracy rating
  • Available languages
  • Limits on words or usage
  • Platform (Android or iOS)
  • Exporting files
  • Translation
  • Customizable terms or industry language

Frequently Asked Questions

What is the best voice to text app.

Dragon Anywhere is the best voice-to-text app on our list. This app is available for both Android and iOS users, has a high accuracy rating, and makes it easy to export files to your computer, email, or other apps.

What Is the Best Free Voice to Text App?

Speechnotes, Voice Notes, Google Assistant, and SpeechTexter are all great choices for free voice-to-text apps. Choose the best option for your specific needs based on maximum length of recording, available languages, and exporting options.

What Is the Best Way to Convert Voice to Text?

Voice-to-text apps and computer programs are both helpful ways to convert your voice to text. If you need to record notes on the go or away from your computer, a mobile app is likely best for you. On the other hand, some people prefer apps downloaded to their computers to take notes during meetings or classes.

What Is the Most Realistic Speech-to-Text?

Dragon Anywhere has the highest accuracy rating of voice-to-text apps compared in this list. Additionally, this app allows users to customize specific industry language and commonly used terms to make their transcriptions more realistic.

Methodology

To find the best voice-to-text apps we compiled a list of the most popular options available. Next, we took a closer look at several factors, including the price, free trial options, accuracy rates, and more. Finally, we decided which providers were best suited for what our readers needed.

Get the Latest Tech News Delivered Every Day

  • This Always-Recording AI Microphone Will Make Your Coworkers Hate You
  • 2024's Best Budget-Friendly Phone Plans
  • The 8 Best TV Streaming Apps of 2024
  • The 5 Best Translation Apps of 2024
  • The 6 Best Antivirus Apps for iPhones in 2024
  • The 11 Best Note-Taking Apps for iPad and iPad Pro in 2024
  • The 7 Best Senior Cell Phone Plans of 2024
  • The 10 Best Writing Apps of 2024
  • The 6 Best Offline Translators of 2024
  • The 5 Best Walkie-Talkie Apps of 2024
  • Best Visual Voicemail Apps of 2024
  • The 8 Best Apps to Record Phone Calls on iPhone of 2024
  • The Best Brainstorming Tools for 2024
  • The 6 Best Texting Apps for Android Tablets in 2024
  • How to Use Speech-to-Text on Android
  • The 8 Best Microsoft Office Alternatives

The best dictation software in 2024

These speech-to-text apps will save you time without sacrificing accuracy..

Best text dictation apps hero

The early days of dictation software were like your friend that mishears lyrics: lots of enthusiasm but little accuracy. Now, AI is out of Pandora's box, both in the news and in the apps we use, and dictation apps are getting better and better because of it. It's still not 100% perfect, but you'll definitely feel more in control when using your voice to type.

I took to the internet to find the best speech-to-text software out there right now, and after monologuing at length in front of dozens of dictation apps, these are my picks for the best.

The best dictation software

Windows 11 Speech Recognition for free dictation software on Windows

Dragon by Nuance for a customizable dictation app

Google Docs voice typing for dictating in Google Docs

Gboard for a free mobile dictation app

Otter for collaboration

What is dictation software?

When searching for dictation software online, you'll come across a wide range of options. The ones I'm focusing on here are apps or services that you can quickly open, start talking, and see the results on your screen in (near) real-time. This is great for taking quick notes , writing emails without typing, or talking out an entire novel while you walk in your favorite park—because why not.

Beyond these productivity uses, people with disabilities or with carpal tunnel syndrome can use this software to type more easily. It makes technology more accessible to everyone .

If this isn't what you're looking for, here's what else is out there:

AI assistants, such as Apple's Siri, Amazon's Alexa, and Microsoft's Cortana, can help you interact with each of these ecosystems to send texts, buy products, or schedule events on your calendar.

AI meeting assistants will join your meetings and transcribe everything, generating meeting notes to share with your team.

AI transcription platforms can process your video and audio files into neat text.

Transcription services that use a combination of dictation software, AI, and human proofreaders can achieve above 99% accuracy.

There are also advanced platforms for enterprise, like Amazon Transcribe and Microsoft Azure's speech-to-text services.

What makes a great dictation app?

How we evaluate and test apps.

Our best apps roundups are written by humans who've spent much of their careers using, testing, and writing about software. Unless explicitly stated, we spend dozens of hours researching and testing apps, using each app as it's intended to be used and evaluating it against the criteria we set for the category. We're never paid for placement in our articles from any app or for links to any site—we value the trust readers put in us to offer authentic evaluations of the categories and apps we review. For more details on our process, read the full rundown of how we select apps to feature on the Zapier blog .

Dictation software comes in different shapes and sizes. Some are integrated in products you already use. Others are separate apps that offer a range of extra features. While each can vary in look and feel, here's what I looked for to find the best:

High accuracy. Staying true to what you're saying is the most important feature here. The lowest score on this list is at 92% accuracy.

Ease of use. This isn't a high hurdle, as most options are basic enough that anyone can figure them out in seconds.

Availability of voice commands. These let you add "instructions" while you're dictating, such as adding punctuation, starting a new paragraph, or more complex commands like capitalizing all the words in a sentence.

Availability of the languages supported. Most of the picks here support a decent (or impressive) number of languages.

Versatility. I paid attention to how well the software could adapt to different circumstances, apps, and systems.

I tested these apps by reading a 200-word script containing numbers, compound words, and a few tricky terms. I read the script three times for each app: the accuracy scores are an average of all attempts. Finally, I used the voice commands to delete and format text and to control the app's features where available.

I used my laptop's or smartphone's microphone to test these apps in a quiet room without background noise. For occasional dictation, an equivalent microphone on your own computer or smartphone should do the job well. If you're doing a lot of dictation every day, it's probably worth investing in an external microphone, like the Jabra Evolve .

What about AI?

Before the ChatGPT boom, AI wasn't as hot a keyword, but it already existed. The apps on this list use a combination of technologies that may include AI— machine learning and natural language processing (NLP) in particular. While they could rebrand themselves to keep up with the hype, they may use pipelines or models that aren't as bleeding-edge when compared to what's going on in Hugging Face or under OpenAI Whisper 's hood, for example. 

Also, since this isn't a hot AI software category, these apps may prefer to focus on their core offering and product quality instead, not ride the trendy wave by slapping "AI-powered" on every web page.

Tips for using voice recognition software

Though dictation software is pretty good at recognizing different voices, it's not perfect. Here are some tips to make it work as best as possible.

Speak naturally (with caveats). Dictation apps learn your voice and speech patterns over time. And if you're going to spend any time with them, you want to be comfortable. Speak naturally. If you're not getting 90% accuracy initially, try enunciating more.  

Punctuate. When you dictate, you have to say each period, comma, question mark, and so forth. The software isn't always smart enough to figure it out on its own.

Learn a few commands . Take the time to learn a few simple commands, such as "new line" to enter a line break. There are different commands for composing, editing, and operating your device. Commands may differ from app to app, so learn the ones that apply to the tool you choose.

Know your limits. Especially on mobile devices, some tools have a time limit for how long they can listen—sometimes for as little as 10 seconds. Glance at the screen from time to time to make sure you haven't blown past the mark. 

Practice. It takes time to adjust to voice recognition software, but it gets easier the more you practice. Some of the more sophisticated apps invite you to train by reading passages or doing other short drills. Don't shy away from tutorials, help menus, and on-screen cheat sheets.

The best dictation software at a glance

Best free dictation software for apple devices, apple dictation (ios, ipados, macos).

The interface for Apple Dictation, our pick for the best free dictation app for Apple users

Look no further than your Mac, iPhone, or iPad for one of the best dictation tools. Apple's built-in dictation feature, powered by Siri (I wouldn't be surprised if the two merged one day), ships as part of Apple's desktop and mobile operating systems. On iOS devices, you use it by pressing the microphone icon on the stock keyboard. On your desktop, you turn it on by going to System Preferences > Keyboard > Dictation , and then use a keyboard shortcut to activate it in your app.

If you want the ability to navigate your Mac with your voice and use dictation, try Voice Control . By default, Voice Control requires the internet to work and has a time limit of about 30 seconds for each smattering of speech. To remove those limits for a Mac, enable Enhanced Dictation, and follow the directions here for your OS (you can also enable it for iPhones and iPads). Enhanced Dictation adds a local file to your device so that you can dictate offline.

You can format and edit your text using simple commands, such as "new paragraph" or "select previous word." Tip: you can view available commands in a small window, like a little cheat sheet, while learning the ropes. Apple also offers a number of advanced commands for things like math, currency, and formatting. 

Apple Dictation price: Included with macOS, iOS, iPadOS, and Apple Watch.

Apple Dictation accuracy: 96%. I tested this on an iPhone SE 3rd Gen using the dictation feature on the keyboard.

Recommendation: For the occasional dictation, I'd recommend the standard Dictation feature available with all Apple systems. But if you need more custom voice features (e.g., medical terms), opt for Voice Control with Enhanced Dictation. You can create and import both custom vocabulary and custom commands and work while offline.

Apple Dictation supported languages: 59 languages and dialects .

While Apple Dictation is available natively on the Apple Watch, if you're serious about recording plenty of voice notes and memos, check out the Just Press Record app. It runs on the same engine and keeps all your recordings synced and organized across your Apple devices.

Best free dictation software for Windows

Windows 11 speech recognition (windows).

The interface for Windows Speech Recognition, our pick for the best free dictation app for Windows

Windows 11 Speech Recognition (also known as Voice Typing) is a strong dictation tool, both for writing documents and controlling your Windows PC. Since it's part of your system, you can use it in any app you have installed.

To start, first, check that online speech recognition is on by going to Settings > Time and Language > Speech . To begin dictating, open an app, and on your keyboard, press the Windows logo key + H. A microphone icon and gray box will appear at the top of your screen. Make sure your cursor is in the space where you want to dictate.

When it's ready for your dictation, it will say Listening . You have about 10 seconds to start talking before the microphone turns off. If that happens, just click it again and wait for Listening to pop up. To stop the dictation, click the microphone icon again or say "stop talking."  

As I dictated into a Word document, the gray box reminded me to hang on, we need a moment to catch up . If you're speaking too fast, you'll also notice your transcribed words aren't keeping up. This never posed an issue with accuracy, but it's a nice reminder to keep it slow and steady. 

To activate the computer control features, you'll have to go to Settings > Accessibility > Speech instead. While there, tick on Windows Speech Recognition. This unlocks a range of new voice commands that can fully replace a mouse and keyboard. Your voice becomes the main way of interacting with your system.

While you can use this tool anywhere inside your computer, if you're a Microsoft 365 subscriber, you'll be able to use the dictation features there too. The best app to use it on is, of course, Microsoft Word: it even offers file transcription, so you can upload a WAV or MP3 file and turn it into text. The engine is the same, provided by Microsoft Speech Services.

Windows 11 Speech Recognition price: Included with Windows 11. Also available as part of the Microsoft 365 subscription.

Windows 11 Speech Recognition accuracy: 95%. I tested it in Windows 11 while using Microsoft Word. 

Windows 11 Speech Recognition languages supported : 11 languages and dialects .

Best customizable dictation software

Dragon by nuance (android, ios, macos, windows).

The interface for Dragon, our pick for the best customizable dictation software

In 1990, Dragon Dictate emerged as the first dictation software. Over three decades later, we have Dragon by Nuance, a leader in the industry and a distant cousin of that first iteration. With a variety of software packages and mobile apps for different use cases (e.g., legal, medical, law enforcement), Dragon can handle specialized industry vocabulary, and it comes with excellent features, such as the ability to transcribe text from an audio file you upload. 

For this test, I used Dragon Anywhere, Nuance's mobile app, as it's the only version—among otherwise expensive packages—available with a free trial. It includes lots of features not found in the others, like Words, which lets you add words that would be difficult to recognize and spell out. For example, in the script, the word "Litmus'" (with the possessive) gave every app trouble. To avoid this, I added it to Words, trained it a few times with my voice, and was then able to transcribe it accurately.

It also provides shortcuts. If you want to shorten your entire address to one word, go to Auto-Text , give it a name ("address"), and type in your address: 1000 Eichhorn St., Davenport, IA 52722, and hit Save . The next time you dictate and say "address," you'll get the entire thing. Press the comment bubble icon to see text commands while you're dictating, or say "What can I say?" and the command menu pops up. 

Once you complete a dictation, you can email, share (e.g., Google Drive, Dropbox), open in Word, or save to Evernote. You can perform these actions manually or by voice command (e.g., "save to Evernote.") Once you name it, it automatically saves in Documents for later review or sharing. 

Accuracy is good and improves with use, showing that you can definitely train your dragon. It's a great choice if you're serious about dictation and plan to use it every day, but may be a bit too much if you're just using it occasionally.

Dragon by Nuance price: $15/month for Dragon Anywhere (iOS and Android); from $200 to $500 for desktop packages

Dragon by Nuance accuracy: 97%. Tested it in the Dragon Anywhere iOS app.

Dragon by Nuance supported languages: 6 languages and dialects in Dragon Anywhere and 8 languages and dialects in Dragon Desktop.  

Best free mobile dictation software

Gboard (android, ios).

The interface for Gboard, our pick for the best mobile dictation software

Gboard, also known as Google Keyboard, is a free keyboard native to Android phones. It's also available for iOS: go to the App Store, download the Gboard app , and then activate the keyboard in the settings. In addition to typing, it lets you search the web, translate text, or run a quick Google Maps search.

Back to the topic: it has an excellent dictation feature. To start, press the microphone icon on the top-right of the keyboard. An overlay appears on the screen, filling itself with the words you're saying. It's very quick and accurate, which will feel great for fast-talkers but probably intimidating for the more thoughtful among us. If you stop talking for a few seconds, the overlay disappears, and Gboard pastes what it heard into the app you're using. When this happens, tap the microphone icon again to continue talking.

Wherever you can open a keyboard while using your phone, you can have Gboard supporting you there. You can write emails or notes or use any other app with an input field.

The writer who handled the previous update of this list had been using Gboard for seven years, so it had plenty of training data to adapt to his particular enunciation, landing the accuracy at an amazing 98%. I haven't used it much before, so the best I had was 92% overall. It's still a great score. More than that, it's proof of how dictation apps improve the more you use them.

Gboard price : Free

Gboard accuracy: 92%. With training, it can go up to 98%. I tested it using the iOS app while writing a new email.

Gboard supported languages: 916 languages and dialects .

Best dictation software for typing in Google Docs

Google docs voice typing (web on chrome).

The interface for Google Docs voice typing, our pick for the best dictation software for Google Docs

Just like Microsoft offers dictation in their Office products, Google does the same for their Workspace suite. The best place to use the voice typing feature is in Google Docs, but you can also dictate speaker notes in Google Slides as a way to prepare for your presentation.

To get started, make sure you're using Chrome and have a Google Docs file open. Go to Tools > Voice typing , and press the microphone icon to start. As you talk, the text will jitter into existence in the document.

You can change the language in the dropdown on top of the microphone icon. If you need help, hover over that icon, and click the ? on the bottom-right. That will show everything from turning on the mic, the voice commands for dictation, and moving around the document.

It's unclear whether Google's voice typing here is connected to the same engine in Gboard. I wasn't able to confirm whether the training data for the mobile keyboard and this tool are connected in any way. Still, the engines feel very similar and turned out the same accuracy at 92%. If you start using it more often, it may adapt to your particular enunciation and be more accurate in the long run.

Google Docs voice typing price : Free

Google Docs voice typing accuracy: 92%. Tested in a new Google Docs file in Chrome.

Google Docs voice typing supported languages: 118 languages and dialects ; voice commands only available in English.

Google Docs integrates with Zapier , which means you can automatically do things like save form entries to Google Docs, create new documents whenever something happens in your other apps, or create project management tasks for each new document.

Best dictation software for collaboration

Otter (web, android, ios).

Otter, our pick for the best dictation software for collaboration

Most of the time, you're dictating for yourself: your notes, emails, or documents. But there may be situations in which sharing and collaboration is more important. For those moments, Otter is the better option.

It's not as robust in terms of dictation as others on the list, but it compensates with its versatility. It's a meeting assistant, first and foremost, ready to hop on your meetings and transcribe everything it hears. This is great to keep track of what's happening there, making the text available for sharing by generating a link or in the corresponding team workspace.

The reason why it's the best for collaboration is that others can highlight parts of the transcript and leave their comments. It also separates multiple speakers, in case you're recording a conversation, so that's an extra headache-saver if you use dictation software for interviewing people.

When you open the app and click the Record button on the top-right, you can use it as a traditional dictation app. It doesn't support voice commands, but it has decent intuition as to where the commas and periods should go based on the intonation and rhythm of your voice. Once you're done talking, Otter will start processing what you said, extract keywords, and generate action items and notes from the content of the transcription.

If you're going for long recording stretches where you talk about multiple topics, there's an AI chat option, where you can ask Otter questions about the transcript. This is great to summarize the entire talk, extract insights, and get a different angle on everything you said.

Not all meeting assistants offer dictation, so Otter sits here on this fence between software categories, a jack-of-two-trades, quite good at both. If you want something more specialized for meetings, be sure to check out the best AI meeting assistants . But if you want a pure dictation app with plenty of voice commands and great control over the final result, the other options above will serve you better.

Otter price: Free plan available for 300 minutes / month. Pro plan starts at $16.99, adding more collaboration features and monthly minutes.

Otter accuracy: 93% accuracy. I tested it in the web app on my computer.

Otter supported languages: Only American and British English for now.

Is voice dictation for you?

Dictation software isn't for everyone. It will likely take practice learning to "write" out loud because it will feel unnatural. But once you get comfortable with it, you'll be able to write from anywhere on any device without the need for a keyboard. 

And by using any of the apps I listed here, you can feel confident that most of what you dictate will be accurately captured on the screen. 

Related reading:

The best transcription services

Catch typos by making your computer read to you

Why everyone should try the accessibility features on their computer

What is Otter.ai?

The best voice recording apps for iPhone

This article was originally published in April 2016 and has also had contributions from Emily Esposito, Jill Duffy, and Chris Hawkins. The most recent update was in November 2023.

Get productivity tips delivered straight to your inbox

We’ll email you 1-3 times per week—and never share your information.

Miguel Rebelo picture

Miguel Rebelo

Miguel Rebelo is a freelance writer based in London, UK. He loves technology, video games, and huge forests. Track him down at mirebelo.com.

  • Video & audio
  • Google Docs

Related articles

Best appointment scheduling apps hero

The best appointment schedulers and booking apps in 2024

The best appointment schedulers and booking...

A hero image with an icon representing AI writing

The best AI content detectors in 2024

Hero image with the logos of the best AI website builders

The 4 best AI website builders in 2024

A hero image with the logos of the best form builders

The 12 best online form builder apps in 2024

Improve your productivity automatically. Use Zapier to get your apps working together.

A Zap with the trigger 'When I get a new lead from Facebook,' and the action 'Notify my team in Slack'

Icon image

Speech Recognition & Synthesis

Content rating

About this app

Data safety.

Icon image

Ratings and reviews

speech to text app cost

  • Flag inappropriate

speech to text app cost

  • Show review history

speech to text app cost

What's new

App support, more by google llc.

Thumbnail image

Similar apps

Thumbnail image

  • Meta Quest 4
  • Google Pixel 9
  • Google Pixel 8a
  • Apple Vision Pro 2
  • Nintendo Switch 2
  • Samsung Galaxy Ring
  • Yellowstone Season 6
  • Recall an Email in Outlook
  • Stranger Things Season 5

The best speech-to-text software for 2022

If you’re looking to take your productivity up a notch (or if you’re just a really slow typist), the best speech-to-text software is a sure way to do it. The idea is pretty simple: You speak, and the software detects your words and converts them into text format. The applications are nearly endless, from dictating thoughts and jotting down notes to creating long-form documents without having to type a word yourself. Yet despite this, not many businesses and professionals are taking full advantage of what speech-to-text software can give them.

Dragon Anywhere

Amazon transcribe, google docs voice typing.

The good news is that the best speech-to-text software doesn’t have to cost an arm and a leg — or anything at all, depending on your needs. There’s a handful of noteworthy services out there, though, and selecting the right one is important. That’s where we come in. Below, we’ve rounded up the best speech-to-text software platforms out there, with our picks covering a wide spectrum of platforms, features, and price points.

  • Price: $15 per month or $150 per year
  • Free Trial: Yes
  • Platforms: iOS, Android
  • Voice editing and formatting
  • Cloud-based storage and file sharing
  • AI learning adapts to your speech

If you’re already somewhat familiar with the best speech-to-text software then there’s a good chance you’ve heard of Dragon. Dragon Anywhere is a dedicated mobile speech-to-text app that delivers a high degree of accuracy thanks to its industry-leading speech recognition software that can adapt to your own speech patterns. In other words, Dragon Anywhere can actually learn  how you speak, right down to your sentence cadence and word pronunciation. In the off-chance that it does make a mistake, you can edit and format using just your voice. Dragon Anywhere also allows for continuous dictation with no word limits or length cut-offs, and your text documents are stored in the cloud for easy access and sharing with colleagues when you need to.

  • The best business laptops from Apple, Lenovo, Dell, and more
  • The Best Hiring Apps for Recruiters
  • 15 best online jobs for teens in 2022

Dragon Anywhere is by far the best speech-to-text software for mobile users, given that it’s designed entirely for use on iOS and Android devices, making it the ideal choice for translators, lawyers, accountants and other professionals who need to turn spoken dialog into written notes. It’s a bit like having a virtual stenographer. Plus, it’s useful for anybody else who wants to be able to “jot” things down hands-free. Its cloud-based sharing makes Dragon Anywhere great for group work, too.

Dragon Anywhere is a paid service with monthly and yearly subscription plans. You can pay on a monthly basis for $15, although if you like the service, then the $150 annual subscription is a better value (basically getting you two months free each year). If you want to give it a try first, there is a free one-week Dragon Anywhere trial available as well. There are Dragon software suites available for business users on Windows, and Dragon Anywhere syncs with them seamlessly. You also get a Dragon Anywhere subscription at no additional cost — a $150 value — with the Dragon Home and Dragon Professional desktop versions, which might be a better value depending on your needs.

  • Price: Starts at $0.024 per minute
  • Free Trial: Yes, Free Tier provides 60 audio minutes monthly for the first 12 months
  • Platforms: Most devices with a microphone
  • HIPAA- eligible and compatible with electronic health record systems
  • Integrates with AWS cloud services
  • Call Analytics extracts data and insights from customer interactions

If you need a more enterprise-grade solution, then Amazon Transcribe is one of the best speech-to-text software services for businesses large and small. It’s designed to integrate seamlessly with Amazon Web Services, so if your website and/or company already uses any of these, then setup should be a breeze. You can create text documents, transcribe conversations and videos, translate speech, and more. What really sets Amazon Transcribe apart from other speech-to-text apps (aside from its AWS integration) is its bevy of great features tailored for professional environments.

For instance, its Call Analytics feature can automatically extract useful insights from customer interactions, allowing you to tune and tailor your customer service. It’s also HIPAA-eligible and compatible with electronic health record systems for easy uploading and management of medical transcriptions and other patient data. Amazon Transcribe is purpose-built for businesses, especially larger enterprises (not to mention organizations such as hospitals), which should come as no surprise given its integration with Amazon Web Services.

Compared to other dictating software, Amazon Transcribe’s pricing structure is somewhat unique in that its monthly subscription fee is based on how many audio minutes you use, with plans starting at $0.024 per minute and scaling down in price per minute for the higher tiers. If you’re looking for the best speech-to-text software for professional business applications, Amazon Transcribe is hard to beat.

  • Price: $79 for yearly subscription, $200 for lifetime
  • Free Trial: Yes, basic free plan available
  • Platforms: Windows; companion app available for iOS and Android
  • Understands more than 100 languages
  • Acts as a virtual assistant for your PC
  • Remote PC control through Android or iOS mobile devices

If Dragon and Amazon Transcribe are overkill for your needs, Braina is one of the best speech-to-text software suites for individual users. We named it the best multipurpose program in our roundup of the best dictation software , as Braina can be considered more of a virtual assistant for your PC rather than a simple speech-to-text app. Think of it as being much like Siri or Alexa , but more focused on productivity (and much more powerful and versatile in this regard) while being also capable of excellent speech-to-text functions thanks to its impressive speech recognition A.I. that understands more than 100 languages.

If you feel like you could use a hand around the office but don’t want to actually hire a personal assistant, Braina might be worth a go. It’s one of the best speech-to-text software choices for small businesses, home offices, and individual users thanks to its excellent speech recognition capabilities and other features. Perform internet searches, dictate documents, translate different languages, record calls and meetings, set alarms and calendar reminders, sort through your files — you name it. Braina’s companion app even lets you do everything remotely via your iOS or Android phone or tablet when you’re away from your computer.

One major drawback of Braina is that the core software only works on Windows, the aforementioned iOS and Android companion app notwithstanding. Also, multiple people can use Braina without having separate accounts or subscriptions, which is a nice change of pace from most subscription-based software suites. There is a basic free plan available as well. If you want to unlock the full set of features, though, such as non-English language compatibility, then Braina will set you back $79 yearly or $200 for a lifetime key.

  • Price: Free
  • Platforms: Windows, Mac, and Linux (browser-based)
  • If you have a Google account, you already have it
  • Automatically converts text into document format
  • Cloud-based

You might already have access to one of the best speech-to-text software apps without even knowing it, as Google Docs has one build right in. Google’s browser-based word processor (part of the broader Google Drive suite of cloud-based office software) features a Voice Typing feature, and if you have a Google account and a working mic, then you’re already set up to use it. You don’t have to pay a cent for it, either, and for free software, it’s pretty good — although it naturally lacks many of the advanced features and dictation functions of the best speech-to-text software we outlined above.

Google Docs Voice Typing is very simple: You speak into your microphone, and Google Docs dumps the text into a document. It costs nothing to use, so if you’re on the fence about whether you need speech recognition at all, then Google Docs Voice Typing is a free way to try it out before you shell out any cash for any of the best speech-to-text software suites that you have to pay for. Voice Typing is great for those who just need basic dictating software without the bells and whistles offered by paid services, as well.

Since Google Docs is browser-based, you shouldn’t have to worry about platform compatibility. It’s naturally best for use on a computer rather than a mobile device; that said, you can really use it on any device with a microphone and access to Google Docs. Everything you do with Google Docs Voice Typing is automatically stored on the cloud, too, just like any other document you’d create or edit using Google Docs. The Google Drive cloud also makes it easy to share your transcriptions with friends and colleagues if you want.

Editors' Recommendations

  • The 5 best tax software suites for individuals in 2024
  • The best free antivirus software for 2023
  • The best accounting software for your small business
  • The best way to hire employees in 2022
  • The best CRM software for your business in 2022

Lucas Coll

Knowing the best way to hire employees is an important part of finding great employees online fast. However, when it comes to doing so quickly, there can be differences involved in finding the most appropriate approach. That's why we've got all the best insight into the four key ways to find employees online fast.

When time is of the essence, it's important to know exactly what to do so that you're not stuck waiting too long to employ the right candidate for your business. Time is money and if you're short on staff, you need to be able to fill those vacancies quickly. Having said that, you still want the best candidates which is why it's important to go about it the right way. Some ways are more obvious than others but this is the time for efficiency so you get the best value proposition.

Communication is an essential part of doing business online, from the simplest calls and text messages to large-scale video conferences involving dozens or even hundreds of people. Unfortunately, most of the free communication apps most of us use every day aren't really built for anything other than simple messaging and therefore aren't up to meeting the demands of modern companies.

That's why any small business looking to streamline its operations in the digital age should invest in a more comprehensive Voice over Internet Protocol (better known as a VoIP) service. But if you don't even know where to start with this, don't fret. We've got everything you need to know about the best VoIP services for small businesses to set you and your burgeoning enterprise sailing in the right direction. RingCentral

Voice over Internet Protocol, or VoIP, is a popular alternative to landlines, especially in the business world. VoIP providers deliver digital telephone services that rely on the internet for voice and video calls. The main advantages of VoIP are that you can make long-distance calls at a very affordable price and benefit from a faster connection compared to a traditional landline. 

A VoIP service is worth considering if you run a small business or make a lot of international phone calls, but comparing different VoIP providers can be challenging if you’re not familiar with the technology. We’ve compared different VoIP services to help you find the best provider to fit your needs. RingCentral

speech to text app cost

The 6 best free speech-to-text apps for creators

speech to text app cost

What type of content do you primarily create?

Discover the best free speech-to-text apps for seamless transcription! Enhance productivity with accurate and efficient voice recognition.

If you're an online creator who works with video and audio (say, a podcaster or YouTuber), chances are you spend a lot of time or money writing scripts and transcribing your content. Or, you let YouTube automatically caption your videos and hope for the best, often with colorful results .

But it doesn't have to be that way.

You don't have to spend hours manually transcribing or a ton of money for per-minute transcription services. Instead, you can use free speech-to-text software, some of which include artificial intelligence (AI) tools designed for creators , to help you get your words onto the page in minutes.

6 best free speech-to-text apps for creators

  • oTranscribe
  • Apple Dictation
  • Google Docs Voice Typing

What is a speech-to-text app?

A speech-to-text app, or dictation app, is software that lets you record your voice (or upload an audio/video file) and transcribes it into text within the app.

The technology basis of these apps is speech recognition software, which takes a recording and breaks it down into bits it can interpret, converting them into digital text. It's worth noting that speech recognition technology and voice recognition aren't the same; the latter only looks to identify a spoken voice (and often specific voice commands) rather than transcribe what’s being said.

One of the most common use cases for speech-to-text is for transcribing interviews and meetings, which makes them more accessible for those with hearing difficulties and better for SEO purposes.

However, you can also use them for transcribing voiceover videos , vlogs, audio-only podcasts, and more.

How to choose the best free speech-to-text software

In this section, we'll cover a few core features you should look out for when choosing free speech-to-text software for creating content. If the software you're looking at doesn't have these, you'll most likely need to look elsewhere.

Transcription minutes

Of course, you need your speech-to-text app to transcribe. However, not every app or tool will transcribe pre-recorded audio or video and offer 'live' transcription. For apps that do both (and if this feature is what you need), you'll want to pay attention to the amount of transcription you get for free.

On the other hand, if you only want to use speech-to-text for script planning (e.g., voicing your ideas out loud), you may only need a dictation tool that'll put your spoken words into a document. We'll be showing you tools that cater to these different needs in our comparison section below.

Format compatibility and export

If you need software or tools to help you use speech-to-text for transcribing videos and podcasts, you'll need to keep an eye out for import and export format compatibility.

If the software you're considering only accepts .wav audio files, you'll need to convert to that format if your recording is in another. On the other end of the workflow, if you need your transcription to be able to export as a Microsoft Word document, you'll need to make sure your software exports Word docs before you waste your time.

Storage and organization

Whether you're only using a dictation tool or full speech-to-text software, you'll want your words to be easily accessible. Some software (if not all) will have storage limits, so if you record a lot of content, look for one with a generous amount of storage.

You'll also want to consider the organization of your files — granted, this point is entirely subjective and depends on what kind of user interface you like to use. Since we're specifically looking at free options (or software with free plans), it won't hurt to try a few out to see which you like best.

Automatic speaker labels

If you record a podcast or other video content with guests, you'll need to be able to separate who's who in your transcription. You can manually separate speakers in your transcription, but the best way to save time here is to use software that automatically adds speaker labels.

Usually, this means the software will ask you to identify the speakers first; then, it'll handle the rest of the transcription (typically with AI).

An easy-to-use editor

The final feature you want to consider is editing. No transcription software is 100% accurate, so you'll want to use one that has a smooth and easy editor to help you get the job done faster and more easily.

6 best speech-to-text apps for creators

With all of the above in mind, let's get into the details of some of the best speech-to-text software tools currently available that are most suitable for creators.

We make this distinction because some speech-to-text software tools are specifically designed for professional industry use (e.g., medical and legal) and are costly because of that specialization.

1. De‎script

‎ Key features:

  • Automatic high-quality transcription (up to an hour free) with up to 95% accuracy
  • Automatically remove filler words and periods of silence with Descript AI tools
  • Easy document-style editing, which adjusts both the script and media
  • Highlights potential errors to help you proofread and review
  • Easily add subtitles to your video with the transcription
  • Descript supports 23+ different languages 

Upgrade options: The Creator plan (from $12/month) includes 10 transcription hours, and the Pro plan (from $24/month) includes 30 transcription hours. Each comes with even more features besides more hours.

Platforms: Web app, Windows 10 (or newer), Mac OS High Sierra (or newer).

Descript's speech-to-text transcription tool is embedded within its editor software and is one of the best free options specifically for creators. You can create a project for either an existing video to upload or record a new one straight into the software, and the audio-text feature will add the words to your script.

When I added a video of one of my virtual academic conference presentations (originally 12:53 in duration), it transcribed my words in about a minute and a half with suprising accuracy, given that I was using some highbrow academic language.

After editing, using filler words and word gap removal, I cut my video down to 11:29 in just a few seconds and made the video a lot more presentable (unfortunately for me, I didn't have Descript when I initially presented at that conference). 

Descript also lets you use Studio Sound to improve the overall sound quality—it’s free for files up to 10 minutes on the free plan, and unlimited on paid plans.

2. oT‎ranscribe

Key features:

  • A simple HTML web app means good cross-platform accessibility
  • Keyboard shortcuts for easy playback, rewind, and fast-forward
  • Integrated video player to stop tab/software switching
  • Interactive timestamps
  • Automatic saving to your browser's storage every second
  • Export to Markdown, Plain Text, and Google Docs

Upgrade options: Completely free, no plans or upgrade options.

Platforms: Web app (worked in Chrome and Safari at the time of writing).

This one, admittedly, is cheating a little. oTranscribe is technically a transcription-specific tool, so there's no speech-recognition tech involved. But it's a great tool if you want to work on your video or audio manually. For example, suppose you're using a lot of niche vocabulary (fantasy names, industry-specific terms, etc.). In that case, you can sometimes spend more time editing a generated transcript than writing it with better accuracy.

It has a simple HTML interface with a familiar-looking document editor and immediately tells you the most important keyboard shortcuts to use. Using it on the same conference video test made manual transcription much easier than I remember compared to previous projects.

While this is fine for creating a standalone transcript, it doesn't help you add captions or do anything else (e.g., text summaries, repurposing your script, etc.).

3. Di‎ctanote

  • Familiar notebook-style file organization of your notes
  • Basic text editing, which is easy to pick up
  • You can install its dedicated app instead of using the web
  • Decent speech-to-text accuracy
  • Dictation is completely free

Upgrade options: You can pay 10 cents per minute for AI transcription of existing audio files.

Platforms: Web app, Chrome app (when it asked me to install, it installed on my MacBook as a Chrome app).

If you want to use a tool to help you type as you speak, Dictanote is a great option. It's packaged as a note-taking app, where you can easily store and organize notes you've made. You can type notes as usual, but its key feature is its speech-to-text function and voice commands.

If you've never dictated before, it takes some getting used to, i.e., voicing punctuation and new lines. However, once you get the hang of it, speaking your thoughts can be much faster than typing them by hand.

This option is mainly for creators who want their creative ideas out of their heads and onto the page and provide a dedicated space for their ideas.

For the downsides, while testing the app, it didn't seem to like my AirPods when dictating (it didn't register my voice at all, even after granting permissions), and I had to switch to my Macbook Air microphone. That might be down to me not having the correct settings, but it's worth mentioning. Also, not having any free transcription options for existing media can be a deal-breaker for creators who primarily record content on the fly.

4. ‎ Apple Dictation

  • No internet connection required (with Apple Silicon devices)
  • Setting up Voice Control can add even more functionality to dictation
  • User-friendly; use it anywhere you’d usually type
  • Up to 96% accuracy

Upgrade options: Comes free with Apple devices.

Platforms: Apple Mac and iOS devices only.

To test Apple dictation, I've decided to use it to write this section of the article using the Apple Notes app, then copy and paste what I've written into my draft (with a bit of editing).

It's a great tool to help you write as you speak; what’s more, it’s entirely free because it comes embedded within Apple products, including iPhones, iPads, and MacBooks.

Another great benefit of using Apple dictation is that you can easily swap between using your voice and typing, making editing easy for simple mistakes (such as capitalizing brand names). However, when you set it up with voice commands, you can also use dictation to edit instead. Apple dictation also switches off if it doesn’t detect your voice after about 15 seconds or so.

Of course, if you're not an Apple user, Apple dictation is not the tool for you. However, Microsoft has an equivalent dictation tool with an equally reasonable accuracy rate. If you're the type of creator who likes to think out loud and can get used to voicing punctuation and new lines quickly, then Apple dictation is the right tool to help you get thoughts on the page.

As a downside, I found that Apple dictation works best with other Apple software products, such as the Notes app. The dictation keyboard shortcut doesn't work at all in Google Docs, which is likely because Google Docs has its own dictation tool, which we’ll be looking at next.

5. ‎ Google Docs Voice Typing

  • Google Docs is an extremely widely used, cross-platform tool for professionals and creators, making collaboration easy.
  • Activate voice typing with a keyboard shortcut no matter where you are on the page
  • Clear, large icon indicates you've started voice typing

Upgrade options: It comes as a free feature of Google Docs; there's no upgraded version.

Platforms: Web (I'd recommend Chrome specifically for Google Docs, but other browsers may work just as well). It may also work on the Docs app using the Gboard keyboard, but it doesn't work with the default iOS keyboard.

I've used Google Docs as the main deliverable format in my career for years, and I'd never thought to use the native Google speech-to-text feature. However, as a speech-to-text option, it works in the same way as Apple Dictation and Dictanote.

The main difference between these dictation options is the software platform and UI. If you're a creator who uses Google Docs for your ideas, transcripts, collaboration opportunities, and Google Drive for storage, then voice typing directly into Google Docs could be a great option.

However, as with the other dictation tools we've covered, they don't help you with existing media; they’re only for live speech. This lack of transcription can add to your work rather than make your workflow smoother.

6. ‎ Otter.ai

  • AI meeting assistant that keeps audio recordings, transcribes, captures slides, and generates summaries in real time.
  • Automatically integrates with Zoom, Google Meet, and MS Team to write and share notes
  • 300 transcription minutes and up to 30 minutes per conversation on the free plan
  • You can import up to 3 audio or video files for transcription (period). You get a monthly limit if you upgrade.

Upgrade options: Pro from $10/month, Business from $20/month (gets you 1,200 and 6,000 transcription minutes, respectively).

Platforms: Web, iOS app, Android app

My personal experience with Otter.ai started when a client of mine would send me interview transcripts she'd made with it. While they helped create content based on the interviews, the transcripts were never super accurate (I'd say roughly 75%).

However, using my conference presentation video, the accuracy is more within the 90% range. I imagine this huge difference comes from the fact that with more than one person speaking, it can be difficult for the AI to keep speakers separated — and on top of that, neither my client nor the interviewees ever seemed to use dedicated microphones.

For creators who post a lot of videos or audio content online, Otter.ai can be a time saver for transcribing podcast interviews you've recorded on Zoom , Google Meets, or MS Teams.

On the other hand, while you can edit the transcript within the Otter.ai software, you can't edit the media the transcript came from. So, if you need a tool to do both, Otter.ai can't help you. Otter.ai also only works in English, so if you need to use another language, you'll need to look elsewhere.

Honorable mention: Just Press Record

If you're a creator with an iPhone or Apple Watch who finds yourself coming up with content ideas in the most random places, and you typically make voice notes with the Voice Memo mobile app to record your ideas, Just Press Record is a great on-the-go speech-to-text service. It's an honorable mention here because it has a one-time purchase fee from the app store ($/£4.99).

With the iPhone app, you can record pro-level audio (if you've got a plug-in microphone), transcribe every word with high accuracy (no limits), edit the transcript in-app, sync across iCloud, and organize your notes by folder.

However, you can also cut/trim the audio to better match an edited transcript, though you have to do this manually.

Another software often cited as a great choice is Nuance Dragon Professional and Dragon Anywhere mobile app. However, upon researching, I discovered that the app has a lot of poor reviews (it's sitting at 2.4/5 on the app store at the time of writing). So, I decided not to include it in this list.

Quick tip for the best speech-to-text results

No matter which type of speech-to-text tool you use, to get the best results, you'll want to use a good-quality microphone so that the audio is as clear as possible.

If you still have trouble with inaccurate dictation or transcription, try speaking more clearly and making sure you don't have too much background noise.

Best free speech-to-text app FAQs

Is there a free app for voice-to-text transcription.

Yes. There are several free voice-to-text transcription apps available. Descript is one of the best options for creators. However, many people can use their device's onboard dictation solution with a note-taking app.

What is the best AI speech-to-text tool?

Descript is the best transcription option for creators who want to use speech-to-text alongside media editing — editing the transcript also edits the media.

On the other hand, if you don't need to edit media, Otter.ai is another great option for transcribing personal meetings and internal interviews.

What are the benefits of using a speech-to-text app?

  • Saves time. People often speak much faster than they can type, so a speech-to-text tool can help you get words onto a page more quickly.
  • Saves money. Many speech-to-text apps are reasonably accurate and free, which saves you from needing to pay for professional transcriptions (unless you really need human transcription services).

Greater accessibility. People with specific disabilities find it difficult, if not impossible, to type by hand, and so speech-to-text is a critical tool for those who need it.

Related articles

speech to text app cost

Featured articles:

speech to text app cost

32 best podcast tools to produce, edit, host, and grow your show

We scoured forums and interviewed experts to find the best podcast tools for planning episodes, editing audio, growing your audience, and more.

speech to text app cost

11 amazing Instagram video editing apps for creators

Discover the top Instagram video editing apps to take your Reels, Stories, and grid posts to the next level.

speech to text app cost

The 8 best apps for making Reels on Instagram

Discover the best apps for making Instagram Reels in this complete guide!

speech to text app cost

AI for Creators

8 best AI copywriting tools to save time

Discover the best AI copywriting tools for effortless content creation.

speech to text app cost

The best ways to remote record a podcast interview, ranked

An experienced audio engineer ranks the best ways to remote record a podcast interview, from lowest to highest quality.

speech to text app cost

9 AI content creation tools to supercharge your creativity

AI content creation is exploding, but some tools are better than others. Find the best in this guide.

Articles you might find interesting

speech to text app cost

Ultimate guide to writing a killer fiction podcast

Writing a fiction podcast is similar to writing any type of scripted material — you’ll need a gripping story, compelling characters, and the right software to get it all down.

speech to text app cost

Using EQ and compression for better video sound

Sound is a crucial component of video, and the two most important tools for editing video sound are compression and EQ.

speech to text app cost

How to edit short form video to attract the most views

Short-form video follows its own rules. Learn best practices for short-form platforms, plus tips for making the most engaging videos.

speech to text app cost

What are deepfakes? How to spot fake AI video and audio

What are deepfakes? Check out our guide and learn everything related to deepfakes’ risks and implications.

speech to text app cost

The best AI tools for podcasters: From scriptwriting to audio editing to marketing your show

You’re not going to love every part of podcasting all the time, and that’s okay. For everything else, there’s AI. We’ve rounded up the AI tools we find most useful.

speech to text app cost

How They Made It

Ross Sutherland of Imaginary Advice on podcasting's parallels with poetry

We spoke with Ross about how poetry led him to podcasting, how budget constraints led to his signature style, and why pursuing your passion as a full-time job may not be your best option.

speech to text app cost

Join millions of creators who already have a head start.

Get free recording and editing tips, and resources delivered to your inbox.

Related articles:

Share this article

  • Accessories
  • Meet the team
  • Advertise with us
  • Privacy Policy

iGeeksBlog

7 Best speech to text apps for iPhone in 2024

Darryl

Some people can type hell for leather, but we have to agree that it’s not everyone’s cup of tea. How would it be if you could type as fast as you speak? Well, why think when you can execute! While iPhone’s native Notes app can handle your basic transcribing needs, it lacks added functionality.

So, here’s a list of the best speech-to-text apps for your iPhone!

  • dictate2us Record & Transcribe
  • Rev Voice Recorder & Memos
  • Day One Journal

1. Evernote: Notes organizer – Editor’s choice

Evernote speech to text app for iPhone

Evernote is a widely used application for note-taking. However, it serves multiple purposes like clipping articles, scanning documents, and also allows adding content (pictures, videos, audio, etc.) with notes.  

Open the virtual keyboard and tap the mic below the return key to dictate text. You can now dictate, and it’ll not only be converted but also sync across your devices.

If you’re a student, you can use the app to take notes, track them, and even create separate notebooks. Further, you can also bring people, projects, and ideas together with shared Spaces. This is especially helpful for businesses or if you are working on a group project. Evernote also has multiple other features and is free to use.  

  • Scan and convert handwritten notes
  • Transcription works flawlessly
  • Numerous useful templates
  • Note linking
  • Excellent organization system
  • Cannot save data offline
  • Lean cloud storage
  • Cannot sync with more than 2 devices with the basic plan

Price : Free (In-App Purchases start at $10.00)

2. dictate2us Record & Transcribe – Fastest transcription service

dictate2us Record & Transcribe app for iPhone

Dicatate2us is one of the leaders in transcription services, and you can get the audio transcribed within an hour regardless of the time, 365 days a year. You can record letters, reports, case notes, interviews, etc.  

All you need to do is save a recording which will get uploaded to the platform. A person, yes, an actual person, will sit and transcribe the audio, after which an editor will check if there are errors, and then the Google doc file will be sent to you.  

Dicatate2us can be downloaded for free, and you get a $15 free trial credit. After the trial ends, you are charged a minute rate depending on the turnaround time. D2U uses military-grade encryption, so you need not worry about confidential data being hacked.

  • Military-grade encryption
  • Fast turnaround time
  • $15 Trial credit
  • Can’t transcribe instantly

Price : Free ($15 trial credit) 

3. Transcribe: Speech to Text – Instant transcription

Transcribe - Speech to Text app for iPhone

Some people repeatedly listen to their voice memos as reminders. However, listening to the whole message to get to something you want to hear might be a time-consuming event, and Transcribe comes in. It is among the fastest transcription apps. A tap of a button is all it takes to transcribe your audio or video files. 

It lets you transcribe any voice or video file into text. All is done with the help of AI. You get 15 minutes of the free trial, after which you will need to either upgrade to the Pro version or buy add-on hours. Transcribe Pro offers features like exporting files in TXT, PDF, and DOCX formats. You also get 5 hours of transcription every month and can save up to 50GB of files on its cloud.  

  • Instant transcription
  • Supports both audio and video files
  • Supports multiple file formats
  • Not great for longer duration transcriptions
  • Users have reported frequent app crashes

Price : Free (In-app purchase starts at $4.99)

4. Rev Voice Recorder & Memos – Simplified design

Rev Voice Recorder & Memos app for iPhone

Rev Voice is yet another transcription service that people have employed to transcribe audio. The company claims a 99% plus accuracy rate alongside fast turnaround times. You can get a 30-minute recording transcribed under 12 hours.  

The Rev Voice app has a clean and minimal user interface. Besides transcribing audio, it also lets you record audio, trim the audio, etc. You can use the recorder in the background, and it will automatically pause when you receive a call.  

While the application is free to download, Rev charges 1 credit for a minute of audio transcribed, and if you avail of their services through their website, it’ll cost you $1.50 for a minute of audio transcribed. You can also view the progress of your order, and once done, you’ll receive it in your email inbox and the app.

  • Affordable rates
  • Effortless editing
  • High accuracy rate
  • The subscription model would make the service more appealing
  • Turnaround times are longer

Price : Free (1 credit costs $1.99)

5: Dictation: Speech to text – Translate your transcription

Dictation - Speech to text app for iPhone

Dictation serves one purpose, converting speech to text, and it does the job quite decently. But there are instances where it might miss a few words. You can not only record audio in the app but also import audio files.

Further, translation is a unique feature offered by Dictation. It supports over 40 languages. All you have to do is tap a button to translate the transcribed text.

The app also has iCloud Synchronization. Once turned on, you can dictate text from your iPhone, iPad, Apple Watch, etc.

Besides, you can also share the text across several social media platforms like Twitter, WhatsApp, etc. However, the app is infested with ads. You will need to upgrade to the pro version to get rid of ads, costing $17.49 per year.

  • Translation with support for 40 languages
  • Works flawlessly across multiple iOS devices
  • Cluttered UI 
  • Infested with ads 

Price : Free (Pro version costs $4.99 per month)

6. Day One Journal: Private diary – Journalling made fun

Day One Journal app for iPhone

If you want to get into the habit of maintaining a daily journal but have been postponing the idea for a long time because you find writing to be a tedious process, Day One Journal is here to help. You can add notes to your journal at any time of the day as the app provides speech-to-text services.

It’s not just words; you can also add videos and images alongside handwritten entries or drawings to your journal. However, this feature is limited to the premium version. You can also take notes from your Apple watch. If that was not enough, you could also send text messages or emails to your journals.  Pretty impressive, right?

  • Easy to use
  • Supports and integrates images, videos, etc., in the journal
  • Can only be used as a diary

Price : Free (Premium costs $5.99 per month)

7. Otter: Transcribe voice notes – Best for businesses

Otter- Transcribe Voice Notes app for iPhone

Otter works flawlessly for transcribing text and is an excellent tool for personal use. But it shines more when put to use in a corporate environment. You can automatically take notes (through speech to text), save these notes, and share them with everyone on the team.

Another great feature of Otter is that it provides live captioning to the deaf or people with special needs. The app also lets users highlight text, and the punctuation is quite accurate.

The integration of Otter with iOS helps you start recording through Siri shortcuts and syncs notes with the iOS calendar, contacts, etc. You can also record with AirPods. The app offers all these features for a limited amount of time as a free trial, after which the Pro subscription offers 6000 transcription minutes per month. A great overall app for businesses!

  • Caters to specially-abled users
  • Great iOS integration
  • The free plan is good for personal use
  • Transcription is a hit or miss for longer audio files
  • No support for images in notes

Price : Free (Premium Costs $16.99 per month) 

Wrapping up!

Well, this was my list of the best speech-to-text apps for iPhone and iPad, which are beneficial for both students and professionals. If you found the right speech-to-text app that suits your requirements, share your thoughts in the comments below!

  • Best translation apps for iPhone and iPad
  • Best voice changer for Mac
  • Top alarm apps for iPhone and iPad
  • Best voice recorder apps for iPhone

Darryl

Consumer Technology and Motorcycles are the two things that excite Darryl the most. Why? Because Tech helps better people's lives, and solving people's problems related to tech is something he enjoys. And what about bikes, you ask? Well, drop a gear and disappear.

View all posts

🗣️ Our site is supported by our readers like you. When you purchase through our links, we earn a small commission. Read Disclaimer .

  • iPhone Apps

LEAVE A REPLY Cancel reply

Save my name, email, and website in this browser for the next time I comment.

Related Articles

Ipad and iphone apps on sale today, apps gone free today on the app store, best mac cleaner software and apps to keep your mac clean in no time, best pdf editors for iphone and ipad in 2024.

Speech to Text - Voice Typing & Transcription

Take notes with your voice for free, or automatically transcribe audio & video recordings. secure, accurate & blazing fast..

~ Proudly serving millions of users since 2015 ~

I need to >

Dictate Notes

Start taking notes, on our online voice-enabled notepad right away, for free.

Transcribe Recordings

Automatically transcribe (and optionally translate) audios & videos - upload files from your device or link to an online resource (Drive, YouTube, TikTok or other). Export to text, docx, video subtitles and more.

Speechnotes is a reliable and secure web-based speech-to-text tool that enables you to quickly and accurately transcribe your audio and video recordings, as well as dictate your notes instead of typing, saving you time and effort. With features like voice commands for punctuation and formatting, automatic capitalization, and easy import/export options, Speechnotes provides an efficient and user-friendly dictation and transcription experience. Proudly serving millions of users since 2015, Speechnotes is the go-to tool for anyone who needs fast, accurate & private transcription. Our Portfolio of Complementary Speech-To-Text Tools Includes:

Voice typing - Chrome extension

Dictate instead of typing on any form & text-box across the web. Including on Gmail, and more.

Transcription API & webhooks

Speechnotes' API enables you to send us files via standard POST requests, and get the transcription results sent directly to your server.

Zapier integration

Combine the power of automatic transcriptions with Zapier's automatic processes. Serverless & codeless automation! Connect with your CRM, phone calls, Docs, email & more.

Android Speechnotes app

Speechnotes' notepad for Android, for notes taking on your mobile, battle tested with more than 5Million downloads. Rated 4.3+ ⭐

iOS TextHear app

TextHear for iOS, works great on iPhones, iPads & Macs. Designed specifically to help people with hearing impairment participate in conversations. Please note, this is a sister app - so it has its own pricing plan.

Audio & video converting tools

Tools developed for fast - batch conversions of audio files from one type to another and extracting audio only from videos for minimizing uploads.

Our Sister Apps for Text-To-Speech & Live Captioning

Complementary to Speechnotes

Reads out loud texts, files & web pages

Reads out loud texts, PDFs, e-books & websites for free

Speechlogger

Live Captioning & Translation

Live captions & translations for online meetings, webinars, and conferences.

Need Human Transcription? We Can Offer a 10% Discount Coupon

We do not provide human transcription services ourselves, but, we partnered with a UK company that does. Learn more on human transcription and the 10% discount .

Dictation Notepad

Start taking notes with your voice for free

Speech to Text online notepad. Professional, accurate & free speech recognizing text editor. Distraction-free, fast, easy to use web app for dictation & typing.

Speechnotes is a powerful speech-enabled online notepad, designed to empower your ideas by implementing a clean & efficient design, so you can focus on your thoughts. We strive to provide the best online dictation tool by engaging cutting-edge speech-recognition technology for the most accurate results technology can achieve today, together with incorporating built-in tools (automatic or manual) to increase users' efficiency, productivity and comfort. Works entirely online in your Chrome browser. No download, no install and even no registration needed, so you can start working right away.

Speechnotes is especially designed to provide you a distraction-free environment. Every note, starts with a new clear white paper, so to stimulate your mind with a clean fresh start. All other elements but the text itself are out of sight by fading out, so you can concentrate on the most important part - your own creativity. In addition to that, speaking instead of typing, enables you to think and speak it out fluently, uninterrupted, which again encourages creative, clear thinking. Fonts and colors all over the app were designed to be sharp and have excellent legibility characteristics.

Example use cases

  • Voice typing
  • Writing notes, thoughts
  • Medical forms - dictate
  • Transcribers (listen and dictate)

Transcription Service

Start transcribing

Fast turnaround - results within minutes. Includes timestamps, auto punctuation and subtitles at unbeatable price. Protects your privacy: no human in the loop, and (unlike many other vendors) we do NOT keep your audio. Pay per use, no recurring payments. Upload your files or transcribe directly from Google Drive, YouTube or any other online source. Simple. No download or install. Just send us the file and get the results in minutes.

  • Transcribe interviews
  • Captions for Youtubes & movies
  • Auto-transcribe phone calls or voice messages
  • Students - transcribe lectures
  • Podcasters - enlarge your audience by turning your podcasts into textual content
  • Text-index entire audio archives

Key Advantages

Speechnotes is powered by the leading most accurate speech recognition AI engines by Google & Microsoft. We always check - and make sure we still use the best. Accuracy in English is very good and can easily reach 95% accuracy for good quality dictation or recording.

Lightweight & fast

Both Speechnotes dictation & transcription are lightweight-online no install, work out of the box anywhere you are. Dictation works in real time. Transcription will get you results in a matter of minutes.

Super Private & Secure!

Super private - no human handles, sees or listens to your recordings! In addition, we take great measures to protect your privacy. For example, for transcribing your recordings - we pay Google's speech to text engines extra - just so they do not keep your audio for their own research purposes.

Health advantages

Typing may result in different types of Computer Related Repetitive Strain Injuries (RSI). Voice typing is one of the main recommended ways to minimize these risks, as it enables you to sit back comfortably, freeing your arms, hands, shoulders and back altogether.

Saves you time

Need to transcribe a recording? If it's an hour long, transcribing it yourself will take you about 6! hours of work. If you send it to a transcriber - you will get it back in days! Upload it to Speechnotes - it will take you less than a minute, and you will get the results in about 20 minutes to your email.

Saves you money

Speechnotes dictation notepad is completely free - with ads - or a small fee to get it ad-free. Speechnotes transcription is only $0.1/minute, which is X10 times cheaper than a human transcriber! We offer the best deal on the market - whether it's the free dictation notepad ot the pay-as-you-go transcription service.

Dictation - Free

  • Online dictation notepad
  • Voice typing Chrome extension

Dictation - Premium

  • Premium online dictation notepad
  • Premium voice typing Chrome extension
  • Support from the development team

Transcription

$0.1 /minute.

  • Pay as you go - no subscription
  • Audio & video recordings
  • Speaker diarization in English
  • Generate captions .srt files
  • REST API, webhooks & Zapier integration

Compare plans

Privacy policy.

We at Speechnotes, Speechlogger, TextHear, Speechkeys value your privacy, and that's why we do not store anything you say or type or in fact any other data about you - unless it is solely needed for the purpose of your operation. We don't share it with 3rd parties, other than Google / Microsoft for the speech-to-text engine.

Privacy - how are the recordings and results handled?

- transcription service.

Our transcription service is probably the most private and secure transcription service available.

  • HIPAA compliant.
  • No human in the loop. No passing your recording between PCs, emails, employees, etc.
  • Secure encrypted communications (https) with and between our servers.
  • Recordings are automatically deleted from our servers as soon as the transcription is done.
  • Our contract with Google / Microsoft (our speech engines providers) prohibits them from keeping any audio or results.
  • Transcription results are securely kept on our secure database. Only you have access to them - only if you sign in (or provide your secret credentials through the API)
  • You may choose to delete the transcription results - once you do - no copy remains on our servers.

- Dictation notepad & extension

For dictation, the recording & recognition - is delegated to and done by the browser (Chrome / Edge) or operating system (Android). So, we never even have access to the recorded audio, and Edge's / Chrome's / Android's (depending the one you use) privacy policy apply here.

The results of the dictation are saved locally on your machine - via the browser's / app's local storage. It never gets to our servers. So, as long as your device is private - your notes are private.

Payments method privacy

The whole payments process is delegated to PayPal / Stripe / Google Pay / Play Store / App Store and secured by these providers. We never receive any of your credit card information.

More generic notes regarding our site, cookies, analytics, ads, etc.

  • We may use Google Analytics on our site - which is a generic tool to track usage statistics.
  • We use cookies - which means we save data on your browser to send to our servers when needed. This is used for instance to sign you in, and then keep you signed in.
  • For the dictation tool - we use your browser's local storage to store your notes, so you can access them later.
  • Non premium dictation tool serves ads by Google. Users may opt out of personalized advertising by visiting Ads Settings . Alternatively, users can opt out of a third-party vendor's use of cookies for personalized advertising by visiting https://youradchoices.com/
  • In case you would like to upload files to Google Drive directly from Speechnotes - we'll ask for your permission to do so. We will use that permission for that purpose only - syncing your speech-notes to your Google Drive, per your request.

Scraibe - Speech to Text 4+

Fast & accurate transcripts, florian ernst, designed for ipad.

  • 4.7 • 3 Ratings
  • Offers In-App Purchases

Screenshots

Description.

Meet Scraibe: The easiest way to turn Audio & Video files into Text! Direct File-to-Text Transcription Leverage the precision of OpenAI's Whisper and Apple's Neural Engine with Scraibe. Convert your audio and video files into readable text directly on your device or through our swift cloud-based service. Perfect for all your transcription needs. Unwavering Commitment to Privacy For on-device transcription, rest assured: your files and transcriptions stay with you. When utilizing our cloud option, we prioritize data safety and confidentiality. Broad Format Compatibility From podcasts to interviews, Scraibe seamlessly handles a wide range of audio and video formats. Plus, fine-tune your results with our audio track selection tool. Multilingual Support at Its Best Catering to a global audience? Scraibe recognizes and transcribes 90+ languages. And if you require English translations, we’re on it. Structured & Ready to Share Stay organized with Scraibe's user-friendly timeline and folder system. When it's sharing time, choose between TXT and SRT formats. Batch & Unlimited Cloud Transcriptions Queue files for on-device transcription or harness the power of cloud for unlimited parallel transcriptions. Your choice, our expertise. Transparent Pricing Model On-device services? One payment, full access. Cloud-based transcriptions? Use credits as you go. No subscriptions, no hidden fees. Privacy Policy: https://scraibe.app/privacy Terms of Service: https://scraibe.app/tos

Version 1.2.8

- Small bug fixes.

Ratings and Reviews

Good start but needs to become a share target.

good but can't share files from other apps to this app

Developer Response ,

Hey DoomBlah, thanks for your feedback, I'm happy to let you know that Scraibe now (version 1.2.6) supports importing files from other apps via the "Share" functionality!

App Privacy

The developer, Florian Ernst , indicated that the app’s privacy practices may include handling of data as described below. For more information, see the developer’s privacy policy .

Data Not Linked to You

The following data may be collected but it is not linked to your identity:

  • Diagnostics

Privacy practices may vary, for example, based on the features you use or your age. Learn More

Information

  • On-Device Lifetime Pro $24.99
  • 300 Cloud-Minutes $9.99
  • 1000 Cloud-Minutes $19.99
  • 4000 Cloud-Minutes $49.99
  • Developer Website
  • App Support
  • Privacy Policy

More By This Developer

NutrisnapAI - Food Tracker

Long Weekends

You Might Also Like

VOMO: AI Voice Memos

Transcribe voice to text - Pro

AudioNotes: Speech To Text

Speech To Text & Whisper

Lexi: write well by talking

Voice to Text AI

Plans & Pricing

Automatic transcription, otterpilot™ for sales, recording and playback, collaboration, administration and security, product support, frequently asked questions.

websights

speech to text app cost

Speech to text

An AI Speech feature that accurately transcribes spoken audio to text.

Make spoken audio actionable

Quickly and accurately transcribe audio to text in more than 100 languages and variants. Customize models to enhance accuracy for domain-specific terminology. Get more value from spoken audio by enabling search or analytics on transcribed text or facilitating action—all in your preferred programming language.

speech to text app cost

High-quality transcription

Get accurate audio to text transcriptions with state-of-the-art speech recognition.

speech to text app cost

Customizable models

Add specific words to your base vocabulary or build your own speech-to-text models.

speech to text app cost

Flexible deployment

Run Speech to Text anywhere—in the cloud or at the edge in containers.

speech to text app cost

Production-ready

Access the same robust technology that powers speech recognition across Microsoft products.

Accurately transcribe speech from various sources

Convert audio to text from a range of sources, including  microphones ,  audio files , and  blob storage . Use speaker diarisation to determine who said what and when. Get readable transcripts with automatic formatting and punctuation.

Customize speech models to your needs

Tailor your speech models to understand organization- and industry-specific terminology. Overcome speech recognition barriers such as background noise, accents, or unique vocabulary.  Customize your models  by uploading audio data and transcripts. Automatically  generate custom models using Office 365 data  to optimize speech recognition accuracy for your organization.

Deploy anywhere

Run Speech to Text wherever your data resides. Build speech applications that are optimized for robust cloud capabilities and on-premises using  containers .

Fuel App Innovation with Cloud AI Services

Learn 5 key ways your organization can get started with AI to realize value quickly.

The report titled Fuel App Innovation with Cloud AI Services

Comprehensive privacy and security

AI Speech, part of Azure AI Services, is  certified  by SOC, FedRAMP, PCI DSS, HIPAA, HITECH, and ISO.

View and delete your custom speech data and models at any time. Your data is encrypted while it's in storage.

Your data remains yours. Your audio input and transcription data aren't logged during audio processing.

Backed by Azure infrastructure, AI Speech offers enterprise-grade security, availability, compliance, and manageability.

Comprehensive security and compliance, built in

Microsoft invests more than $1 billion annually on cybersecurity research and development.

speech to text app cost

We employ more than 3,500 security experts who are dedicated to data security and privacy.

speech to text app cost

Azure has more certifications than any other cloud provider. View the comprehensive list .

speech to text app cost

Flexible pricing gives you the control you need

With Speech to Text, pay as you go based on the number of hours of audio you transcribe, with no upfront costs.

Get started with an Azure free account

speech to text app cost

After your credit, move to  pay as you go  to keep building with the same free services. Pay only if you use more than your free monthly amounts.

speech to text app cost

Documentation and resources

Get started.

Browse the  documentation

Create an AI Speech service with the  Microsoft Learn course

Explore code samples

Check out our  sample code

See customization resources

Explore and customize your voice-to-text solution with  Speech Studio . No code required.

Frequently asked questions about Speech to Text

What is speech to text.

It is a feature within the Speech service that accurately and quickly transcribes audio to text.

What are Azure AI Services?

AI Services  are a collection of customizable, prebuilt AI models that can be used to add AI to applications. There are a variety of domains, including Speech, Decision, Language, and Vision. Speech to Text is one feature within the Speech service. Other Speech related features include  Text to Speech ,  Speech Translation , and  Speaker Recognition . An example of a Decision service is  Personalizer , which allows you to deliver personalized, relevant experiences. Examples of AI Languages include  Language Understanding ,  Text Analytics  for natural language processing,  QnA Maker  for FAQ experiences, and  Translator  for language translation.

Start building with AI Services

Straightforward pricing that rewards you as you grow

Whether you’re just exploring or ready for commitment, our pricing plans make it easy to get started. All plans give you access to speech-to-text, audio intelligence, and text-to-speech   models and endpoints.

Speechify pricing.

Upgrade to Speechify Premium and power through your reading with our most advanced tools and features.  

  • 30+ natural, human-like voices  
  • 15+ exclusive languages  
  • 5x faster listening speeds
  • Advanced highlighting, note taking, and importing tools​

Speechify Text to Speech Pricing & Plans

Pricing for all budgets. Try it for free or get the full Premium experience.

SPEECHIFY LIMITED Free  

  • 10 standard reading voices
  • Listen at speeds up to 1x
  • Listen anywhere
  • Text to speech features only

SPEECHIFY PREMIUM $139/y

  • 30+ high quality, natural reading voices
  • 20+ different languages
  • Scan and listen to any printed text
  • Listen at 5x faster speeds
  • Advanced skipping and importing

Speechify Studio Pricing

Get our entire suite of AI studio products bundled into one transparent price.

Pricing Plans

Simple way to get started

$0 per month forever

  • No Downloads
  • AI Voice Over
  • Video, Slide, and Image support
  • Try all 200+ voices
  • All 20+ languages & accents
  • Support adding pauses
  • 10 minutes of voice generation
  • Support adjusting pronunciation
  • Support uploading of .txt, .docx, .srt scripts, as well as Youtube URLs

The basics for individuals

$69 per month / user

Everything in Free

  • Download as video, audio, or text
  • Video and audio Dubbing
  • Video and audio Transcription
  • 50 hours of voice generation per user/year
  • 12 hours of Dubbing per user/year
  • 50 hours of Transcription per user/year
  • Commercial usage rights
  • 8000+ licensed soundtracks
  • Thousands of Stock Images & Videos

MOST POPULAR

Professional

For professionals and teams

$99 per month / user

Everything in Basic

  • Voice Cloning
  • 100 hours of voice generation per user/year
  • 36 hours of Dubbing per user/year
  • 100 hours Video and Audio Transcription
  • 1 hour of AI Avatar Video/year

Customizable capability based on your business needs

Everything in Professional

  • Multiple seats
  • 1,000+ hours of voice generation per user/year
  • 500+ hours of Dubbing per user/year
  • 1,000+ hours Video and Audio Transcription
  • 20+ hours of AI Avatar Video/year
  • White Glove Procurement Assistance
  • Dedicated Customer Success Manager
  • Share, Editing, Commenting & Enterprise Collaboration Features
  • Custom Invoices
  • SOC2 Compliant
  • Company-wide on-boarding & Training

Speechify Text to Speech API Pricing

We’re thrilled to unveil the development of a text-to-speech API that delivers Speechify’s most natural and beloved AI voices directly to developers worldwide.

Speechify Audiobooks Pricing & Plans

Speechify audiobooks $9.99/month $120 billed annually..

  • Actor-narrated audiobooks
  • 1 free credit with trial
  • 12 credits per year
  • Access to 60,000+ titles
  • Newest releases
  • All best-sellers
  • 1000’s of free audiobooks

Require custom solutions across any of our products?

Talk to a sales engineer

Works everywhere you do

  • Available on iOS and Android
  • Hear anything you visit on Safari with the Speechify Safari extension
  • Syncs automatically with desktop
  • Listen to articles, Google Docs, or anything in Gmail with our Chrome extension
  • Listen to PDFs, articles, and more in the Web App library Save anything for later
  • Syncs automatically with mobile  

Chrome extension

Safari extension.

Have questions? Get help & support

Join millions of listeners

Chrome extension, apple store, google play, speechify free vs premium, don’t take our word for it.

speechify is giving me the ounce of hope i needed to stay in college — sophie brown 🌱 (she/her) (@sophgbrown) January 24, 2022
Speechify has saved my academic career! If you’re a student that struggles with reading comprehension or is a primarily auditory learner, get Speechify for school. You can scan any book and it will read it to you (works for online stuff as well). Thank you @Speechify_audio ! — Leila (@lkbmotion) June 17, 2021
For all you learners out there there is an app called Speechify that will turn pretty much any digital article, book, etc into an audio book or quick listen for you. It’s an absolutely game changer and I’m not even getting paid to tweet this. @Speechify_audio — Sean Buchanan (@Buc_28) September 10, 2021
Oh my god! Oh my god! @Speechify_audio changed my life a 12 hour long book converted to a 7 hour book 🔥🔥🔥 — Vecham Ram (@vsriram_2001) August 14, 2021
Raise your hand if you have ever asked yourself: “whyyyyy does it take me so long to read ONE article?!” 😵‍💫🙋🏻‍♀️ @OpenAcademics — Sandra Portocarrero, Ph.D. (@SandraNoDuerme) November 18, 2021
Might be late to the party with this, but I’ve added @Speechify_audio to my Chrome browser and it helps SO MUCH. I find it hard to read big documents, so now I can do the work while listening to important docs 🤯 — Matt Clements (@ClemFandangoX) November 26, 2021
If you do a lot of reading of briefs, books, presentations, coursework, transcripts, reports, etc, I *highly recommend @Speechify_audio . 📚 #ProductivityHacks — Collette Smith, PhD💎 (@DrCJASmith) February 15, 2022
On a whim, I got the @Speechify_audio app to have my scripts read to me to help memorize, you can choose the voice, and I chose Gwyneth Paltrow’s voice. Très magnifique 💋👌🏽I feel like I’m in Brentwood already 😎 — Salma 💋 (@salma_khan) January 22, 2022
Thank you @Speechify_audio for actually changing my life right now! I am finally processing journal articles without wanting to cry/ breakdown — Victoria Williamson #ActuallyAutistic #XR (@asdpsychnerd) February 10, 2022

Only available on iPhone and iPad

To access our catalog of 100,000+ audiobooks, you need to use an iOS device.

Coming to Android soon...

Join the waitlist

Enter your email and we will notify you as soon as Speechify Audiobooks is available for you.

You’ve been added to the waitlist. We will notify you as soon as Speechify Audiobooks is available for you.

man at desk connected to sound bars and documents

IBM Watson® Speech to Text technology enables fast and accurate speech transcription in multiple languages for a variety of use cases, including but not limited to customer self-service, agent assistance and speech analytics. Get started fast with our advanced machine learning models out-of-the-box or customize them for your use case.

IBM Watson Speech to Text is now available as a containerized library for IBM partners to embed AI technology in their commercial applications.

Our best-in-class AI, embedded within Watson Speech to Text, truly understands your customers.

Train Watson Speech to Text on your unique domain language and specific audio characteristics.

Enjoy the security of IBM’s world-class data governance practices.

Built to support global languages and deployable on any cloud — public, private, hybrid, multicloud, or on-premises.

Enable your voice applications using neural technologies for speech recognition powered by IBM Watson.

Improve speech recognition accuracy for your use case with language and acoustic training options.

Activate your voice application with speech models tuned for the customer care domain.

Improve speech recognition accuracy for extracting phrases, words, letters, numbers or lists.

Use our models optimized for low latency in real-time speech applications.

Analyze and correct weak audio signals before transcription begins.

Improve application response times by using speech transcription as it is generated and throughout the finalization process.

Transcribe dates, times, numbers, currency values, email and website addresses in your final transcripts by converting them into conventional forms.

Recognize who said what in a multi-participant voice exchange. Currently optimized for two-way call center conversations but can detect up to 6 different speakers.

Filter for specific words or inappropriate content by using our keyword spotting and profanity filtering features. (US English only)

Accelerate your business growth as an Independent Software Vendor (ISV) by innovating with IBM. Partner with us to deliver enhanced commercial solutions embedded with AI to better address clients’ needs.

Build AI-based solutions faster with IBM embeddable AI

Hear how a large call center transformed its operations with AI. (2:31)

Get started for free or view a demo . 

500 minutes of free speech recognition a month and 38 pre-trained speech models.

As low as USD 0.01 per minute

Tune your speech models to improve accuracy in recognition as well as transcription. Plus version includes unlimited minutes per month and 100 concurrent transcriptions.

Contact us for pricing

Provides large and security-sensitive firms with more capacity and data protection. Premium includes unlimited minutes per month and unlimited concurrent transcriptions.

Deploy Anywhere

Deploy behind your firewall or on any cloud with the flexibility of  IBM Cloud Pak for Data . The Deploy Anywhere version includes unlimited minutes per month and unlimited concurrent transcriptions, along with noise detection, speech customization and data isolation. 

Technical API specifications for all of your development needs.

The Watson SDK repository in GitHub.

See documentation about our enhanced security features that ensure your data is isolated and encrypted end-to-end, while in transit and at rest.

Learn how to create custom speech models using IBM Watson quickly — without knowing how to code.

Read about Watson Speech to Text requirements, the methodology and some best practices inspired by actual clients.

Guidelines on how to add a new or existing virtual assistant to your brand-new Watson IVR.

Improve customer engagement by interacting with users in their own language using any written text.

Solve customer issues the first time using an AI virtual assistant across any application, device, or channel.

Infuse powerful natural language AI into commercial applications with a containerized library designed to empower IBM partners with greater flexibility.

See Watson Speech to Text capabilities in action.

Google Gemini: Everything you need to know about the new generative AI platform

speech to text app cost

Google’s trying to make waves with Gemini, its flagship suite of generative AI models, apps and services.

So what is Gemini? How can you use it? And how does it stack up to the competition ?

To make it easier to keep up with the latest Gemini developments, we’ve put together this handy guide, which we’ll keep updated as new Gemini models, features and news about Google’s plans for Gemini are released.

What is Gemini?

Gemini is Google’s long-promised , next-gen GenAI model family, developed by Google’s AI research labs DeepMind and Google Research. It comes in three flavors:

  • Gemini Ultra , the most performant Gemini model.
  • Gemini Pro , a “lite” Gemini model.
  • Gemini Nano , a smaller “distilled” model that runs on mobile devices like the Pixel 8 Pro .

All Gemini models were trained to be “natively multimodal” — in other words, able to work with and use more than just words. They were pretrained and fine-tuned on a variety of audio, images and videos, a large set of codebases and text in different languages.

This sets Gemini apart from models such as Google’s own LaMDA , which was trained exclusively on text data. LaMDA can’t understand or generate anything other than text (e.g., essays, email drafts), but that isn’t the case with Gemini models.

What’s the difference between the Gemini apps and Gemini models?

Google's Bard

Image Credits: Google

Google, proving once again that it lacks a knack for branding, didn’t make it clear from the outset that Gemini is separate and distinct from the Gemini apps on the web and mobile (formerly Bard). The Gemini apps are simply an interface through which certain Gemini models can be accessed — think of it as a client for Google’s GenAI.

Incidentally, the Gemini apps and models are also totally independent from Imagen 2 , Google’s text-to-image model that’s available in some of the company’s dev tools and environments.

What can Gemini do?

Because the Gemini models are multimodal, they can in theory perform a range of multimodal tasks, from transcribing speech to captioning images and videos to generating artwork. Some of these capabilities have reached the product stage yet (more on that later), and Google’s promising all of them — and more — at some point in the not-too-distant future.

Of course, it’s a bit hard to take the company at its word.

Google seriously underdelivered with the original Bard launch. And more recently it ruffled feathers with a video purporting to show Gemini’s capabilities that turned out to have been heavily doctored and was more or less aspirational.

Google’s best Gemini demo was faked

Still, assuming Google is being more or less truthful with its claims, here’s what the different tiers of Gemini will be able to do once they reach their full potential:

Gemini Ultra

Google says that Gemini Ultra — thanks to its multimodality — can be used to help with things like physics homework, solving problems step-by-step on a worksheet and pointing out possible mistakes in already filled-in answers.

Gemini Ultra can also be applied to tasks such as identifying scientific papers relevant to a particular problem, Google says — extracting information from those papers and “updating” a chart from one by generating the formulas necessary to re-create the chart with more recent data.

Gemini Ultra technically supports image generation, as alluded to earlier. But that capability hasn’t made its way into the productized version of the model yet — perhaps because the mechanism is more complex than how apps such as ChatGPT generate images. Rather than feed prompts to an image generator (like DALL-E 3 , in ChatGPT’s case), Gemini outputs images “natively,” without an intermediary step.

Gemini Ultra is available as an API through Vertex AI, Google’s fully managed AI developer platform, and AI Studio, Google’s web-based tool for app and platform developers. It also powers the Gemini apps — but not for free. Access to Gemini Ultra through what Google calls Gemini Advanced requires subscribing to the Google One AI Premium Plan, priced at $20 per month.

The AI Premium Plan also connects Gemini to your wider Google Workspace account — think emails in Gmail, documents in Docs, presentations in Sheets and Google Meet recordings. That’s useful for, say, summarizing emails or having Gemini capture notes during a video call.

Google says that Gemini Pro is an improvement over LaMDA in its reasoning, planning and understanding capabilities.

An independent study by Carnegie Mellon and BerriAI researchers found that the initial version of Gemini Pro was indeed better than OpenAI’s GPT-3.5 at handling longer and more complex reasoning chains. But the study also found that, like all large language models, this version of Gemini Pro particularly struggled with mathematics problems involving several digits, and users found examples of bad reasoning and obvious mistakes.

Early impressions of Google’s Gemini aren’t great

Google promised remedies, though — and the first arrived in the form of Gemini 1.5 Pro .

Designed to be a drop-in replacement, Gemini 1.5 Pro is improved in a number of areas compared with its predecessor, perhaps most significantly in the amount of data that it can process. Gemini 1.5 Pro can take in ~700,000 words, or ~30,000 lines of code — 35x the amount Gemini 1.0 Pro can handle. And — the model being multimodal — it’s not limited to text. Gemini 1.5 Pro can analyze up to 11 hours of audio or an hour of video in a variety of different languages, albeit slowly (e.g., searching for a scene in a one-hour video takes 30 seconds to a minute of processing).

Gemini 1.5 Pro entered public preview on Vertex AI in April .

An additional endpoint, Gemini Pro Vision, can process text and imagery — including photos and video — and output text along the lines of OpenAI’s GPT-4 with Vision model.

Gemini

Using Gemini Pro in Vertex AI. Image Credits: Gemini

Within Vertex AI, developers can customize Gemini Pro to specific contexts and use cases using a fine-tuning or “grounding” process. Gemini Pro can also be connected to external, third-party APIs to perform particular actions.

Google brings Gemini Pro to Vertex AI

In AI Studio, there’s workflows for creating structured chat prompts using Gemini Pro. Developers have access to both Gemini Pro and the Gemini Pro Vision endpoints, and they can adjust the model temperature to control the output’s creative range and provide examples to give tone and style instructions — and also tune the safety settings.

Gemini Nano

Gemini Nano is a much smaller version of the Gemini Pro and Ultra models, and it’s efficient enough to run directly on (some) phones instead of sending the task to a server somewhere. So far, it powers a couple of features on the Pixel 8 Pro, Pixel 8 and Samsung Galaxy S24, including Summarize in Recorder and Smart Reply in Gboard.

The Recorder app, which lets users push a button to record and transcribe audio, includes a Gemini-powered summary of your recorded conversations, interviews, presentations and other snippets. Users get these summaries even if they don’t have a signal or Wi-Fi connection available — and in a nod to privacy, no data leaves their phone in the process.

Gemini Nano is also in Gboard, Google’s keyboard app. There, it powers a feature called Smart Reply, which helps to suggest the next thing you’ll want to say when having a conversation in a messaging app. The feature initially only works with WhatsApp but will come to more apps over time, Google says.

And in the Google Messages app on supported devices, Nano enables Magic Compose, which can craft messages in styles like “excited,” “formal” and “lyrical.”

Is Gemini better than OpenAI’s GPT-4?

Google has several times touted Gemini’s superiority on benchmarks, claiming that Gemini Ultra exceeds current state-of-the-art results on “30 of the 32 widely used academic benchmarks used in large language model research and development.” The company says that Gemini 1.5 Pro, meanwhile, is more capable at tasks like summarizing content, brainstorming and writing than Gemini Ultra in some scenarios; presumably this will change with the release of the next Ultra model.

But leaving aside the question of whether benchmarks really indicate a better model, the scores Google points to appear to be only marginally better than OpenAI’s corresponding models. And — as mentioned earlier — some early impressions haven’t been great, with users and academics pointing out that the older version of Gemini Pro tends to get basic facts wrong, struggles with translations and gives poor coding suggestions.

How much does Gemini cost?

Gemini 1.5 Pro is free to use in the Gemini apps and, for now, AI Studio and Vertex AI.

Once Gemini 1.5 Pro exits preview in Vertex, however, the model will cost $0.0025 per character while output will cost $0.00005 per character. Vertex customers pay per 1,000 characters (about 140 to 250 words) and, in the case of models like Gemini Pro Vision, per image ($0.0025).

Let’s assume a 500-word article contains 2,000 characters. Summarizing that article with Gemini 1.5 Pro would cost $5. Meanwhile, generating an article of a similar length would cost $0.1.

Ultra pricing has yet to be announced.

Where can you try Gemini?

The easiest place to experience Gemini Pro is in the Gemini apps . Pro and Ultra are answering queries in a range of languages.

Gemini Pro and Ultra are also accessible in preview in Vertex AI via an API. The API is free to use “within limits” for the time being and supports certain regions, including Europe, as well as features like chat functionality and filtering.

Elsewhere, Gemini Pro and Ultra can be found in AI Studio. Using the service, developers can iterate prompts and Gemini-based chatbots and then get API keys to use them in their apps — or export the code to a more fully featured IDE.

Code Assist (formerly Duet AI for Developers ), Google’s suite of AI-powered assistance tools for code completion and generation, is using Gemini models. Developers can perform “large-scale” changes across codebases, for example updating cross-file dependencies and reviewing large chunks of code.

Google’s brought Gemini models to its dev tools for Chrome and Firebase mobile dev platform, and its database creation and management tools . And it’s launched new security products underpinned by Gemini , like Gemini in Threat Intelligence, a component of Google’s Mandiant cybersecurity platform that can analyze large portions of potentially malicious code and let users perform natural language searches for ongoing threats or indicators of compromise.

Gemini Nano is on the Pixel 8 Pro, Pixel 8 and Samsung Galaxy S24 — and will come to other devices in the future. Developers interested in incorporating the model into their Android apps can sign up  for a sneak peek.

Is Gemini coming to the iPhone?

It might! Apple and Google are reportedly in talks to put Gemini to use for a number of features to be included in an upcoming iOS update later this year. Nothing’s definitive, as Apple is also reportedly in talks with OpenAI and has been working on developing its own GenAI capabilities .

This post was originally published Feb. 16, 2024 and has since been updated to include new information about Gemini and Google’s plans for it.

  • Share full article

For more audio journalism and storytelling, download New York Times Audio , a new iOS app available for news subscribers.

The Secret Push That Could Ban TikTok

U.s. lawmakers have long worried that the chinese government could use the app to spread propaganda..

This transcript was created using speech recognition software. While it has been reviewed by human transcribers, it may contain errors. Please review the episode audio before quoting from this transcript and email [email protected] with any questions.

From “The New York Times,” I’m Sabrina Tavernise. And this is “The Daily.”

[THEME MUSIC]

American lawmakers have tried for years to ban the video app TikTok over concerns that its ties to China pose a national security risk. Last week, they passed a law that might finally do it. Today, my colleague, Sapna Maheshwari, on the secret effort behind the law and what a ban would mean for the company’s 170 million American users.

It’s Tuesday, April 30.

So Sapna, tell me about this law that just passed that potentially bans the social media app TikTok. We’ve seen efforts in the past to rein in TikTok, but this one really seems like the most substantial yet.

It’s a huge deal. What this law really does is it puts the future of this hugely popular app with 170 million American users into question. TikTok has reshaped the way many people listen to music. It’s changed the way we cook. It’s made a whole different kind of celebrity.

But it’s never been able to shake these concerns around the fact that it has really close ties to China. It’s owned by a Chinese company, ByteDance. And lawmakers, for years, have been worried that the Chinese government could somehow use ByteDance and TikTok to get information on Americans or possibly spread propaganda.

President Trump tried to ban it in 2020. The State of Montana tried to ban this app last year. TikTok has largely survived those challenges, but this time it could actually be banned in the United States.

So let’s talk about this. Why did this effort succeed where the other ones failed?

So it’s an interesting story.

Here we go.

The committee will come to order.

And it really dates back to this hearing about a year ago that Congress had with Shou Chew, the CEO of TikTok.

Mr. Chew, you are here because the American people need the truth about the threat TikTok poses to our national and personal security.

Members of the committee, thank you for your time.

— TikTok has repeatedly said that it has addressed these national security concerns and that there’s no issue here. And you can hear that when Shou Chew testified.

Let me start by addressing a few misconceptions about ByteDance of which we are a subsidiary. ByteDance is not owned or controlled by the Chinese government.

He’s saying American investors are behind ByteDance.

Now, TikTok itself is not available in mainland China. We’re headquartered in Los Angeles and in Singapore.

And I myself am Singaporean. I live in Singapore.

The bottom line is this — American data, stored on American soil, by an American company, overseen by American personnel.

And we are actually going above and beyond what American technology companies do to keep things safe.

And I look forward to your questions. Thank you very much.

And is Congress convinced by that?

Congress is not convinced by that.

ByteDance is a Chinese company?

Well, ByteDance owns many businesses that operates in China.

Is it or is it not a Chinese company?

Congressman, the way we look at it, it was founded by Chinese entrepreneurs.

No, no, no, no. I’m not asking how you look at it. Fact, is it a Chinese company or not? For example, Dell is a company —

It was this really fiery, five-hour hearing, where Republicans and Democrats asked really contentious questions.

We do not promote or remove content at the request of the Chinese government. Will you repeat —

The question is, are you percent certain that they cannot use your company to promote such messages?

It is our commitment to this committee and all our users that we will keep this free from any manipulation by any government.

OK. If you can’t say it 100 percent certain I take that as a no.

There’s this underlying sense of distrust around the company and its ties to China.

I will remind you that making false or misleading statements to Congress is a federal crime.

I understand. Again, you can go on our platform. You will find that content.

And it becomes clear through the hearing that, across the board, Republicans and Democrats largely feel the same way.

Mr. Chew, I got to hand it to you. You’ve actually done something that in the last three to four years has not happened except for the exception of maybe Vladimir Putin. You have unified Republicans and Democrats.

So within weeks of this hearing, this small group of lawmakers gets together. And they say, let’s come up with a law that works where all the other ones have failed and actually make TikTok answer to its Chinese connections once and for all.

So tell me about this small group of lawmakers. Who are they?

So it starts with Republicans. Among them is Steve Scalise, one of the most powerful Republicans in the House. And a small group sort of works together for a few months, but they realize that in order to really make this law work, they’re going to need Democrats. So they end up working through this House Committee that’s focused on China and competition. And this is where the bulk of the work on this bill takes place.

And just to note, this is a really small group. There’s less than 20 key players who are working on this. And all throughout, they are keeping this very, very secret.

And why exactly are they keeping it secret? What’s the point of that?

So this group really wanted to keep this out of the eyes of TikTok, which has a huge lobbying presence in DC, and has successfully worked to kill bills that targeted TikTok in the past. And what they’re really doing is looking at all of the past efforts to either force a sale or a ban of TikTok, and trying to work their way through why those plans didn’t succeed and what they can do differently.

But while the lawmakers are working on this bill, something big happens that kind of changes the politics around it. And that’s October 7.

Your social media feeds are unique to you, but could they be shaping how you view the Israel-Gaza War? The BBC’s —

As the war breaks out and people start getting information about it, a lot of people are getting information about the Israel-Hamas War on TikTok, especially young people.

Social media algorithms seem to be driving some users towards increasingly divisive posts —

And there’s this big messy argument spilling out into living rooms and all over the internet, and, of course on TikTok. And it’s getting very heated. For instance, there’s this moment in the fall where a bunch of TikTok users start sharing this old manifesto.

I read Osama bin Laden’s “Letter to America.”

It’s wild, and everyone should read it.

Go read “A Letter to America.” Seriously, go read it.

That was actually written by Osama bin Laden, defending the 9/11 attacks and criticizing the United States’ support of Israel. TikTok users start trying to tie it to the modern day conflict.

Reading this letter, it becomes apparent to me that the actions of 9/11 were all just the buildup of our government failing other nations.

The way this letter is going viral right now is giving me the greatest sense of relief. Now it’s all coming to light because of Palestine.

And there’s these accusations that TikTok may be promoting one side of the conflict over the other. And a couple of researchers look at hashtags around Palestine, and they say that the data they pulled shows that TikTok is showing way more pro-Palestine videos and not so much for Israel. And this sets off huge alarm bells for this small group of lawmakers.

But isn’t that just a function of the fact that TikTok, at this point, is the public square in the United States for young people? I mean, this is what young people were talking about, and this is where they’re doing the talking.

TikTok has pushed back really forcefully against these accusations. They said that Gallup polls show that young people view Israel differently than older generations. They say that they’re not the ones influencing what people post, that the hashtags and the videos are a reflection of the user base and nothing that they’re doing to influence.

But for lawmakers, this doesn’t really make their concerns go away. Instead, this conflict shows them how TikTok could be used to spread propaganda. It made lawmakers feel that TikTok could be really dangerous when it comes to shaping the views of Americans on foreign policy, on US elections. And what it also does is, it provides this driving force to this group that’s drafting this bill. And they suddenly see that this might be a way to bring more people into their effort.

And so heading into November against this backdrop, they even bring in the White House and the Justice Department to help work on this bill. And with the White House, they want to make sure that this is a bill that the president will support. And they work with the Justice Department to shore up the language in the bill to help defend against court challenges.

Because the Justice Department, of course, would be the one that would have to defend the bill, right?

That’s exactly right. And so they’re trying to make it as rock solid as possible so that they can win in court when TikTok challenges this law. And so March rolls around, and they decide that it’s time to unveil this bill that they’ve been working on for close to a year.

The battle over TikTok on Capitol Hill is intensifying.

This morning, House lawmakers have agreed unanimously to move a bill to a full floor vote.

And TikTok is caught completely flat footed. They didn’t see this coming. And this is just what the group wanted. So TikTok has this army of lobbyists that’s suddenly scrambling. And they go into damage control mode. They start reaching out to members of Congress.

This app is so much more than just an app for dumb TikTok dances.

They fly a group of TikTok stars and small businesses to come to DC —

This is a life-changing apps.

— and lobby on the steps of the Capitol and meet with lawmakers.

Standing up here with all these amazing TikTokers behind me is a complete honor, and every single one of them would voice their opinion just like this. This is how we feel. This has to stop.

They set up interviews between these TikTok creators, as they’re known, and big TV shows and news programs. And they’re doing everything they can to fight against this bill before it goes any further. And then they decide to do something unusual, which is use TikTok itself to try and derail this bill.

How exactly do they do that?

So days after this bill is announced, a ton of TikTok users get a message when they open the TikTok app that basically says, call Congress and tell them not to ban TikTok.

Hmm. OK. So like, literally this window comes up and says, call Congress. Here you go.

Exactly. You can enter your zip code, and there’s a button that appears. And you can press it, and the call goes straight to your representative.

So offices are quickly overwhelmed by calls. And TikTok sent out this message to users on the same day that a House committee is going into vote on this bill and whether to move it forward. And so the stunt happens. They go into vote, and they come out, and it’s 50 to 0 in support of the bill.

One of the representatives who worked on the bill said that this stunt by TikTok turned a lot of no’s into yeses and yeses into, quote, “hell yeses.”

[LAUGHS]: so the whole episode sounds like it actually backfired, right? Like, TikTok’s stunt essentially just confirmed what was the deepest fears of lawmakers about this company, that the app could be used to influence American politics.

That’s definitely how a lot of lawmakers viewed it. And when this bill is brought to the full House a week later, it passes by an overwhelming majority. And weeks later, it passes in the Senate as part of a broader aid package. And on Wednesday, it’s signed into law by President Biden.

But now the question is, what does it mean? Like, how will this actually work? And how will it affect the tens of millions of Americans who use TikTok every day?

[MUSIC PLAYING]

We’ll be right back.

So Sapna, now that Biden has signed this bill, what does it actually mean in practice for TikTok? What does the law do?

So the law is really trying to push ByteDance, the company that owns TikTok, to sell to a non-Chinese owner. And the company basically has nine months for this sale to happen. There’s an option for President Biden to add another three months to that clock. And if the company doesn’t find a buyer or refuses to be sold, it will be banned.

And what would a ban actually mean, Sapna? I mean, people would still have the app on their phones, right? So it wouldn’t disappear overnight.

Yeah, no one’s coming to pick up your phone and to forcibly delete this thing. What the law says is that app stores and web hosting services wouldn’t be allowed to carry TikTok anymore. So basically, it would kind of die a slow death over time, where it wouldn’t be updated and just kind of peter out.

So the bottom line here is that the clock has started on this potential sale, right? They have 12 months to find a buyer. So what are the obstacles here? I mean, it sounds pretty ferociously complicated.

There’s a ton of challenges here. And it’s a very messy choose-your-own adventure. So one of the first big questions is who could buy this?

ByteDance and TikTok are private. We don’t know their financials. But analysts estimate that it will cost tens of billions of dollars. That narrows the buying pool pretty quickly. And a lot of the companies that could afford to buy it, like Meta, the owner of Facebook, or Google, which owns YouTube, would probably be kicked out of the running because they are simply too big. Regulators would say, you already own these big apps. You can’t possibly add this to your stable.

There’d be a monopoly concern there.

Exactly. And then, there’s a lot of questions around how this would work, technically. ByteDance and TikTok are very much global organizations. You have the CEO in Singapore. They have huge operations in Ireland. They have this big workforce in the US. And of course, they do have engineers in China.

So how do you extract all those things, make it all work? It’s a very big international transaction.

And then there is the chance that the Chinese government blocks this sale and says, you cannot do this, ByteDance. We will not allow it.

So Sapna, how does that actually work, though, in terms of China? I mean, can China actually just ban the sale of this company? I mean, it is a private company after all, right?

Well, here’s one way it could work. The Chinese government could block the export of TikTok’s algorithm. And let me explain that in kind of plain English.

They could basically block the technology that fuels the TikTokiness of this app, the recommendations, the magic of it, why you see what you see when you’re looking at TikTok.

TikTokiness, is that an adjective?

That is now an adjective.

[LAUGHS]: Nice.

And there’s a chance that Beijing could say, hey, you can’t export this technology. That is proprietary. And if that happens, that suddenly makes TikTok way, way less valuable.

So the Chinese government could let the sale go through potentially, but as a kind of an empty shell, right? The thing that makes TikTok TikTok, the algorithm, wouldn’t be part of the company. So that probably isn’t very appealing for a potential buyer.

Exactly. And I mean, the role of the Chinese government here is really interesting. I’ve talked to experts who say, well, if the Chinese government interferes to try and block a sale of this app, doesn’t that underscore and prove all the concerns that have been expressed by American lawmakers?

If you’re worried about China being in control of this thing, well, that just confirmed your fears.

Exactly. And I mean, it’s an interesting thing that ByteDance and TikTok have to grapple with.

So bottom line here — selling TikTok is quite complicated, and perhaps not even possible for these reasons that you’re giving, right? I mean, not least of which because the Chinese government might not allow the algorithm to leave the country. And that’s not something that the US Congress has a lot of control over.

So is this law fundamentally just a ban, then?

That’s what TikTok is calling it. Right after this bill was signed into law on Wednesday —

Make no mistake, this is a ban, a ban on TikTok and a ban on you and your voice.

— Tiktok’s CEO made a TikTok — what else?— that explained the company’s position.

Because the freedom of expression on TikTok reflects the same American values that make the United States a beacon of freedom.

He actually argued that TikTok reflects American values.

TikTok gives everyday Americans a powerful way to be seen and heard. And that’s why so many people have made TikTok part of their daily lives.

And he said that this law infringes on the First Amendment free speech rights of Americans who love it and who use it every day.

The facts and the Constitution are on our side, and we expect to prevail again.

So it’s very clear that TikTok plans to challenge this law in court. And the court fight to follow will determine the fate of TikTok’s future in the US.

So is that First Amendment argument that the TikTok CEO is making here going to work?

Nobody wants to put money on that. I mean, the company is really approaching this with the idea that the First Amendment rights of Americans are being infringed on. But if you remember, the government has been working on this law. They’ve been anticipating those challenges. And they can justify an infringing of First Amendment rights in certain cases, including with national security concerns. And so it’ll be up to a judge on whether those concerns pass muster and justify this sale and even a potential ban of TikTok.

Got it. So TikTok will argue free speech, First Amendment. And the government will counter by saying, look, this is about China. This is about America’s national security interests.

That’s right. And the legal experts that I’ve spoken with say this is a really big and sticky area of the law, and it’s a huge case. And they really think that this will go to the Supreme Court, regardless of who wins in the first round of this.

So where does that leave the millions of Americans who use TikTok, and many of them, of course, who earn a living on the platform?

I mean, it’s really uncertain what happens now with the company, and the clock has started ticking. When I’ve looked at TikTok and looked at videos from users —

This is about the impending TikTok ban. And it just triggered me so much. It makes my blood boil, and I have to get this out there.

There’s a lot of shock —

The most success I’ve had has been here on TikTok, and now they’re trying to take it away.

This is so stupid!

— and anger.

You can’t ban apps! You can’t ban things from people!

People are confused.

Word on the street is that in the next 9 to 12 months, TikTok could be banned.

And they’re also caught a bit off guard, just because there have been these years of efforts to do something about TikTok. People on the app have been hearing about a TikTok ban, really, since 2020.

The government can take away a literal app on our phones, and we’re supposed to believe we’re free?

A few TikTokers have said, how can this be the thing that the government is pushing through so quickly?

Can we stop funding a genocide? No. Can we get free COVID tests? No. Can we stop killing the planet? No. Can we at least watch videos on an app of people doing fun things and learn about the world around us? No.

So there’s this sense of distrust and disappointment for many people who love this app.

We got rid of TikTok. You’re welcome. Protecting you from China. You know that phone was made in China. Ah!

And I think there’s also this question, too, around what about TikTok makes it so harmful? Even though it has increasingly become a place for news, there’s plenty of people who simply use this app for entertainment. And what they’re seeing out of Washington just doesn’t square with the reality they experience when they pull out their phones.

And I wonder, Sapna, I mean, just kind of stepping back for a second, let’s say this ban on TikTok succeeds. If it goes through, would Americans be better off?

It depends who you ask. For the users who love TikTok, if it actually disappeared, it would be the government taking away a place where maybe they make money, where they get their entertainment, where they figure out what to read or what to cook next. To free speech advocates, this would be dystopian, unheard of for the government to crack down on an app with such wide usage by Americans.

But for the American political class And the National security establishment, this is a necessary move, one that was years in the making, not something that was just come up with on the fly. And ultimately, it all comes down to China and this idea that you can’t have a social media app like this, a source of news like this, that is even at all at risk of being influenced by the Chinese government and our greatest adversaries.

Sapna, thank you.

Here’s what else you should know today. On Monday, in its latest high-profile showdown with pro-Palestinian protesters, Columbia University gave students until 2:00 PM to clear out from an encampment at the center of campus or face suspension. It appeared to be an effort to remove the encampment without relying on New York City Police, whose removal of a previous encampment there two weeks ago inspired similar protests on campuses across the country.

Free Palestine!

Hi, this is Sharon Otterman reporting for “The New York Times.”

00 PM deadline for protesters to clear out of the encampment at the center of Columbia University has come and gone, and there’s still quite a large contingent inside the encampment.

But Monday’s warning seemed only to galvanize the Columbia protesters and their supporters.

And hundreds of students and others from around the campus have come out to support them. They are currently walking around in a picket around the encampment.

Hundreds of students, standing for or five people deep, encircled the encampment in a show of solidarity. They were joined by members of the Columbia faculty.

There’s also dozens of faculty members, who are prepared to stand in lines in front of the main entrance to the encampment, in case Public Safety or the NYPD move in. But as of 2:00, there was no sign of that happening.

Then, on Monday evening, Columbia announced it had begun to suspend students who had failed to leave the encampment. It was unclear exactly how many students had been suspended.

[PRO-PALESTINE CHANTING]:

Today’s episode was produced by Will Reid, Rachelle Banja, and Rob Szypko. It was edited by Marc Georges and Liz O. Baylen, contains original music by Dan Powell and Marion Lozano, and was engineered by Chris Wood. Our theme music is by Jim Brunberg and Ben Landsverk of Wonderly. Special thanks to Sharon Otterman.

That’s it for “The Daily.” I’m Sabrina Tavernise. See you tomorrow.

The Daily logo

  • May 1, 2024   •   35:16 The New Abortion Fight Before the Supreme Court
  • April 30, 2024   •   27:40 The Secret Push That Could Ban TikTok
  • April 29, 2024   •   47:53 Trump 2.0: What a Second Trump Presidency Would Bring
  • April 26, 2024   •   21:50 Harvey Weinstein Conviction Thrown Out
  • April 25, 2024   •   40:33 The Crackdown on Student Protesters
  • April 24, 2024   •   32:18 Is $60 Billion Enough to Save Ukraine?
  • April 23, 2024   •   30:30 A Salacious Conspiracy or Just 34 Pieces of Paper?
  • April 22, 2024   •   24:30 The Evolving Danger of the New Bird Flu
  • April 19, 2024   •   30:42 The Supreme Court Takes Up Homelessness
  • April 18, 2024   •   30:07 The Opening Days of Trump’s First Criminal Trial
  • April 17, 2024   •   24:52 Are ‘Forever Chemicals’ a Forever Problem?
  • April 16, 2024   •   29:29 A.I.’s Original Sin

Hosted by Sabrina Tavernise

Featuring Sapna Maheshwari

Produced by Will Reid ,  Rachelle Bonja and Rob Szypko

Edited by Marc Georges and Liz O. Baylen

Original music by Marion Lozano and Dan Powell

Engineered by Chris Wood

Listen and follow The Daily Apple Podcasts | Spotify | Amazon Music

American lawmakers have tried for years to ban TikTok, concerned that the video app’s links to China pose a national security risk.

Sapna Maheshwari, a technology reporter for The Times, explains the behind-the-scenes push to rein in TikTok and discusses what a ban could mean for the app’s 170 million users in the United States.

On today’s episode

speech to text app cost

Sapna Maheshwari , who covers TikTok, technology and emerging media companies for The New York Times.

With the U.S. Capitol building in the background, a group of people holding up signs are gathered on a lawn.

Background reading

A tiny group of lawmakers huddled in private about a year ago, aiming to bulletproof a bill that could ban TikTok.

The TikTok law faces court challenges, a shortage of qualified buyers and Beijing’s hostility .

Love, hate or fear it, TikTok has changed America .

There are a lot of ways to listen to The Daily. Here’s how.

We aim to make transcripts available the next workday after an episode’s publication. You can find them at the top of the page.

Special thanks to Sharon Otterman .

The Daily is made by Rachel Quester, Lynsea Garrison, Clare Toeniskoetter, Paige Cowett, Michael Simon Johnson, Brad Fisher, Chris Wood, Jessica Cheung, Stella Tan, Alexandra Leigh Young, Lisa Chow, Eric Krupke, Marc Georges, Luke Vander Ploeg, M.J. Davis Lin, Dan Powell, Sydney Harper, Mike Benoist, Liz O. Baylen, Asthaa Chaturvedi, Rachelle Bonja, Diana Nguyen, Marion Lozano, Corey Schreppel, Rob Szypko, Elisheba Ittoop, Mooj Zadie, Patricia Willens, Rowan Niemisto, Jody Becker, Rikki Novetsky, John Ketchum, Nina Feldman, Will Reid, Carlos Prieto, Ben Calhoun, Susan Lee, Lexie Diao, Mary Wilson, Alex Stern, Dan Farrell, Sophia Lanman, Shannon Lin, Diane Wong, Devon Taylor, Alyssa Moxley, Summer Thomad, Olivia Natt, Daniel Ramirez and Brendan Klinkenberg.

Our theme music is by Jim Brunberg and Ben Landsverk of Wonderly. Special thanks to Sam Dolnick, Paula Szuchman, Lisa Tobin, Larissa Anderson, Julia Simon, Sofia Milan, Mahima Chablani, Elizabeth Davis-Moorer, Jeffrey Miranda, Renan Borelli, Maddy Masiello, Isabella Anderson and Nina Lassam.

Sapna Maheshwari reports on TikTok, technology and emerging media companies. She has been a business reporter for more than a decade. Contact her at [email protected] . More about Sapna Maheshwari

Advertisement

The Federal Register

The daily journal of the united states government, request access.

Due to aggressive automated scraping of FederalRegister.gov and eCFR.gov, programmatic access to these sites is limited to access to our extensive developer APIs.

If you are human user receiving this message, we can add your IP address to a set of IPs that can access FederalRegister.gov & eCFR.gov; complete the CAPTCHA (bot test) below and click "Request Access". This process will be necessary for each IP address you wish to access the site from, requests are valid for approximately one quarter (three months) after which the process may need to be repeated.

An official website of the United States government.

If you want to request a wider IP range, first request access for your current IP, and then use the "Site Feedback" button found in the lower left-hand side to make the request.

IMAGES

  1. Best 8 Speech to Text Apps for Android and iPhone[2023]

    speech to text app cost

  2. Speech To Text Converter

    speech to text app cost

  3. 8 of the Best Text-to-Speech Apps for Android and iPhone

    speech to text app cost

  4. Speech to Text with Translator

    speech to text app cost

  5. 10 Best Text to Speech Apps

    speech to text app cost

  6. Free Get Realistic Voiceover By Best 7 Text-to-Speech APP

    speech to text app cost

VIDEO

  1. New: AI Text to Speech (Personal) Conversational Voices

  2. The Best Text to Speech Tool Powered by AI 2024 (Free Access Link Below)

  3. Azure AI: Text-to-Speech & Speech-to-Text

  4. Hindi text to Speech

  5. The most human-like text-to-speech AI 2023 (Approved)

  6. Convert Text to Speech with AI Voiceovers

COMMENTS

  1. Best speech-to-text app of 2024

    The best speech-to-text apps make it simple and easy to convert speech into text, for both desktop and mobile devices. Best speech-to-text app of 2024: Quick menu (Image credit: Shutterstock)

  2. Best Voice-to-Text Apps of 2024

    Whether you want to take notes, send quick messages, or translate on the fly, the best voice-to-text apps below are ready to help. Best Voice-to-Text Apps of 2024. Best Overall: Dragon Anywhere. Best Assistant: Google Assistant. Best Transcription: Transcribe. Best for Long Recordings: Speechnotes.

  3. Pricing

    Free. $0.078 / minute **. Speech Recognition (with data logging opt-in) Standard¹. Free. $0.016 / minute **. The prices in the table below apply to minutes of audio processed per month for the Speech-to-Text V2 API. Category. Models.

  4. The best dictation and speech-to-text software in 2024

    The best dictation software. Apple Dictation for free dictation software on Apple devices. Windows 11 Speech Recognition for free dictation software on Windows. Dragon by Nuance for a customizable dictation app. Google Docs voice typing for dictating in Google Docs. Gboard for a free mobile dictation app.

  5. Speech Recognition & Synthesis

    To use Google Speech-to-Text functionality on your Android device, go to Settings > Apps & notifications > Default apps > Assist App. Select Speech Recognition and Synthesis from Google as your preferred voice input engine. Speech Services powers applications to read the text on your screen aloud. For example, it can be used by: To use Google ...

  6. Free Speech to Text Converter

    Download the app for free. Create a podcast, a video, and all your social assets using Descript. ... Descript's speech-to-text software turns what you say into text in real time. Don't worry about filler words or mistakes; Descript makes it easy to find and remove those from both the generated text and recorded audio. Step 3.

  7. The 9 Best Speech-to-Text Apps in 2023 (Tried & Tested)

    MacWhisper is a transcription tool powered by Whisper. It's an automatic speech recognition (ASR) system developed by OpenAI, the same company that brought us ChatGPT. As OpenAI states on its website: Whisper is trained on 680,000 hours of multilingual and multitask supervised data collected from the web.

  8. Speech-to-Text AI: speech recognition and transcription

    Accurately convert voice to text in over 125 languages and variants using Google AI and an easy-to-use API.

  9. Speech To Text

    Improve clinical documentation. Medical doctors and practitioners can use Amazon Transcribe Medical to quickly and efficiently document clinical conversations into electronic health record (EHR) systems for analysis. The service is HIPAA- eligible and trained to understand medical terminology.

  10. The best speech-to-text software for 2022

    Dragon Anywhere. Amazon Transcribe. Braina Pro. Google Docs Voice Typing. The good news is that the best speech-to-text software doesn't have to cost an arm and a leg — or anything at all ...

  11. Azure AI Speech Pricing

    For Speech to Text and Speech Translation, usage is billed in one-second increments. For Text to Speech: usage is billed per character. Check the definition of character in the pricing note. For custom neural voice hosting: usage is billed per endpoint per second. Check details in the pricing note.

  12. 6 Best Speech-to-Text Apps for Seamless Transcriptions

    Also, not having any free transcription options for existing media can be a deal-breaker for creators who primarily record content on the fly. 4. Apple Dictation. Key features: No internet connection required (with Apple Silicon devices) Setting up Voice Control can add even more functionality to dictation.

  13. 7 Best speech to text apps for iPhone in 2024

    Price: Free (Premium Costs $16.99 per month) Download. Wrapping up! Well, this was my list of the best speech-to-text apps for iPhone and iPad, which are beneficial for both students and professionals. If you found the right speech-to-text app that suits your requirements, share your thoughts in the comments below! Read more:

  14. Free Speech to Text Online, Voice Typing & Transcription

    Speech to Text online notepad. Professional, accurate & free speech recognizing text editor. Distraction-free, fast, easy to use web app for dictation & typing. Speechnotes is a powerful speech-enabled online notepad, designed to empower your ideas by implementing a clean & efficient design, so you can focus on your thoughts.

  15. Amazon Transcribe Pricing

    Let's assume you want to subtitle 5,000 hours of live streaming content with Amazon Transcribe. In US East (N. Virginia), Tier 1 (T1) pricing of $0.024/minute applies to the first 250,000 minutes of transcriptions. Tier 2 pricing of $0.015/minute (38% discount to T1 pricing) applies to the next 750,000 minutes.

  16. Scraibe

    Good start but needs to become a share target good but can't share files from other apps to this app. Developer Response , Hey DoomBlah, thanks for your feedback, I'm happy to let you know that Scraibe now (version 1.2.6) supports importing files from other apps via the "Share" functionality!

  17. Pricing

    Save 46%. Buy Now. Everything in Basic +. Add teammates to your workspace. Team features: shared custom vocabulary; tag speakers; assign action items to teammates. Advanced search, export, and playback. 1200 monthly transcription minutes; 90 minutes per conversation. Import and transcribe 10* audio or video files per month.

  18. Best Speech-to-Text APIs in 2024

    8. Amazon Transcribe. Amazon Transcribe is offered as a part of the overall Amazon Web Services (AWS) platform. With similar features as Google and Microsoft's speech-to-text solutions, Amazon Transcribe offers good accuracy for pre-recorded audio, but poor accuracy for real-time streaming use cases.

  19. AI Speech To Text: Revolutionizing Transcription

    The top 5 best text to speech apps; Voice changer; Read my paper out loud; Text to speech on Amazon; Text to Speech on Apple Devices; ... The cost of using speech-to-text services can vary. Many providers offer a tiered pricing model based on usage, with some offering free tiers for startups or small-scale applications. Accessibility is also a ...

  20. Speech to Text

    Make spoken audio actionable. Quickly and accurately transcribe audio to text in more than 100 languages and variants. Customize models to enhance accuracy for domain-specific terminology. Get more value from spoken audio by enabling search or analytics on transcribed text or facilitating action—all in your preferred programming language.

  21. Pricing & Plans

    Growth. $4k-10k / year. Save up to 20%. With pre-paid credits for the year. Credits are redeemed against actual usage. Buy Now. Access all endpoints and public models at favorable discounts. Up to 100 concurrent requests for Deepgram speech-to-text models and 5 concurrent requests for Deepgram Whisper Cloud. Up to 2 concurrent requests and up ...

  22. Speechify Pricing: Free & Premium

    The basics for individuals. $69 per month / user. Buy Now. Everything in Free. Download as video, audio, or text. Video and audio Dubbing. Video and audio Transcription. 50 hours of voice generation per user/year. 12 hours of Dubbing per user/year.

  23. The 2 Best Dictation Softwares of 2024

    The best dictation tool for Windows PCs. Dragon Professional v16 is the most accurate dictation tool we tested for any operating system—but its hefty price tag is a lot to swallow. $699 from ...

  24. IBM Watson Speech to Text

    Our best-in-class AI, embedded within Watson Speech to Text, truly understands your customers. Customizable for your business. Train Watson Speech to Text on your unique domain language and specific audio characteristics. Protects your data. Enjoy the security of IBM's world-class data governance practices. Truly runs anywhere.

  25. Google Gemini: Everything you need to know about the new generative AI

    An additional endpoint, Gemini Pro Vision, can process text and imagery — including photos and video — and output text along the lines of OpenAI's GPT-4 with Vision model. Using Gemini Pro ...

  26. The Secret Push That Could Ban TikTok

    Sapna Maheshwari, a technology reporter for The Times, explains the behind-the-scenes push to rein in TikTok and discusses what a ban could mean for the app's 170 million users in the United States.

  27. Federal Register :: Nondiscrimination on the Basis of Disability

    These costs include one-time costs for familiarization with the requirements of the rule, the purchase of software to assist with remediation of web content or mobile apps, the time spent testing and remediating web content and mobile apps to comply with WCAG 2.1 Level AA, and elementary, secondary, and postsecondary education course content ...