Speech-to-text may seem like an unfamiliar concept, but it is actually quite useful for busy people. If you don't want to type long texts yourself, a voice-to-text conversion service is a good solution. The tools AnonyViet introduces below will help you save time and effort when you need to create a long text without typing.
Google Cloud Speech-to-Text
Google Cloud Speech-to-Text is an API that lets users send short, long, or live audio containing speech and convert it to text. Google has long been recognized for its industry-leading speech recognition quality and its ability to deliver thousands of different solutions, including Contact Center AI and Video Transcription.
This is Google's speech-to-text application, and it supports Vietnamese. You have two options: speak directly into the microphone, or upload an audio file for the application to convert into text. Google Speech-to-Text can also work out punctuation and break sentences quite well. Punctuation is important for accurate transcription and helps improve the accuracy of voice translation. Automatic Punctuation produces transcripts that mimic how the speaker might have written what they said. This improves the readability of the transcript and makes dictation much easier.
To convert voice to text, visit the Google Cloud Speech-to-Text page.
Input type: select Microphone to speak directly, or File Upload to upload an existing audio file.
Language: select the language spoken in the audio you want to convert to text.
Remember to check the Punctuation box so the application inserts punctuation for you.
Then click Start Now, and Google Cloud Speech-to-Text will analyze the audio and convert it into text.
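The same settings (language and punctuation) can also be sent to the API from code. The following is a minimal sketch in Python, not an official sample. It assumes the public v1 REST endpoint `speech:recognize`, an API key you have created in your own Google Cloud project, and a short WAV or FLAC file (for these formats the API can read the encoding and sample rate from the file header). The function names `build_recognize_body` and `recognize` are made up for this illustration.

```python
import base64
import json
import urllib.request

# v1 REST endpoint for synchronous recognition of short audio.
RECOGNIZE_URL = "https://speech.googleapis.com/v1/speech:recognize"


def build_recognize_body(audio_bytes, language_code="vi-VN", punctuation=True):
    """Build the JSON body: language and punctuation mirror the web demo options."""
    return {
        "config": {
            "languageCode": language_code,
            "enableAutomaticPunctuation": punctuation,
        },
        # Audio content is sent inline as base64 text.
        "audio": {"content": base64.b64encode(audio_bytes).decode("ascii")},
    }


def recognize(audio_bytes, api_key, **options):
    """Send the audio and join the top transcript of each result (needs network access)."""
    body = json.dumps(build_recognize_body(audio_bytes, **options)).encode("utf-8")
    request = urllib.request.Request(
        f"{RECOGNIZE_URL}?key={api_key}",  # api_key is your own key (assumption: key auth is enabled)
        data=body,
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request) as response:
        result = json.load(response)
    return " ".join(
        r["alternatives"][0]["transcript"] for r in result.get("results", [])
    )
```

To try it, read a short recording with `open("sample.wav", "rb").read()` (the file name is only an example) and pass the bytes to `recognize` together with your key. Longer recordings need the asynchronous variant of the API, which this sketch does not cover.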
Voice to Text with Google Docs
This is the method often used by people who use MacBooks or regularly edit documents in Google Docs. Google Docs has a built-in voice input feature. You can speak into the microphone and Google Docs will convert your voice into text. Quite good for those who, like me, are too lazy to type. To use it, visit Google Docs, log in to your Google account, and create a new document.
In the Tools menu, select Voice typing or press the keyboard shortcut Ctrl+Shift+S (Cmd+Shift+S on a Mac) to open the speech-to-text feature.
Now you can speak comfortably and Google will turn your speech into text. If you want to break a sentence, remember to say “period”, “comma”, and so on; Google will automatically convert these words into punctuation for you. I think this is the best speech-to-text tool available today.
After dictating, you can save the document as a Word file for offline editing.
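To make the idea of spoken punctuation concrete, here is a small Python sketch that replaces spoken commands with symbols in a plain transcript. It is only an illustration of the concept, not how Google Docs works internally; the command table `SPOKEN_PUNCTUATION` and the function `apply_spoken_punctuation` are hypothetical names chosen for this example.

```python
import re

# Hypothetical command table for illustration; Google Docs keeps its own list.
SPOKEN_PUNCTUATION = {
    "period": ".",
    "comma": ",",
    "question mark": "?",
    "exclamation point": "!",
}

# Longer commands come first so multi-word commands are matched as a whole.
_COMMANDS = sorted(SPOKEN_PUNCTUATION, key=len, reverse=True)
_PATTERN = re.compile(
    r"\s*\b(" + "|".join(re.escape(c) for c in _COMMANDS) + r")\b",
    re.IGNORECASE,
)


def apply_spoken_punctuation(text):
    """Replace each spoken command (and the space before it) with its symbol."""
    return _PATTERN.sub(lambda m: SPOKEN_PUNCTUATION[m.group(1).lower()], text)


print(apply_spoken_punctuation("hello comma how are you question mark"))
# prints: hello, how are you?
```

A simple table like this cannot tell a command from an ordinary word, so a sentence that genuinely contains the word "period" would be changed as well. A real dictation tool needs more context to decide.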
FPT.AI converts audio into pure Vietnamese text
FPT.AI is an artificial intelligence tool from FPT, a Vietnamese company, so its transcriptions of Vietnamese are quite accurate. The tool has free and commercial versions. For small needs, you can use the free version, which covers 60 minutes of audio.
Create an account and log in at https://console.fpt.ai/getting-started
Then select the Speech to Text feature.
Next, in the API field, type anything and click OK.
In the main interface, choose Record or Upload file so the application can convert the voice into text.
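If you would rather call FPT.AI from code than use the console, the request can be sketched roughly as below. This is an assumption-heavy example: the endpoint URL `https://api.fpt.ai/hmi/asr/general` and the `api_key` header name are what I believe the service uses, but please verify both against the current FPT.AI documentation before relying on them. The function names are invented for this sketch, and the response is returned as raw JSON because its exact shape is not described in this article.

```python
import json
import urllib.request

# Assumed endpoint; confirm it in the FPT.AI console documentation.
FPT_ASR_URL = "https://api.fpt.ai/hmi/asr/general"


def build_fpt_request(audio_bytes, api_key, url=FPT_ASR_URL):
    """Prepare a POST request that carries the raw audio bytes as the body."""
    return urllib.request.Request(
        url,
        data=audio_bytes,
        headers={"api_key": api_key},  # assumed header name for the API key
        method="POST",
    )


def transcribe(audio_bytes, api_key):
    """Send the audio and return the parsed JSON response (needs network access)."""
    with urllib.request.urlopen(build_fpt_request(audio_bytes, api_key)) as response:
        return json.load(response)
```

Keeping request construction separate from sending makes it easy to inspect what would be sent before spending any of the free minutes.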
Of the three speech-to-text tools above, I like Google Docs voice typing the most because it is free and convenient and comes with a built-in text editor. However, if you want to use speech-to-text for business purposes, such as a virtual switchboard assistant or transcribing customer calls to analyze customer needs, you should use FPT.AI.
Frequently asked questions
Can I use these speech-to-text tools on my mobile phone?
Most of the tools mentioned, including Google Cloud Speech-to-Text and Google Docs, can be accessed and used on mobile phones via a web browser or mobile app.
Which tool is best for quick and easy note-taking?
Google Docs Voice Typing is great for quick and easy note-taking thanks to its convenience and built-in text editor.
Is there a better tool for commercial purposes, such as customer call analytics?
FPT.AI is a better choice for commercial purposes because it offers more professional features and is suitable for processing large amounts of audio data.












