Text to speech with natural sounding voices. Supports PDF, word, ebooks, webpages, Convert text to audio files. Oct 07, 2018 More Voices For Balabolka Voice Packs. Balabolka (version 2.14) You can view all the voices available on the computer by following Control Panel - Speech - Speech Properties - Text To Speech - Voice selection. Windows 10 has the new voices Microsoft Mark Mobile and Microsoft Zira Mobile. 5 days ago - Balabolka is a Text-To-Speech (TTS) program.
What is SAPI? |
The Speech API (SAPI) is an application programming interface developed by Microsoft to allow the use of speech synthesis within Windows applications. The SAPI provides a high-level interface between an application and speech engines. Text-To-Speech software synthesizes text strings and files into spoken audio using synthetic voices. |
Where can I get SAPI 4? |
To use SAPI 4 voices, download and install the redistributable Microsoft Speech API 4.0 file. Also, you may download and install the Microsoft Speech Control Panel; the Speech Control Panel will add an icon to your Control Panel to enable you to list the compatible text-to-speech engines installed on your system and customize their settings for your use. |
Where can I get SAPI 5? |
Windows XP (and later versions) comes with SAPI 5. |
Balabolka Voices
How do I know what Text-To-Speech voices have been installed on the computer? |
You can view all the voices available on the computer by following Control Panel->Speech->Speech Properties->Text To Speech->Voice selection. |
Windows 10 has the new voices Microsoft Mark Mobile and Microsoft Zira Mobile, but they are not available in text-to-speech software. Is it possible to unlock the new voices? |
By default, the Microsoft mobile voice is locked for using in text-to-speech software via SAPI 5. You can unlock it with a simple registry tweak. Download the archive, extract the file for your language and for your version of the operating system ('mobile_x86.reg' for 32-bit and 'mobile_x64.reg' for 64-bit), click the right mouse button on the file's name and choose the context menu item 'Merge'. The Microsoft mobile voice will appear in the list of the available voices in Balabolka. |
My computer is running the 64-bit version of Windows. I have installed the 64-bit Runtime package for Microsoft Speech Platform and the English voices. But the list of available voices is still empty. Where is a problem? |
Balabolka is the 32-bit application. You need to install the 32-bit Runtime package for Microsoft Speech Platform also. |
What is Google Text-To-Speech? |
In Google Translate you can find a 'Listen' button that converts text to speech. After pressing of this button a browser starts to download MP3 file. The service supports converting to speech texts not longer than 100 symbols. Balabolka allows to divide big text on small parts, create an audio file for each part and merge them together. |
How do I change a voice during reading aloud? |
Use the XML tags. It is recommended to write the opening and closing tags inside the same paragraph. For example: Hello, how are you? I am good. |
I'm getting the error message 'OLE error 80045042'. What does that mean? |
This error means: 'The XML parser failed due to bad syntax.' You must verify the syntax of the XML tags inside the text. Or, some part of the text looks like the start of an XML tag, and it is confusing SAPI. If you don't want to use XML tags, remove the symbols '<' and '>' from the text (or replace them by words 'less than' and 'greater than'). |
How can I make a silent installation of Balabolka? |
To install Balabolka in silent mode without dialogs you may use a silent installation. A silent installation is available by using -silent command-line switch. |
My computer contains two sound cards. How can I choose an audio device for text-to-speech playback? |
You may select Speech in Control Panel of Windows and click the Audio Output button in the Speech Properties dialog box. Also, Balabolka contains the main menu item 'Options | Audio Output'. |
I open a DjVu file, but the program doesn't show any text. What is wrong? |
DjVu format was designed to store scanned documents. A DjVu file contains images of pages for books, magazines, etc. Also, DjVu can contain an OCR text layer. Balabolka can extract data from a text layer of DjVu only. If such layer is not available, the only way to get text is to use a text recognition system (for example, FineReader). |
How can I remove dashes in the beginning of paragraphs? |
You should use the main menu item 'Edit | Replace'. Type ^p— in the Find what box and ^p in the Replace with box. The program allows to use most of special codes from Microsoft Word:
|
I am not satisfied, how Balabolka extracts text from PDF files. Can I use the other way for processing of PDF? |
The process of text extracting from PDF files is complicated enough, because PDF files do not contain plain text. You may use an external command-line utility for text extracting: for example, the program pdftotext.exe from the Xpdf project. Copy pdftotext.exe to the subfolder 'utils' in the folder of Balabolka, choose the main menu item 'Options | Text Import', the tab 'Custom Text Import', and click the Add button. Define the command for using of pdftotext.exe: %BFolder%utilspdftotext.exe -q -nopgbrk -enc UTF-8 %Input% %Output% Define the name of the converter (for example, 'Xpdf Converter'), file extension ('PDF') and output encoding ('UTF-8'). After activating of the option Use instead of default extracting method for this file type the program will be able to use the external utility for text extracting from PDF files. |
How can every line of a text file be converted to a separate audio file? |
You may add two empty lines after every line with text in the document. Open the file in Balabolka, choose the main menu item 'Edit | Replace', type ^p in the Find what box and ^p^p^p in the Replace with box, click the Replace all button. Choose the main menu item 'File | Split and Convert to Audio Files'; choose the split method 'by two empty lines in succession' and click the Split and Convert button. |
Can I use the first line of text as the name of the audio file (for example, '01 Chapter One.mp3', '02 Chapter Two.mp3')? |
Type %FirstLine% in the 'Base Output Filename' box in the window 'Split and Convert to Audio Files'. The application will replace this variable by the first line of each text part. To change the position of the sequence number inside the filename, use the %Number% variable. |
What is Spritz-Reader? |
Spritz is a a new speed-reading technology. It allows to speed reading rates anywhere from 100 to 1000 words per minute. The time consuming part of usual reading lies mainly in the actual eye movements from word to word and sentence to sentence. Spritz positions words in a spot on a display where you can recognize the word, without moving your eyes. |
What is the IFilter interface? |
The IFilter interface works with documents. It provides filters to extract information from proprietary file formats. Full-text search engines call the methods of the IFilter interface. IFilters are available for Adobe PDF, WordPerfect and many other popular file formats. You can get IFilter installers from respective vendors. IFilter.org is a good place to get started. Note that because Balabolka is the 32-bit application you may need to install the 32-bit version of IFilters. |
Can I use media files generated with Balabolka in YouTube videos? |
Balabolka is freeware, so you may create audio files for free. But if you use the commercial voices, you need to contact the developers of the voice and purchase the license for commercial use/audio broadcast. The audio distribution license will allow you to use speech in YouTube videos. For example, the information from the Cepstral web-site: 'Cepstral Personal voices are for personal use only and are NOT licensed for audio distribution. This means the audio you create is for your use only and cannot be shared with others or used in videos, presentations, or webpages. If you are interested in an audio distribution license, please contact sales.' |
Is it possible to create a video file in Balabolka (containing speech as audio data and synchronized text as video data)? |
No, Balabolka doesn't have such option. But you can get the similar result in another way. My application allows to create files in SRT and LRC formats. Use the main menu item 'Options | Audio Files' in Balabolka. On the tab 'LRC, SRT' you may define settings for subtitle files. The application will convert text to an audio file (for example, FILE.MP3) and create a subtitle file for this audio file (for example, FILE.SRT).
|
I am creating a 3D character, and it is necessary to generate a lip animation synchronized with audio generated by a text-to-speech engine. Can you help me? |
When a speech engine reads aloud, it generates information about visemes. A viseme is the basic visual unit of speech that represents the position of the mouth and face when pronouncing a phoneme. SAPI 5 supports the list of 21 visemes. The console application of Balabolka contains the command line parameter that allows to generate the output text file with visemes. The application will create the audio file and then read it aloud to get visemes and their timecodes. |
So far we've covered the basics of Balabloka. Let's look at some of the extra things that can be done with Balabolka.
The speech engine for Balabolka relies on an API (Application Program Interface) built into the Windows Operating System. The most current Speech API is SAPI 5. SAPI4 and its voices could be installed on the computer but the voices are not as high quality.
Microsoft David and Microsoft Zira are the default US English voices in Windows 10. Windows 7 had Microsoft Anna 64 bit. There are other voices which can be installed, some free and some purchased. The bit rate, either 32 or 64 bit needs to be matched with the operating system bit rate. Otherwise, the 64-bit programs might not be able to access 32bit voice.
The voice in Balabolka can be changed as we've seen, by going to the Menu option for Voice, selecting voice and selecting from the list of recognized voices. This will change the speaking voice. There is a way to change the voice and its properties for selected lines. You can alternate between male and female voices.
Because SAPI5 allows the use of XML tags, there are a number of things we can change including a different voice, Volume, Rate, Pitch, Emph, and Spell.
Balabolka Woman Voices Song
How do I change a voice during reading aloud? |
Use the XML tags. It is recommended to write the opening and closing tags inside the same paragraph. For example: Hello, how are you? I am good. |
I'm getting the error message 'OLE error 80045042'. What does that mean? |
This error means: 'The XML parser failed due to bad syntax.' You must verify the syntax of the XML tags inside the text. Or, some part of the text looks like the start of an XML tag, and it is confusing SAPI. If you don't want to use XML tags, remove the symbols '<' and '>' from the text (or replace them by words 'less than' and 'greater than'). |
How can I make a silent installation of Balabolka? |
To install Balabolka in silent mode without dialogs you may use a silent installation. A silent installation is available by using -silent command-line switch. |
My computer contains two sound cards. How can I choose an audio device for text-to-speech playback? |
You may select Speech in Control Panel of Windows and click the Audio Output button in the Speech Properties dialog box. Also, Balabolka contains the main menu item 'Options | Audio Output'. |
I open a DjVu file, but the program doesn't show any text. What is wrong? |
DjVu format was designed to store scanned documents. A DjVu file contains images of pages for books, magazines, etc. Also, DjVu can contain an OCR text layer. Balabolka can extract data from a text layer of DjVu only. If such layer is not available, the only way to get text is to use a text recognition system (for example, FineReader). |
How can I remove dashes in the beginning of paragraphs? |
You should use the main menu item 'Edit | Replace'. Type ^p— in the Find what box and ^p in the Replace with box. The program allows to use most of special codes from Microsoft Word:
|
I am not satisfied, how Balabolka extracts text from PDF files. Can I use the other way for processing of PDF? |
The process of text extracting from PDF files is complicated enough, because PDF files do not contain plain text. You may use an external command-line utility for text extracting: for example, the program pdftotext.exe from the Xpdf project. Copy pdftotext.exe to the subfolder 'utils' in the folder of Balabolka, choose the main menu item 'Options | Text Import', the tab 'Custom Text Import', and click the Add button. Define the command for using of pdftotext.exe: %BFolder%utilspdftotext.exe -q -nopgbrk -enc UTF-8 %Input% %Output% Define the name of the converter (for example, 'Xpdf Converter'), file extension ('PDF') and output encoding ('UTF-8'). After activating of the option Use instead of default extracting method for this file type the program will be able to use the external utility for text extracting from PDF files. |
How can every line of a text file be converted to a separate audio file? |
You may add two empty lines after every line with text in the document. Open the file in Balabolka, choose the main menu item 'Edit | Replace', type ^p in the Find what box and ^p^p^p in the Replace with box, click the Replace all button. Choose the main menu item 'File | Split and Convert to Audio Files'; choose the split method 'by two empty lines in succession' and click the Split and Convert button. |
Can I use the first line of text as the name of the audio file (for example, '01 Chapter One.mp3', '02 Chapter Two.mp3')? |
Type %FirstLine% in the 'Base Output Filename' box in the window 'Split and Convert to Audio Files'. The application will replace this variable by the first line of each text part. To change the position of the sequence number inside the filename, use the %Number% variable. |
What is Spritz-Reader? |
Spritz is a a new speed-reading technology. It allows to speed reading rates anywhere from 100 to 1000 words per minute. The time consuming part of usual reading lies mainly in the actual eye movements from word to word and sentence to sentence. Spritz positions words in a spot on a display where you can recognize the word, without moving your eyes. |
What is the IFilter interface? |
The IFilter interface works with documents. It provides filters to extract information from proprietary file formats. Full-text search engines call the methods of the IFilter interface. IFilters are available for Adobe PDF, WordPerfect and many other popular file formats. You can get IFilter installers from respective vendors. IFilter.org is a good place to get started. Note that because Balabolka is the 32-bit application you may need to install the 32-bit version of IFilters. |
Can I use media files generated with Balabolka in YouTube videos? |
Balabolka is freeware, so you may create audio files for free. But if you use the commercial voices, you need to contact the developers of the voice and purchase the license for commercial use/audio broadcast. The audio distribution license will allow you to use speech in YouTube videos. For example, the information from the Cepstral web-site: 'Cepstral Personal voices are for personal use only and are NOT licensed for audio distribution. This means the audio you create is for your use only and cannot be shared with others or used in videos, presentations, or webpages. If you are interested in an audio distribution license, please contact sales.' |
Is it possible to create a video file in Balabolka (containing speech as audio data and synchronized text as video data)? |
No, Balabolka doesn't have such option. But you can get the similar result in another way. My application allows to create files in SRT and LRC formats. Use the main menu item 'Options | Audio Files' in Balabolka. On the tab 'LRC, SRT' you may define settings for subtitle files. The application will convert text to an audio file (for example, FILE.MP3) and create a subtitle file for this audio file (for example, FILE.SRT).
|
I am creating a 3D character, and it is necessary to generate a lip animation synchronized with audio generated by a text-to-speech engine. Can you help me? |
When a speech engine reads aloud, it generates information about visemes. A viseme is the basic visual unit of speech that represents the position of the mouth and face when pronouncing a phoneme. SAPI 5 supports the list of 21 visemes. The console application of Balabolka contains the command line parameter that allows to generate the output text file with visemes. The application will create the audio file and then read it aloud to get visemes and their timecodes. |
So far we've covered the basics of Balabloka. Let's look at some of the extra things that can be done with Balabolka.
The speech engine for Balabolka relies on an API (Application Program Interface) built into the Windows Operating System. The most current Speech API is SAPI 5. SAPI4 and its voices could be installed on the computer but the voices are not as high quality.
Microsoft David and Microsoft Zira are the default US English voices in Windows 10. Windows 7 had Microsoft Anna 64 bit. There are other voices which can be installed, some free and some purchased. The bit rate, either 32 or 64 bit needs to be matched with the operating system bit rate. Otherwise, the 64-bit programs might not be able to access 32bit voice.
The voice in Balabolka can be changed as we've seen, by going to the Menu option for Voice, selecting voice and selecting from the list of recognized voices. This will change the speaking voice. There is a way to change the voice and its properties for selected lines. You can alternate between male and female voices.
Because SAPI5 allows the use of XML tags, there are a number of things we can change including a different voice, Volume, Rate, Pitch, Emph, and Spell.
Balabolka Woman Voices Song
Balabolka Best Voice
For example the XML tag to change the voice is . The placeholder, voice_name, is replaced by the full name of the voice, such as Microsoft David Desktop. The completed tag is placed before the line you want the voice to speak. The full tag will look like this:
The video below shows how this will work.
Balabolka Woman Voices Youtube
Have fun.