What freeware do you use to convert a relatively large (10-page) PDF text file to an audio sound lfile (MP3 is fine, as is WAV) for free on Windows?
What freeware do you use to convert a relatively large (10-page) PDF text file to an audio sound lfile (MP3 is fine, as is WAV) for free on Windows?
On Tue, 23 Jul 2024 17:43:39 -0400, Larry Wolff wrote:
What freeware do you use to convert a relatively large (10-page) PDF text
file to an audio sound lfile (MP3 is fine, as is WAV) for free on
Windows?
You can likely find online programs but that would violate your privacy.
I don't think it's possible to create an MP3 or WAV offline from PDF text.
If it was, there would be programs that you could download that do it.
On Tue, 23 Jul 2024 17:43:39 -0400, Larry Wolff wrote:
What freeware do you use to convert a relatively large (10-page) PDF text
file to an audio sound lfile (MP3 is fine, as is WAV) for free on Windows?
You can likely find online programs but that would violate your privacy.
I don't think it's possible to create an MP3 or WAV offline from PDF text.
If it was, there would be programs that you could download that do it.
On 7/24/2024 1:58 PM, david wrote:
On Tue, 23 Jul 2024 17:43:39 -0400, Larry Wolff wrote:
What freeware do you use to convert a relatively large (10-page) PDF text >>> file to an audio sound lfile (MP3 is fine, as is WAV) for free on Windows? >>You can likely find online programs but that would violate your privacy.
I don't think it's possible to create an MP3 or WAV offline from PDF text. >> If it was, there would be programs that you could download that do it.
Even extracting the text can be tricky. Some PDFs are actually
just storing images, so OCR is necessary first. Even when text
is stored, it's not stored as character encoding but rather as
vector images. Which is why even the best PDF text extractors
will do things like converting u to ii or converting d to cl.
Converting text to WAV once extracted or selected and copied,
however, is very simple with the Windows SAPI5 (speech API) libraries.
SAPI5 has been available since XP.
On 7/24/2024 3:13 PM, Newyana2 wrote:
On 7/24/2024 1:58 PM, david wrote:
On Tue, 23 Jul 2024 17:43:39 -0400, Larry Wolff wrote:
What freeware do you use to convert a relatively large (10-page) PDF text >>>> file to an audio sound lfile (MP3 is fine, as is WAV) for free on Windows? >>>You can likely find online programs but that would violate your privacy. >>>
I don't think it's possible to create an MP3 or WAV offline from PDF text. >>> If it was, there would be programs that you could download that do it.
Even extracting the text can be tricky. Some PDFs are actually
just storing images, so OCR is necessary first. Even when text
is stored, it's not stored as character encoding but rather as
vector images. Which is why even the best PDF text extractors
will do things like converting u to ii or converting d to cl.
Converting text to WAV once extracted or selected and copied,
however, is very simple with the Windows SAPI5 (speech API) libraries.
SAPI5 has been available since XP.
I think the PDF to TXT, you need to stop at that point
and edit the TXT file, before the TXT to WAV step.
Too many things could have gone wrong at that point,
and 200 pages of TXT is bound to be unusable as such.
It's going to need edits and removals. At a minimum.
On 7/24/2024 1:58 PM, david wrote:
On Tue, 23 Jul 2024 17:43:39 -0400, Larry Wolff wrote:
What freeware do you use to convert a relatively large (10-page) PDF text >>> file to an audio sound lfile (MP3 is fine, as is WAV) for free on
Windows?
You can likely find online programs but that would violate your privacy.
I don't think it's possible to create an MP3 or WAV offline from PDF text. >> If it was, there would be programs that you could download that do it.
Even extracting the text can be tricky. Some PDFs are actually
just storing images, so OCR is necessary first. Even when text
is stored, it's not stored as character encoding but rather as
vector images. Which is why even the best PDF text extractors
will do things like converting u to ii or converting d to cl.
What freeware do you use to convert a relatively large (10-page) PDF text >>>> file to an audio sound lfile (MP3 is fine, as is WAV) for free on
Windows?
You can likely find online programs but that would violate your privacy. >>>
I don't think it's possible to create an MP3 or WAV offline from PDF text. >>> If it was, there would be programs that you could download that do it.
Even extracting the text can be tricky. Some PDFs are actually
just storing images, so OCR is necessary first. Even when text
is stored, it's not stored as character encoding but rather as
vector images. Which is why even the best PDF text extractors
will do things like converting u to ii or converting d to cl.
Incorrect. Text is stored as characters(glyphs), not vector images. Though sometimes it can be layed out in interesting orders. IDK about Windows, but pdftotext and similar on Linux will extract the text. With that said,
the text may have defects as you describe if it was the result of OCR.
But if it is a PDF generated from a word processor or such, the text will
be perfect, just as it was in the word processor.
The problem is converting that spoken words to an audio file.
I searched far and wide, and while I found lots of online clickbait, I
can't yet find a freeware Windows 10 program to convert it to audio.
That's why I had asked for help.
But nobody seems to know the answer any better than I do.
I guess that means freeware Windows 10 offline text to audio file
conversion tools probably do not exist since nobody can find any.
The problem is converting that spoken words to an audio file.
I searched far and wide, and while I found lots of online clickbait, I
can't yet find a freeware Windows 10 program to convert it to audio.
That's why I had asked for help.
But nobody seems to know the answer any better than I do.
I guess that means freeware Windows 10 offline text to audio file
conversion tools probably do not exist since nobody can find any.
I gave you the answer. Windows can do it through the
speech API, using the very simple script that I posted. All
you need to do is give it a TXT file by copying the text out
of the PDF.
The PDFs I have come across perfectly as text so that's not the problem.
The text can be spoken to English on Windows also, so that's not a problem.
The problem is converting that spoken words to an audio file.
I searched far and wide, and while I found lots of online clickbait, I
can't yet find a freeware Windows 10 program to convert it to audio.
On 7/25/2024 4:40 PM, Newyana2 wrote:
The problem is converting that spoken words to an audio file.
I searched far and wide, and while I found lots of online clickbait, I
can't yet find a freeware Windows 10 program to convert it to audio.
That's why I had asked for help.
But nobody seems to know the answer any better than I do.
I guess that means freeware Windows 10 offline text to audio file
conversion tools probably do not exist since nobody can find any.
I gave you the answer. Windows can do it through the
speech API, using the very simple script that I posted. All
you need to do is give it a TXT file by copying the text out
of the PDF.
Thanks for that VBS script as it probably would work for nerdy people.
But I'm trying to help non-nerds who won't be running visual basic.
What I want is what everyone wants which is a program that anyone can download and just run with an input file and an output file without having
to install Microsoft Visual Basic, Pascal, Docker, or other crutches.
What is the setup required to add visual basic to a PC to run your script?
On 7/25/2024 4:40 PM, Newyana2 wrote:
The problem is converting that spoken words to an audio file.
I searched far and wide, and while I found lots of online clickbait, I
can't yet find a freeware Windows 10 program to convert it to audio.
That's why I had asked for help.
But nobody seems to know the answer any better than I do.
I guess that means freeware Windows 10 offline text to audio file
conversion tools probably do not exist since nobody can find any.
I gave you the answer. Windows can do it through the
speech API, using the very simple script that I posted. All
you need to do is give it a TXT file by copying the text out
of the PDF.
Thanks for that VBS script as it probably would work for nerdy people.
But I'm trying to help non-nerds who won't be running visual basic.
What I want is what everyone wants which is a program that anyone can download and just run with an input file and an output file without having
to install Microsoft Visual Basic, Pascal, Docker, or other crutches.
What is the setup required to add visual basic to a PC to run your script?
On Wed, 24 Jul 2024 15:13:59 -0400, Newyana2 wrote:On 7/25/2024 10:29 AM, Joe Beanfish wrote:
On 7/24/2024 1:58 PM, david wrote:
On Tue, 23 Jul 2024 17:43:39 -0400, Larry Wolff wrote:
Even extracting the text can be tricky. Some PDFs are actually
just storing images, so OCR is necessary first. Even when text
is stored, it's not stored as character encoding but rather as
vector images. Which is why even the best PDF text extractors
will do things like converting u to ii or converting d to cl.
Incorrect. Text is stored as characters(glyphs), not vector images.
On Thu, 25 Jul 2024 16:59:30 -0400, Larry Wolff wrote:
The PDFs I have come across perfectly as text so that's not the problem.
The text can be spoken to English on Windows also, so that's not a problem. >>
The problem is converting that spoken words to an audio file.
I searched far and wide, and while I found lots of online clickbait, I
can't yet find a freeware Windows 10 program to convert it to audio.
That's the sort of thing Audacity can do, if I'm not mistaken.
<https://www.audacityteam.org/>
On Fri, 7/26/2024 1:00 AM, Stan Brown wrote:
[quoted text muted]
That's the sort of thing Audacity can do, if I'm not mistaken.
<https://www.audacityteam.org/>
Audacity is mainly an audio editor. It takes an audio input
file, and outputs an audio file on output. For example,
you can "normalize" or "compand" audio, interact with the
amplitude. Or the frequency response.
On Fri, 26 Jul 2024 11:38:22 -0400, Paul wrote:
On Fri, 7/26/2024 1:00 AM, Stan Brown wrote:
[quoted text muted]
That's the sort of thing Audacity can do, if I'm not mistaken.
<https://www.audacityteam.org/>
Audacity is mainly an audio editor. It takes an audio input
file, and outputs an audio file on output. For example,
you can "normalize" or "compand" audio, interact with the
amplitude. Or the frequency response.
Your knowledge is greater than mine (I mean that honestly, not as
sarcasm), but there's that red "record" button on Audacity's console.
I used it to create short snippets in my voice that I then save as
AAC files (.m4A extension). I would think that the user changing
settings in Volume Mixer, would let Audacity record sounds generated
buy other software on the computer.
Incidentally, the problem I was trying to solve was to create "back announcements" so that I wouldn't have to get up while listening to
something in my randomized playlist and go over to the iPod to see
what was playing. My files played fine in iTunes on my PC, but the
iPod skipped most of my test announcements and also skipped all or
most files that were immediately before or after one of my
announcements.
On 7/25/2024 10:29 AM, Joe Beanfish wrote:
On Wed, 24 Jul 2024 15:13:59 -0400, Newyana2 wrote:On 7/25/2024 10:29 AM, Joe Beanfish wrote:
On 7/24/2024 1:58 PM, david wrote:
On Tue, 23 Jul 2024 17:43:39 -0400, Larry Wolff wrote:
Even extracting the text can be tricky. Some PDFs are actually
just storing images, so OCR is necessary first. Even when text
is stored, it's not stored as character encoding but rather as
vector images. Which is why even the best PDF text extractors
will do things like converting u to ii or converting d to cl.
Incorrect. Text is stored as characters(glyphs), not vector images.
Glyph just means shape. The shape must be encoded somehow.
Isn't it encoded as a vector image? There are only two methods
I'm aware of. A raster image is a map of pixel values. A vector image
is a math formula. The latter can be losslessly enlarged because
they're shapes rather than point data. My understanding is that PDFs
are using vector encoding, which is why they can be enlarged
without losing definition.
If that's not true then perhaps you could point to a link. I'd be
curious to understand better how it works.
This makes a difference
because if it's a vector image shape then OCR software might be
the best way to extract the text. Stored text, on the other hand, is
not shapes but rather numbers. For example, in plain ASCII, ANSI,
or UTF-8 text, a byte value of 65 represents "A". Binary data that
directly represents characters would translate perfectly to text. But
I don't think PDFs are storing it that way. First, because fonts must
be stored in the file. Second because PDF converters often make
visual/shape errors, like seeing "u" as "ii".
What freeware do you use to convert a relatively large (10-page) PDF text file to an audio sound lfile (MP3 is fine, as is WAV) for free on Windows?
Sysop: | Keyop |
---|---|
Location: | Huddersfield, West Yorkshire, UK |
Users: | 465 |
Nodes: | 16 (2 / 14) |
Uptime: | 62:59:58 |
Calls: | 9,410 |
Calls today: | 2 |
Files: | 13,572 |
Messages: | 6,100,657 |