Forum: >>> Magnum BBS <<<

How to change a voice's pitch in real time?

From Piotr Mancini@21:1/5 to All on Sun Jul 12 20:58:26 2020

I just learned how to convert an audioclip from a 33.3 rpm vinyl record to 78 rpm, here:

https://community.adobe.com/t5/audition/converting-33-recording-to-78-how/td-p/9448709?page=1

What I need is similar but probably harder. I am developing a web application based on a segment from a 2013 TV program. Only the first seconds are relevant:

https://www.youtube.com/watch?v=8MF04X2aLBw

The clone of that segment is an interactive application, under construction, seen below. Notice how the user has two degrees of freedom: the vector's magnitude and its angle.

http://www.dealey-plaza.org/this-government-as-promised/SBT-MBT-Tools/Haags-Measurement-Tool/

I already have the code that will receive those two variables as the user drags the mouse around. The effects will be:

- Vector Magnitude changes: the voice volume increases/decreases. This is most likely the easy part.

- Vector Angle changes: as it is modified the pitch (listen to the two extremes attached) will vary.

What I need is the back-end part (library, etc).

The question is actually more general than simple signal intensity and frequency. I need to "modulate" a signal based on user's activity. Any recommendations are welcome.

TIA,

-Ramon F. Herrera
JFK Numbers

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From Piotr Mancini@21:1/5 to Piotr Mancini on Mon Jul 13 12:57:15 2020

On Sunday, July 12, 2020 at 10:58:29 PM UTC-5, Piotr Mancini wrote:

I just learned how to convert an audioclip from a 33.3 rpm vinyl record to 78 rpm, here:

https://community.adobe.com/t5/audition/converting-33-recording-to-78-how/td-p/9448709?page=1

What I need is similar but probably harder. I am developing a web application based on a segment from a 2013 TV program. Only the first seconds are relevant:

https://www.youtube.com/watch?v=8MF04X2aLBw

The clone of that segment is an interactive application, under construction, seen below. Notice how the user has two degrees of freedom: the vector's magnitude and its angle.

http://www.dealey-plaza.org/this-government-as-promised/SBT-MBT-Tools/Haags-Measurement-Tool/

I already have the code that will receive those two variables as the user drags the mouse around. The effects will be:

- Vector Magnitude changes: the voice volume increases/decreases. This is most likely the easy part.

- Vector Angle changes: as it is modified the pitch (listen to the two extremes attached) will vary.

What I need is the back-end part (library, etc).

The question is actually more general than simple signal intensity and frequency. I need to "modulate" a signal based on user's activity. Any recommendations are welcome.

TIA,

-Ramon F. Herrera
JFK Numbers

Wow! This used to be one of the few Usenet newsgroups that had survived the onslaught of the sons/daughters of bitches. In fact, the production and interchanges were remarkable.

The bastards killed it!

Is there ANY Usenet Newsgroup that is actually functional?

-Ramon
JFK Numbers

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From gah4@u.washington.edu@21:1/5 to Piotr Mancini on Mon Jul 13 18:53:27 2020

On Sunday, July 12, 2020 at 8:58:29 PM UTC-7, Piotr Mancini wrote:

(snip)

- Vector Angle changes: as it is modified the pitch (listen to
the two extremes attached) will vary.

More usual are ones to change speed, but not pitch. Since you
can change the sampling rate, that is equivalent.

Before digital, there were analog tape players that did this
using a moving head.

You can speed up voice by cutting out segments, long enough to
determine pitch, but short enough not to determine phonemes.

With the popular 44.1kHz sampling rate divisible by 3,
(3*14.7kHz), you could speed up by 1.5 by removing 0.1s every 0.2s,
so remove 14700 samples, then leave 29400 samples.

Now you want to resample. Since it is hard to describe a better
way in a short note, double every other sample of the 29400 sample
fragment. You should probably low-pass the result, but
maybe close enough.

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From Piotr Mancini@21:1/5 to ga...@u.washington.edu on Mon Jul 13 20:00:55 2020

On Monday, July 13, 2020 at 8:53:31 PM UTC-5, ga...@u.washington.edu wrote:

On Sunday, July 12, 2020 at 8:58:29 PM UTC-7, JFK Numbers wrote:

(snip)

- Vector Angle changes: as it is modified the pitch (listen to
the two extremes attached) will vary.

More usual are ones to change speed, but not pitch. Since you
can change the sampling rate, that is equivalent.

Before digital, there were analog tape players that did this
using a moving head.

You can speed up voice by cutting out segments, long enough to
determine pitch, but short enough not to determine phonemes.

With the popular 44.1kHz sampling rate divisible by 3,
(3*14.7kHz), you could speed up by 1.5 by removing 0.1s every 0.2s,
so remove 14700 samples, then leave 29400 samples.

Now you want to resample. Since it is hard to describe a better
way in a short note, double every other sample of the 29400 sample
fragment. You should probably low-pass the result, but
maybe close enough.

Thank you so much! Finally...

What I need pretty much is an OSS library to manipulate audio signals. The more high level (audio-specific?), the better.

Once I have that, I will either code the app myself, or (most likely) get a hired gun (Freelancer) to do the implementation by you described for me.

Thanks again!!

-Ramon
JFK Numbers

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From Piotr Mancini@21:1/5 to JFK Numbers on Mon Jul 13 20:48:35 2020

On Sunday, July 12, 2020 at 10:58:29 PM UTC-5, JFK Numbers wrote:

I just learned how to convert an audioclip from a 33.3 rpm vinyl record to 78 rpm, here:

https://community.adobe.com/t5/audition/converting-33-recording-to-78-how/td-p/9448709?page=1

What I need is similar but probably harder. I am developing a web application based on a segment from a 2013 TV program. Only the first seconds are relevant:

https://www.youtube.com/watch?v=8MF04X2aLBw

The clone of that segment is an interactive application, under construction, seen below. Notice how the user has two degrees of freedom: the vector's magnitude and its angle.

http://www.dealey-plaza.org/this-government-as-promised/SBT-MBT-Tools/Haags-Measurement-Tool/

I already have the code that will receive those two variables as the user drags the mouse around. The effects will be:

- Vector Magnitude changes: the voice volume increases/decreases. This is most likely the easy part.

- Vector Angle changes: as it is modified the pitch will vary.

What I need is the back-end part (library, etc).

The question is actually more general than simple signal intensity and frequency. I need to "modulate" a signal based on user's activity. Any recommendations are welcome.

TIA,

-Ramon F. Herrera
JFK Numbers

This needs further explanation. I will try to be as succinct as possible.

If you prefer technical issues only and don't care about history, politics and controversy please STOP reading now. Move on.

If you haven't please watch this videoclip and pay close attention. That is the most advanced study ever done of the shooting, it was paid with the unlimited expense credit card of the Koch brothers.

https://www.youtube.com/watch?v=8MF04X2aLBw

There have been 12 "scientific" studies of the Kennedy murder. All have produced pre-ordained results, some are LN ("It was Lee, alone, 3 shots") the rest are CT. What they have in common is that they are all fraudulent. Every single one of them. See
them here:

https://archive.org/details/@the_12_fraudulent_studies?sort=titleSorter

This is also important. Below is the front end to the audio-distortion application, just click on "Continue".

http://www.dealey-plaza.org/this-government-as-promised/SBT-MBT-Tools/The-12-Fraudulent-Studies/

One of my fundamental beliefs is this, as I told a book author:

- Chances of a lawyer admitting to a counterpart: "You were right all these years, it was a conspiracy"?
Hades will proverbially freeze over before that happens.

- Chances of a physician telling a dissenting colleague: "You were right on the autopsy X-rays, the fatal shot did not come from behind"
Similar to the odds above.

- Chances of an engineer, physicist, 3D designer, etc.?

Now we are talking. And I mean that in the literal sense: They ARE talking. Our colleagues are. Notice these 2 images:

http://www.jfknumbers.org/~ramon/jfk/Two-Mikes-One-is-a-Liar.png http://www.jfknumbers.org/~ramon/jfk/My-Dear-Colleague-and-Mentor-Mike-McCormick.jpg

That is enough intro. Later, I will be asking (make that: begging) for help on the audio aspects of "The Subject That Never Dies".

The official version is a dead man walking, BTW. It is up to us, numerically trained people (who have been away from the case, always controlled by liars, err, I mean: lawyers) to solve forever that tragic event. We cannot allow the Fake News, haters of
academia, science, logic, MAHA hat wearing types to destroy history.

-Ramon
JFK Numbers
ramon at jfknumbers dot org

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From boB@21:1/5 to piotr.mancini@gmail.com on Wed Jul 15 18:06:38 2020

On Mon, 13 Jul 2020 12:57:15 -0700 (PDT), Piotr Mancini <piotr.mancini@gmail.com> wrote:

On Sunday, July 12, 2020 at 10:58:29 PM UTC-5, Piotr Mancini wrote:

I just learned how to convert an audioclip from a 33.3 rpm vinyl record to 78 rpm, here:

https://community.adobe.com/t5/audition/converting-33-recording-to-78-how/td-p/9448709?page=1

What I need is similar but probably harder. I am developing a web application based on a segment from a 2013 TV program. Only the first seconds are relevant:

https://www.youtube.com/watch?v=8MF04X2aLBw

The clone of that segment is an interactive application, under construction, seen below. Notice how the user has two degrees of freedom: the vector's magnitude and its angle.

http://www.dealey-plaza.org/this-government-as-promised/SBT-MBT-Tools/Haags-Measurement-Tool/

I already have the code that will receive those two variables as the user drags the mouse around. The effects will be:

- Vector Magnitude changes: the voice volume increases/decreases. This is most likely the easy part.

- Vector Angle changes: as it is modified the pitch (listen to the two extremes attached) will vary.

What I need is the back-end part (library, etc).

The question is actually more general than simple signal intensity and frequency. I need to "modulate" a signal based on user's activity. Any recommendations are welcome.

TIA,

-Ramon F. Herrera
JFK Numbers

Wow! This used to be one of the few Usenet newsgroups that had survived the onslaught of the sons/daughters of bitches. In fact, the production and interchanges were remarkable.

The bastards killed it!

Is there ANY Usenet Newsgroup that is actually functional?

-Ramon
JFK Numbers

Yes there are a few I think. But not much.

I remember this group in the early 1990s !

People here helped me out when I needed another DSP56001 processor and
someone sent me one ! Still have it today.

Long live comp.dsp

Or something like that...

boB

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From Sebastian Doht@21:1/5 to All on Thu Jul 16 21:21:21 2020

Am 14.07.2020 um 05:00 schrieb Piotr Mancini:

On Monday, July 13, 2020 at 8:53:31 PM UTC-5, ga...@u.washington.edu wrote:

On Sunday, July 12, 2020 at 8:58:29 PM UTC-7, JFK Numbers wrote:

(snip)

- Vector Angle changes: as it is modified the pitch (listen to
the two extremes attached) will vary.

More usual are ones to change speed, but not pitch. Since you
can change the sampling rate, that is equivalent.

Before digital, there were analog tape players that did this
using a moving head.

You can speed up voice by cutting out segments, long enough to
determine pitch, but short enough not to determine phonemes.

With the popular 44.1kHz sampling rate divisible by 3,
(3*14.7kHz), you could speed up by 1.5 by removing 0.1s every 0.2s,
so remove 14700 samples, then leave 29400 samples.

Now you want to resample. Since it is hard to describe a better
way in a short note, double every other sample of the 29400 sample
fragment. You should probably low-pass the result, but
maybe close enough.

Thank you so much! Finally...

What I need pretty much is an OSS library to manipulate audio signals. The more high level (audio-specific?), the better.

Once I have that, I will either code the app myself, or (most likely) get a hired gun (Freelancer) to do the implementation by you described for me.

Thanks again!!

-Ramon
JFK Numbers

Not sure if it has exactly what you have been looking for, but have you
had a look at the open source audio editor Audacity (https://www.audacityteam.org/)? If it can accomplish what you need you
might be able to extract the required functions from its backend library portaudio.

Greetz,

Sebastian

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From gah4@u.washington.edu@21:1/5 to Piotr Mancini on Thu Jul 16 17:22:56 2020

On Sunday, July 12, 2020 at 8:58:29 PM UTC-7, Piotr Mancini wrote:

(snip)

The question is actually more general than simple signal
intensity and frequency. I need to "modulate" a signal based
on user's activity. Any recommendations are welcome.

Reminds me of at a seminar last year (that is, pre-Covid) wondering
about an accent remover. The seminar speaker had a strong accent
which made it hard to understand. (That was in CSE, too!)

At a later seminar, related to deep learning and neural nets,
someone had a system that might be able to do it. It would use
one voice as a training set, then convert others into that voice.

But note that this could also be done using a speech to text system,
followed by a text to speech synthesizer. Not real time, but maybe
close enough.

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From gah4@u.washington.edu@21:1/5 to Sebastian Doht on Thu Jul 16 17:18:30 2020

On Thursday, July 16, 2020 at 12:21:25 PM UTC-7, Sebastian Doht wrote:

(snip)

Not sure if it has exactly what you have been looking for, but have you
had a look at the open source audio editor Audacity (https://www.audacityteam.org/)? If it can accomplish what you need you
might be able to extract the required functions from its backend library portaudio.

Note that the OPs question can be answered with two operations.

One changes the speed without changing the pitch, then resampling
to get the original speed with pitch change.

The OP asked for 'real time', which I will interpret as minimal delay.
It can't be done with zero delay, but maybe close enough.

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

Who's Online
Recent Visitors
- Keyop
  Sat Dec 21 03:08:36 2024
  from Huddersfield, West Yorkshire via SSH
- Bob Smith
  Sat Dec 21 02:40:27 2024
  from Here via SSH
- Bob Worm
  Fri Dec 20 21:12:29 2024
  from Wales, Uk via Telnet
- Bob Worm
  Fri Dec 20 15:21:07 2024
  from Wales, Uk via Telnet
- Johnnyv
  Fri Dec 20 15:17:33 2024
  from Bilbao, Spain via Raw
- Keyop
  Thu Dec 19 23:12:34 2024
  from Huddersfield, West Yorkshire via SSH
- Keyop
  Thu Dec 19 19:31:18 2024
  from Huddersfield, West Yorkshire via SSH
- Bob Worm
  Thu Dec 19 08:59:46 2024
  from Wales, Uk via Telnet

System Info

Sysop:	Keyop
Location:	Huddersfield, West Yorkshire, UK
Users:	379
Nodes:	16 (2 / 14)
Uptime:	67:24:48
Calls:	8,084
Calls today:	2
Files:	13,068
Messages:	5,849,513

How to change a voice's pitch in real time?

Who's Online

Recent Visitors

System Info