audio – Make Me Engineer

Note onset detection

June 15, 2023 by Tarik

Here is a graphic that illustrates the threshold approach to note onset detection: This image shows a typical WAV file with three discrete notes played in succession. The red line represents a chosen signal threshold, and the blue lines represent note start positions returned by a simple algorithm that marks a start when the signal … Read more
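The excerpt above is cut off before it states the exact rule, so the following is only a minimal sketch of one possible threshold detector, not necessarily the post's algorithm. It assumes a start is marked when the absolute sample value reaches the threshold after a stretch of quiet. The names `detect_onsets`, `min_silence` and the synthetic `signal` are hypothetical and chosen for illustration.

```python
import math


def detect_onsets(samples, threshold, min_silence):
    """Return sample indices where a note is assumed to start.

    A start is marked when |sample| >= threshold and no sample has been
    that loud for more than `min_silence` samples. The hold-off matters
    because a waveform crosses zero on every cycle, so the raw signal
    dips below the threshold many times inside a single note.
    """
    onsets = []
    last_loud = None  # index of the most recent sample at or above threshold
    for i, s in enumerate(samples):
        if abs(s) >= threshold:
            if last_loud is None or i - last_loud > min_silence:
                onsets.append(i)
            last_loud = i
    return onsets


def tone(n, period=20):
    """A unit-amplitude sine wave, n samples long (illustrative test data)."""
    return [math.sin(2 * math.pi * i / period) for i in range(n)]


# Two synthetic "notes" separated by silence.
signal = [0.0] * 100 + tone(200) + [0.0] * 100 + tone(200)
print(detect_onsets(signal, threshold=0.5, min_silence=50))
```

The detected positions land slightly after each note's true start, because the waveform needs a few samples to climb to the threshold. A lower threshold reduces that delay but makes the detector more sensitive to noise.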

using FFMPEG with silencedetect to remove audio silence

May 27, 2023 by Tarik

Use the silenceremove filter. This removes silence from the audio track only – it will leave the video unedited, i.e., things will go out of sync. Its arguments are a little cryptic. An example: ffmpeg -i input.mp3 -af silenceremove=1:0:-50dB output.mp3 This removes silence at the beginning (indicated by the first argument 1) with minimum length … Read more

How is audio represented with numbers in computers?

May 11, 2023 by Tarik

Physically, as you probably know, audio is a vibration. Typically, we’re talking about vibrations of air between approximately 20 Hz and 20,000 Hz. That means the air is moving back and forth 20 to 20,000 times per second. If you measure that vibration and convert it to an electrical signal (say, using a microphone), you’ll get an … Read more
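As a rough illustration of how such a vibration can end up as numbers, the sketch below samples a sine wave at regular intervals and rounds each measurement to a signed 16-bit integer. The 44,100 Hz rate, the 16-bit depth and the function name `sample_sine` are assumptions made for this example and do not come from the post.

```python
import math

SAMPLE_RATE = 44_100  # measurements per second (assumed, CD-style rate)


def sample_sine(freq_hz, duration_s, sample_rate=SAMPLE_RATE, bits=16):
    """Sample a pure tone and quantize it to signed integers.

    Each value is the wave's height at one instant, scaled so the peak
    fits the largest positive number the bit depth can hold.
    """
    max_amp = 2 ** (bits - 1) - 1  # 32767 for 16-bit audio
    n = round(duration_s * sample_rate)
    return [
        round(max_amp * math.sin(2 * math.pi * freq_hz * i / sample_rate))
        for i in range(n)
    ]


samples = sample_sine(440, 0.01)  # 10 ms of a 440 Hz tone
print(len(samples), samples[:5])
```

The result is just a list of integers, one per sampling instant. Played back at the same rate, those integers drive a speaker through the same motion that was measured.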

What do the bytes in a .wav file represent?

May 2, 2023 by Tarik

You will have heard that audio signals are represented by some kind of wave. If you have ever seen those wave diagrams with a line going up and down, that’s basically what’s inside those files. Take a look at this picture from http://en.wikipedia.org/wiki/Sampling_rate. You see your audio wave (the gray line). The current … Read more
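To make the idea concrete, here is a small sketch using Python's standard `wave` and `struct` modules. It writes a few sample values into an in-memory WAV file and decodes them back. It assumes mono, 16-bit little-endian PCM, and the helper names `write_wav` and `read_wav` are hypothetical.

```python
import io
import struct
import wave


def write_wav(samples, sample_rate=8000):
    """Encode signed 16-bit samples as a mono WAV file and return its bytes."""
    buf = io.BytesIO()
    with wave.open(buf, "wb") as w:
        w.setnchannels(1)       # mono
        w.setsampwidth(2)       # 2 bytes = 16 bits per sample
        w.setframerate(sample_rate)
        # "<h" = little-endian signed 16-bit integer, one per sample
        w.writeframes(struct.pack("<%dh" % len(samples), *samples))
    return buf.getvalue()


def read_wav(data):
    """Decode a mono 16-bit WAV file; return (samples, sample_rate)."""
    with wave.open(io.BytesIO(data), "rb") as w:
        raw = w.readframes(w.getnframes())
        samples = list(struct.unpack("<%dh" % (len(raw) // 2), raw))
        return samples, w.getframerate()


wav_bytes = write_wav([0, 1000, -1000, 32767, -32768])
decoded, rate = read_wav(wav_bytes)
print(wav_bytes[:4], decoded, rate)
```

After the header, every pair of bytes is one point on the wave: its height at one sampling instant.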

How can I calculate audio dB level?

October 7, 2022 by Tarik

All the previous answers are correct if you want a technically accurate or scientifically valuable answer. But if you just want a general estimation of comparative loudness, like if you want to check whether the dog is barking or whether a baby is crying and you want to specify the threshold in dB, then it’s … Read more
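The excerpt is truncated before it gives its method, so the sketch below shows one common rough estimate rather than the post's own: the RMS level of a block of samples expressed in dB relative to full scale (dBFS). The function name `rms_dbfs` and the 16-bit full-scale value of 32768 are assumptions for this example.

```python
import math


def rms_dbfs(samples, full_scale=32768.0):
    """Estimate the loudness of a block of samples in dBFS.

    0 dBFS is a constant signal at full scale; quieter blocks give
    negative values. Silence (or an empty block) returns -inf.
    """
    if not samples:
        return float("-inf")
    rms = math.sqrt(sum(s * s for s in samples) / len(samples))
    if rms == 0:
        return float("-inf")
    return 20 * math.log10(rms / full_scale)


# Halving the amplitude lowers the level by about 6 dB.
print(rms_dbfs([32768] * 10), rms_dbfs([16384] * 10))
```

A value like this is only a comparative measure. A threshold for "the dog is barking" would have to be tuned by ear against recordings made with the same microphone and gain settings.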

Detecting when head phones are plugged in

October 5, 2022 by Tarik

In Windows Vista and later, you can use the device arrival and removal notifications and retrieve the endpoint form factor to determine whether the manufacturer of your audio solution considers a particular endpoint a “headphone”. Before Vista there was no way of determining this information.

HTML5 Safari live broadcast vs not

July 26, 2022 by Tarik

Can you post the headers sent by the server both with and without the PHP script? I’m wondering if the PHP script is sending a different Content-Type than serving the files normally. It would also be a good idea to specify the type attribute on the source elements, so the browser does not have to … Read more

How to overlay/downmix two audio files using ffmpeg

July 9, 2022 by Tarik

stereo + stereo → stereo. Normal downmix: use the amix filter: ffmpeg -i input0.mp3 -i input1.mp3 -filter_complex amix=inputs=2:duration=longest output.mp3 Or the amerge filter: ffmpeg -i input0.mp3 -i input1.mp3 -filter_complex amerge=inputs=2 -ac 2 output.mp3 Downmix each input into a specific output channel: use the amerge and pan filters: ffmpeg -i input0.mp3 -i input1.mp3 -filter_complex "amerge=inputs=2,pan=stereo|c0<c0+c1|c1<c2+c3" output.mp3 mono … Read more

What is the best way to merge mp3 files? [closed]

June 29, 2022 by Tarik

David’s answer is correct that just concatenating the files will leave ID3 tags scattered inside (although this doesn’t normally affect playback, so you can do “copy /b” or on UNIX “cat a.mp3 b.mp3 > combined.mp3” in a pinch). However, mp3wrap isn’t exactly the right tool to just combine multiple MP3s into one “clean” file. Rather … Read more

How to add a new audio (not mixing) into a video using ffmpeg?

June 15, 2022 by Tarik

Replace audio: ffmpeg -i video.mp4 -i audio.wav -map 0:v -map 1:a -c:v copy -shortest output.mp4 The -map option allows you to manually select streams / tracks. See FFmpeg Wiki: Map for more info. This example uses -c:v copy to stream copy (mux) the video. No re-encoding of the video occurs. Quality is preserved and the … Read more