Welcome to Audio Babel

...an infinite collection of all possible audio files...

Who made this thing?

Hey, my name's Tomer Braff. I'm a programmer, developer, game designer, actor, voice over artist, youtuber, all around something-or-other. I got a portfolio site at tomerbraff.com, a YouTube channel I sometimes do things with, and you can find on all the usual places like Twitter and LinkedIn. If you're curious or just want to chat, contact me at tomerbraff@gmail.com.

What is this site?

The Audio Library of Babel (or Audibel) is inspired by the Library of Babel, a thought experiment by Jorge Luis Borges. It is a collection of all possible books that can be created with a given set of parameters. It is also inspired by the Library of Babel website by Jonathan Basile, which implements said concept in a digital format. The goal in this case is to do the same, but for audio files.

How does it work?

Every audio file is generated from a unique "address" that encodes the audio data. The address is a string of characters that can be used to recreate the exact same audio file. These addresses are read as a base64 number (specifically URL Safe) and converted to binary. This represents the raw audio sample data, which then has a pre-determined WAV header appended at the end to create a valid audio file. The .wav header is fixed, meaning all resulting audio files will have the same sample rate (44.1kHz), bit depth (16-bit), and number of channels (mono). Essentially, the index represent the specific sequence of audio samples that make up the sound.

Surfing the Audio .WAV

Given these parameters we can calculate that there would be a total of $2^{44,100 \times 5 \times 16}=2^{3,528,000}$ possible 5-second audio files, which is approximately $10^{10^7}=10^{10,000,000}$ files. This far exceeds the estimated number of atoms in the observable universe (approximately $10^{80}$). The vast majority of these audio files will be complete noise, but among them are also every possible sound that can be represented within these parameters, including all music, speech, and other audio recordings.

Glossary