in this section we share insights, tutorials and articles about immersive audio

If you have any requestes we´ll be more than happy to do our best to answer any questions

The 4:00 AM Wakeup Call

The rainforest doesn't wake up gently. It explodes into life all at once.

My job is to capture this sonic madness and bottle it into a perfect, 360-degree digital sphere. I am an Ambisonic field recordist, and recording in one of the most biodiverse places on Earth is equal parts paradise and technical warfare. When someone listens to these tracks later on headphones or a VR headset, they won't just hear a rainforest. They will hear a poison dart frog chirping exactly three feet to their front-left, while a scarlet macaw screeches across the sky directly over their head from back to front.

The Enemy: Humidity, Ants, and Static

Capturing pristine audio in a tropical paradise sounds romantic until you actually try it. The rainforest actively tries to destroy electronics. The humidity here routinely hovers around 90-95%. For high-voltage condenser microphones, moisture is a death sentence. It causes "capsule frying"—a lovely phenomenon where your pristine nature recording is suddenly ruined by a sound resembling bacon sizzling in a pan. I have to travel with a mountain of silica gel packs and custom-sealed dry bags just to keep my gear functioning. Then there is the wildlife. I once left an Ambisonic rig isolated on a tripod for a two-hour "soundscape drop" deep in Corcovado National Park. When I returned, a colony of leafcutter ants had decided my furry windshield was the perfect structural material for their nest.

The Symphony of the Canopy

Despite the sweat, the mud, and the constant gear maintenance, nothing compares to the moment you hit record and put on your headphones. Through the spatial decoding, the jungle opens up. Around 5:30 AM, the "dawn chorus" hits its peak. You can map the geometry of the forest using your ears alone. The low-frequency thrum of the howler monkeys anchors the bottom of the soundscape. Above that, the mid-range is dominated by the rhythmic, clock-like clicking of toucans. Filling every remaining pocket of space is the high-frequency fizz of thousands of unseen cicadas and tree frogs. It’s a dense, chaotic, beautifully mixed symphony where every species has evolved to speak in its own specific frequency slot so it can be heard over its neighbors.

Preserving a Vanishing World

Living and working in Costa Rica has taught me that these recordings are more than just cool sound effects for games or films—they are historical records. Landscapes are changing fast. By capturing these full-sphere acoustic snapshots, we are archiving the sonic health of these ecosystems. When I pack up my gear, soaked in sweat and covered in mud, I know I’m bringing back a piece of the jungle that people all over the world can step into. It’s the closest thing we have to a sonic time machine. 

Emiliano Gomez (Ftct music)

#SpatialAudio #BinauralAudio #Ambisonics #3DAudio #ImmersiveAudio #Binaural #Ambisonic

Check out these great youtube channels 

great information about immersive audio techniques, sofware and more

 

https://www.youtube.com/@michaelgwagner

 

https://www.youtube.com/@AudioBrewers

 

https://www.youtube.com/@sound.particles

 

https://orbitspatial.io/

 

 

 

 

#SpatialAudio #BinauralAudio #Ambisonics #3DAudio #SoundDesign #AudioEngineering #AudioProduction #SoundDesigner #AudioPost #ProAudio #MixingAndMastering #MusicProduction #ImmersiveAudio #Binaural #Ambisonic

Beyond Stereo:

The Beginner’s Guide to Ambisonic AudioImagine standing in the middle of a dense rainforest.

A bird chirps high up in the canopy to your front-left. Twigs snap underfoot directly behind you. A low rumble of thunder rolls in from the distant right.If you record this with a standard microphone, you flatten that majestic, three-dimensional reality into a two-dimensional pancake (stereo left and right). But what if you could capture the entire sphere of sound exactly as your ears perceive it?Welcome to the world of Ambisonics—the audio tech powering Virtual Reality (VR), immersive cinema, and the next generation of sound design.
Here is everything you need to know to get started.
What on Earth is Ambisonic Audio?
At its core, Ambisonics is a full-sphere surround sound technique. Unlike traditional surround sound (like 5.1 or 7.1), which relies on sending specific audio signals to specific speakers placed around a room, Ambisonics treats sound as a complete 360-degree bubble.It doesn't just capture left, right, front, and back; it captures above and below.The coolest part?
Ambisonic recordings are speaker-independent. You capture the "sonic bubble" once, and then decode it later to fit whatever playback system you want—whether that’s a pair of standard headphones using binaural rendering, a soundbar, or a massive 22-channel speaker array.How It Works: The "A-Format" vs. "B-Format"To understand Ambisonics, you have to look at the specialized microphones used to capture it. They don't look like your standard vocal mic. Instead, they feature a cluster of four distinct microphone capsules arranged in a tight tetrahedron (a triangular pyramid shape).When you hit record, the process happens in two distinct stages:
1. The A-Format (The Raw Capture)The four individual capsules on the microphone record four separate tracks of audio. They point in different directions: Left-Front-Up (LFU), Right-Front-Down (RFD), Left-Back-Down (LBD), and Right-Back-Up (RBU). At this stage, the audio isn't usable for standard listening yet—it’s just four raw perspectives of the same space.
2. The B-Format (The Magic Sauce)Using a software plugin (usually provided by the microphone maker), the raw A-Format is converted into B-Format. This is where the magic happens. The software recalculates those four tracks into a mathematical coordinate system consisting of four entirely new channels:W (Omnidirectional): The overall sound pressure level, carrying all the acoustic data without direction (the center point).X (Front-to-Back): Measures sound on the front/back axis.Y (Left-to-Right): Measures sound on the left/right axis.Z (Up-to-Down): Measures sound on the vertical height axis.
Why this matters: Because the audio is mapped to X, Y, and Z axes, if a user turns their head while wearing a VR headset, the audio decoder can instantly rotate the sound field in real-time. If you look to the left, the sound that was in front of you seamlessly shifts to your right ear.First-Order vs. Higher-Order Ambisonics (HOA)When researching Ambisonics, you will frequently see the term "Order." This essentially refers to the resolution and spatial accuracy of the sound field.1st-Order Ambisonics: Uses 4 channels (W, X, Y, Z). It’s great for basic spatial awareness and is the most common format for field recorders.2nd and 3rd-Order (Higher-Order): Uses 9, 16, or even more channels by adding more microphone capsules. Higher-order formats offer incredibly sharp "spatial resolution," meaning you can pinpoint exactly where a sound is coming from with laser precision.

 

#VRAudio #GameAudio #VirtualReality #AugmentedReality #SpatialComputing #GameDev #360Video #XR

Ambisonic vs Binaural

The emergence of virtual reality (VR), 360-degree video, and immersive gaming has pushed traditional stereo audio past its limits. Today, sound designers and audio engineers rely on spatial audio to construct believable, three-dimensional acoustic environments. Two terms dominate this space: Binaural Audio and Ambisonics. While both formats aim to envelop the listener in a 3D soundscape, they approach the task from fundamentally different technical directions.

 

1. Binaural Audio: Replicating Human Hearing Binaural audio is designed exclusively for headphone playback. Its core philosophy is to directly mimic the physical mechanics of how human ears receive and interpret sound waves. When you hear a sound in the real world, your physical head, torso, and outer ears (the pinnae) modify the acoustic waves before they reach your eardrums. Our brains calculate these microscopic shifts using Interaural Time Differences (ITD) and Interaural Level Differences (ILD) to pinpoint exactly where a sound originates. Binaural audio captures this by using specialized "dummy head" microphones that possess anatomically correct ears. Alternatively, sound engineers can artificially process standard mono tracks using software filters known as Head-Related Transfer Functions (HRTFs) to trick the brain into perceiving depth, height, and direction. The Limitation: Traditional binaural audio is structurally "static." Because the sound transformations are baked directly into a two-channel (left/right) track based on a fixed head position, the acoustic environment is locked. If you rotate your head while wearing standard headphones, the entire sonic universe turns with you, breaking the illusion of a fixed physical environment.

 

2. Ambisonics: Mapping the Entire Sound Field Unlike binaural audio, which models how a listener receives sound, Ambisonics models the acoustic environment itself. It is a scene-based, speaker-independent format that captures or synthesizes an entire 3D sphere of sound around a central point. Ambisonics achieves this by mathematically decomposing a sound field into mathematical components called spherical harmonics.

The most common standard—First-Order Ambisonics (B-format)—records audio across four independent channels:

 

W: Omni-directional sound pressure

X: Front-to-back directionality

Y: Left-to-right directionality

Z: Up-to-down directionality

 

The Power of Flexibility: Because Ambisonics maps a mathematical sphere of sound rather than a specific playback device, it is infinitely scalable. A single Ambisonic master file can be decoded to output to a 5.1 surround-sound living room, a massive 22.2 loudspeaker dome arena, or compressed for headphones. Natively Interactive: Because the entire 3D sound field is mathematically preserved, Ambisonic environments can be rotated digitally in real-time . This makes it the definitive backend standard for VR/AR platforms and 360-degree videos, allowing the soundscape to stay perfectly anchored to the environment when a user moves their head

 

How they converge Binaural ambisonics

 

It is a mistake to view these formats purely as rivals; in modern media workflows, they frequently work together.

To experience a 360-degree Ambisonic environment on consumer headphones (such as watching an immersive video on YouTube), the spatial sound field must be translated into something your ears can understand. This process is called binaural Ambisonic rendering (McKenzie et al., 2019). The system actively takes the Ambisonic sound field, calculates the user’s real-time head orientation via tracking sensors, and decodes it through a virtual HRTF pipeline into a binaural signal on the fly (Berebi et al., 2025; Politis et al., 2017).

In short, if you need a quick, highly realistic 3D acoustic snapshot for a listener whose perspective will remain fixed, binaural audio is highly effective. But if you are building an interactive, responsive world where the user has the freedom to look around, Ambisonics provides the required canvas.

 

 

#SpatialAudio #BinauralAudio #Ambisonics #3DAudio #ImmersiveAudio #Binaural #Ambisonic