October 2024: Fractal Audio's VP4 Virtual Pedalboard added to the wiki.
Difference between revisions of "Impulse responses (IR)"
(→Future) |
|||
Line 185: | Line 185: | ||
Acoustic sounds benefit from long IRs with some room ambience, so UltraRes IRs are preferred. | Acoustic sounds benefit from long IRs with some room ambience, so UltraRes IRs are preferred. | ||
− | =Future= | + | =Future of IRs= |
<blockquote>"The Axe-Fx III (and II) actually capture the Volterra kernels when doing an IR capture (it's hidden in the firmware for possible future use). I've studied dozens upon dozens of them and the kernels above first order (the first order kernel is the linear IR) are so small as to be inaudible. The distortion from an amp is orders of magnitude greater even when using a clean amp. The only significant nonlinear thing I've measured that speakers do is thermal compression (that we model already) and "cone cry" which sounds like sh*t. Jay Mitchell is probably the leading authority on speaker design and he has stated pretty much the same thing. I'm all for improvements but they need to be real improvements. I've sat here countless times comparing an IR to the actual speaker with a mic on it doing blind A/B tests and can NEVER tell the difference and I think my ears are pretty good. I dug through my Matlab stuff and found this. (graph) This is an IR of a speaker taken twice. The first time the drive level is around 1W (in red). The second time the amp was turned way up, I would estimate at least 50W (in blue). As you can see the difference is extremely small. There's a small difference from 10 Hz and down which is way below the reproduction range of any system and a difference way up at Nyquist (24 kHz) but that's 100 dB down (!). Furthermore we don't know if the tiny differences are from the speaker or from the amp or the mic or the mic preamp. I should add that speakers can and do distort (when Xmax is exceeded) but it's not a pleasant sound. Since the displacement of the cone is the inverse of the frequency the low frequencies are distorted which is the opposite of what you want when creating "pleasing distortion". Speaker distortion is flubby, flabby and farty. The Axe-Fx II and III can simulate that, if desired, using the Speaker Drive parameter in the Amp block. I always set it to zero. There are probably some other modes that cause distortion but, again, these are dwarfed by the distortion of the amp. The only other significant one I've experienced is cone cry. Manufacturers go to great lengths to prevent it from happening. I have a speaker here that does it. Whenever I play a high F it cries and it's annoying." [http://forum.fractalaudio.com/threads/will-we-see-improvements-new-tech-regarding-speaker-cabinets.135191/page-2#post-1599277 source] </blockquote> | <blockquote>"The Axe-Fx III (and II) actually capture the Volterra kernels when doing an IR capture (it's hidden in the firmware for possible future use). I've studied dozens upon dozens of them and the kernels above first order (the first order kernel is the linear IR) are so small as to be inaudible. The distortion from an amp is orders of magnitude greater even when using a clean amp. The only significant nonlinear thing I've measured that speakers do is thermal compression (that we model already) and "cone cry" which sounds like sh*t. Jay Mitchell is probably the leading authority on speaker design and he has stated pretty much the same thing. I'm all for improvements but they need to be real improvements. I've sat here countless times comparing an IR to the actual speaker with a mic on it doing blind A/B tests and can NEVER tell the difference and I think my ears are pretty good. I dug through my Matlab stuff and found this. (graph) This is an IR of a speaker taken twice. The first time the drive level is around 1W (in red). The second time the amp was turned way up, I would estimate at least 50W (in blue). As you can see the difference is extremely small. There's a small difference from 10 Hz and down which is way below the reproduction range of any system and a difference way up at Nyquist (24 kHz) but that's 100 dB down (!). Furthermore we don't know if the tiny differences are from the speaker or from the amp or the mic or the mic preamp. I should add that speakers can and do distort (when Xmax is exceeded) but it's not a pleasant sound. Since the displacement of the cone is the inverse of the frequency the low frequencies are distorted which is the opposite of what you want when creating "pleasing distortion". Speaker distortion is flubby, flabby and farty. The Axe-Fx II and III can simulate that, if desired, using the Speaker Drive parameter in the Amp block. I always set it to zero. There are probably some other modes that cause distortion but, again, these are dwarfed by the distortion of the amp. The only other significant one I've experienced is cone cry. Manufacturers go to great lengths to prevent it from happening. I have a speaker here that does it. Whenever I play a high F it cries and it's annoying." [http://forum.fractalaudio.com/threads/will-we-see-improvements-new-tech-regarding-speaker-cabinets.135191/page-2#post-1599277 source] </blockquote> |
Revision as of 16:53, 20 April 2019
Contents
- 1 About impulse responses (IR)
- 2 IR file format
- 3 IR resolution: UltraRes, HiRes, Normal
- 4 IR sampling rate
- 5 Different IRs for different processors
- 6 Near-field and far-field IRs
- 7 Minimum Phase Transformation (MPT) and Auto Trim
- 8 IR Capture: create your own IRs
- 9 IRs versus Tone Matching
- 10 Acoustic sounds
- 11 Future of IRs
- 12 Commercial and free IRs
About impulse responses (IR)
To recreate the tonal characteristics of speaker cabs, the Cab block (and IR Player block on the Axe-Fx III) relies on Impulse Responses (IRs).
An IR is a collection of data representing sound measurements taken from a speaker cabinet or system. A test signal is played through the actual speaker, recorded, and used to generate a profile. The profile (IR) can then be used by an IR-loader, such as the Cab block and IR Player blocks, to recreate the sound of the speaker.
The terms "cab", "user cab" and "IR" are often mixed up.
The Axe-Fx II also uses IRs to reproduce the characteristics of specific microphones.
"Fractal Audio Systems speaker simulation technology is incredibly accurate, yet some listeners find the sound of IRs unfamiliar at first. This is because impulse responses typically recreate the sound of “close miking.” When you mic a guitar speaker, the mic “hears” something very different to what you might hear. Our ears are by definition “neutral” whereas a mic has distinct “color.” We typically listen at a distance (and speaker tone is very different as we move around) while a mic is inches away and stationary, focusing on the desirable sound at a specific spot.
As guitarists, we are accustomed to the sound of a speaker “in the room,” but this is not what our audiences hear. For recording and performing, the close mic’d sound is essentially a universal standard. THIS is the sound that the Cab block is designed to reproduce, and this explains why not only guitarists, but recording and front-of-house engineers have embraced its use. (Of course our Amp models can also be used with a traditional guitar speaker as demonstrated in many of the rig designs in Section 1: Setting Up). If you are new to using mics on a guitar amp, you will find the Cab block is a fantastic way to learn more. To get started, listen to single IRs, or explore the factory presets which combine several at once. For almost a century, artists, producers and engineers have honed the craft of placing or blending mics to achieve a desired tone. Many classic techniques are easy to recreate. Try a tried-and-true “recipe” blending one “bottomy” and one “bright” mic, or try something totally original.
The factory content includes dozens of speakers with multiple different mics in different positions. You may also enjoy “Mix” IRs by Fractal Audio or 3rd-parties, which bring a producer’s experience to you in a single IR. In any case, recognize that the sound of IRs is the very sound of the speakers and mics they capture."
Fractal Audio:
"An IR stands for "Impulse Response". In mathematical terms it is the time response of a system to a Dirac delta function (also known as an impulse). An IR can be used directly as the coefficients for an FIR (Finite Impulse Response) filter. In the modeling world IRs are obtained from real speakers and when processed using an FIR filter produce extremely accurate results. In essence an IR is a "sample" of the speaker and microphone and uses very similar principles. However the quality of any IR is subject to the talents of the individual(s) capturing the IR. Mic placement, preamp choice, etc., etc. are important as you are essentially recording the speaker. In the old days modelers used EQ to emulate speaker response but I don't think there are many left that still use that technique. So the quality of the IR is really the issue here. The original Axe-Fx pioneered this technology which has since become almost ubiquitous." source
"It's analogous to a sampler. Consider the sample of, say, a kick drum. You make a short recording of that and then "trigger" that recording. If you want the kick drum louder you make the trigger louder. Now assume you trigger that recording thousands of times a second at varying amplitudes. That's essentially how IRs and convolution work. You trigger the recording at the sample rate and each playback is weighted by the sample amplitude." source
IR file format
The modelers process IRs in .syx format (MIDI System Exclusive).
Capturing IRs creates WAVE files. WAVE files can be converted into SYX files using Cab-Lab, and then can be imported into the hardware. The tool Axe-Manage Cabs in Axe-Edit III can also directly import WAVE files and convert these on the fly.
When using Cab-Lab to convert WAV files to UltraRes resolution, and when using IR Capture to create IRs, two files are created: an .IR file and a .SYX file. The .IR file is the raw IR data that can be imported into Cab-Lab for mixing purposes, and can ONLY be used with Cab-Lab.
IR resolution: UltraRes, HiRes, Normal
Fractal Audio's modelers and software support IRs of various lengths, measured in number of samples (points) and milliseconds. This is also referred to as: resolution.
Normal – 1024 samples, 20 ms
You can often use Normal Res without a noticeable impact on the tone, compared to HiRes and UltraRes.
HiRes – 2040 samples, 40 ms
HiRes processing requires more CPU power than mono or stereo Normal Res, and also more than UltraRes.
UltraRes – up to 8000 samples, 170 ms
UltraRes IR processing requires more CPU power than mono or stereo Normal Res but less than HiRes (about 4% less CPU usage AND higher resolution)!
The Cab block's Stereo UltraRes mode supports two UltraRes IRs. Non-UltraRes IRs will be processed in Stereo UltraRes mode as Normal Res. If one IR is UltraRes and the other not, then the UltraRes IR will still processed as UR and the other as Normal in stereo mode.
UltraRes speaker IR processing is a Fractal Audio proprietary technique which enhances the spectral resolution of an IR without adding CPU burden or storage requirements.
UltraRes IRs do not support size warping. Therefore, the Speaker Size parameter is unavailable in UltraRes mode. This parameter isn’t available on the Axe-Fx III anyway.
UltraRes lRs are displayed in italics or in a different color in the software editors.
(source) "The problem with conventional IRs is that they are too short to capture the detail in the low frequencies. There are those that maintain 20 ms is the maximum length you need to fully replicate the speaker. This would be about 1000 samples at 48 kHz. I disagree with this as I have many IRs here that exhibit significant energy beyond 20 ms. I believe the room has some influence as the low-frequency modes of the room will impact the resulting sound. The amount of this impact depends on the room, the mics, distance, etc., etc. Or perhaps certain speakers have particularly high Qs in the low frequencies. Regardless, it is my opinion that you need IRs much longer than 20 ms to fully capture the "mic'd amp in the studio" sound. My tests show that IRs of 8000 samples are required to fully capture the low-frequency detail. Unfortunately to process an 8K IR in real-time require copious processing power... Fortunately I have developed "Ultra-Res" cabinet modeling. Ultra-Res cabinet modeling provides the frequency detail of a very long IR with little or no added processing power requirements. The following image depicts the response of Ultra-Res cabinet IR processing: ..." (see thread)
"Existing IRs will still be processed as usual. Ultra-Res IRs will be tagged as such which will indicate to the processor to use the new processing algorithms. Note that Ultra-Res IR data is not conventional IR data."
"The frequency resolution of an IR is the sample rate divided by the number of samples in the IR. The window function has nothing to do with frequency resolution (except for making it even less). So a 1K IR at 48 kHz sample rate has a frequency resolution of roughly 48 Hz. If a speaker has a resonance (formant) at, say 80 Hz with a Q of, say, 3.0, then 48 Hz is insufficient to capture that resonance accurately. You need a frequency resolution of several Hz to accurately recreate that resonance. I chose 80 Hz and a Q of 3 because that's what that response looks like. The Q could even be higher than that. It doesn't take much mental energy to realize that if you have a narrow formant at a low frequency then you need fine frequency resolution to reproduce that. An 80 Hz formant with a Q of 3 only spans about 25 Hz. Obviously a frequency resolution of 48 Hz is not going to be able to reproduce that. Windowing only smooths the response even more. This is basic FFT theory. The less time-domain information you have, the less frequency domain information you have and vice-versa. This is the uncertainty principle. I always window IRs with a Hann window."
"Another way to look at it is to think in terms of formants. That particular speaker has a pronounced 80 Hz formant. It takes well over 100 ms for the energy of that formant to decay to the point of imperceptibility. Obviously a 20 ms IR can't reproduce an event that occurs for over 100 ms. Here is a zoom of the original non-minimum-phase IR (IOW raw time response)... (see thread). You can clearly see the 80 Hz formant. There are some room reflections but they are very small. The 80 Hz formant starts well before any reflections. It's obviously a high-Q resonance as it rings for quite a while. The higher the Q, the longer it takes to decay."
"Here's another example. (see thread) This is one of the new OwnHammer IRs. The IR is OwnHammer_412_MAR-CB_D-120_SS_RBN-121. These IRs are 100 ms long (4800 samples). I windowed the original IR to 4K to prove a point. The blue trace is the original IR (windowed to 4K samples). The green trace is the "typical" 20 ms IR (windowed to 1K samples). The red trace is the Ultra-Res version."
"The problem is that human perception is logarithmic and IRs are a linear process. 48 Hz resolution is way more than necessary at, say, a few kHz but not nearly enough at low frequencies. The brute force solution is to use very long IRs, 8K or more. Ultra-Res solves this in a novel way that uses little to no extra processing power and no additional latency."
"Normalization is your friend. Rectangular windows are simply truncation and are generally regarded as bad practice due to extremely high sidelobe levels. The choice of window is subjective. I actually use my own custom window that is not really a Hann window but that's proprietary information. My window preserves more frequency detail while still suppressing Gibbs phenomenon. Windowing trades off frequency resolution for sidelobe suppression. My window is optimized for the unique statistics of IRs. For a random process I tend towards Bessel-Kaiser windows. IRs have unique statistics that aren't addressed by any of the standard textbook windows."
"It is desired that the IR be 8K samples or more."
"Let me state these points:
- We don't record guitar amps in airplane hangers or anechoic chambers. We record them in studios.
- When we record a guitar amp we carefully set the amp up in the studio to get the best sound "on tape". This involves moving the amp around, placing gobos, etc. When we collected the Producer's Packs IRs we spent hours arranging the amps/speakers, mics and gobos and playing through the amp and readjusting until we were satisfied. This also included adjusting the preamps and mixing board. In one studio we found that we got the best tone raising the cabs off the floor by a couple feet, orienting them towards a particular wall and placing gobos behind (this was the engineer's standard recording arrangement).
- At this point our objective of the IR is to capture the sound of that amp/speaker at that position in the room, with the gobos, mics, preamps, etc., etc. The goal is not to capture the raw sound of the amp/speaker in an airplane hanger or outside using a ground-plane measurement and measurement mics. That might be someone else's goal but it is not ours. IOW our goal is to treat the cab, mics, preamps, room, etc. as a whole, as a good engineer/producer would.
- Subsequent analysis of the data shows that there is significant energy out to 100ms and even beyond. However there is little energy beyond 200 ms or so (as it should be in a well-designed studio). This observation was the catalyst for the Ultra-Res algorithm. There are other observations about the statistics of the data that I cannot disclose.
- Some cabinets displayed noticeable resonances at low frequencies. Others did not. The frequency of these resonances were not consistent and, not coincidentally, matched the measured resonance of the impedance sweep. It is a logical conclusion, therefore, that the resonance was NOT caused by the room but by the speaker/cabinet combination. Furthermore a plot of the group delay for the raw data showed that the delay of the resonance was too short to be a room mode. Regardless, whether the resonance is from the speaker or room or mics or preamps is irrelevant. All we care about is recreating the sound of that speaker as it would be recorded as accurately as possible.
- Truncating an IR destroys information by definition. We don't care where the information comes from, be it the speaker or the room or the mics or the preamps. We want all the information. If a plot of the frequency response of a truncated IR differs considerably from the non-truncated version then we have lost information and concomitant accuracy.
- NO ONE producing commercial IRs records them in an airplane hanger, for obvious reasons. The best ones are done in a studio using the same technique we used for the Producer's Packs: setting up the cab, adjusting the position, mics, preamps, etc. and playing through the amp/cab and readjusting until the best tone is achieved. The new OwnHammer IRs are an example of this. Many, if not all, of those IRs exhibit significant energy to 100 ms (and likely beyond but the data stops at 100 ms). Truncating them to 20 ms destroys vital information. You can argue the semantics all day long. I've compared truncated and non-truncated and the difference is clearly audible. It is especially noticeable when chugging power chords. You can hear the resonance. It goes "bonggggggg" as opposed to "thuk". Most importantly it sounds "better" IMO.
- Ultra-Res is an algorithm that markedly increases accuracy. It gives the frequency resolution of a 200ms IR without additional processing overhead and no added latency."
"Ultra-Res is especially powerful in Tone Matching applications, particularly real-time matches and was another impetus behind the development."
"The myopic only see the IR as a capture of the speaker's "unadulterated" response. As I stated before I believe the future is treating IRs as capturing the entire recording chain including mics, preamps, etc. and have pushing in that direction. We have already seen the fruits of this labor in the Producer Pack and OwnHammer V2 IRs. We used mainly PP and OH IRs at Axe-Fest this weekend and the results were stellar. Andy Wood's tone was among the best guitar tones I've ever heard live and we dialed it up in 10 minutes under far less than ideal conditions. It consisted of the Two Rock amp model and the EV 12L Mix IR. When you include more than the speaker response in the IR you can have low-frequency resonances that persist for tens of milliseconds or more. Truncating an IR destroys this LF information. In many cases this LF information loss would probably not be perceptible. In other cases, from experience, it can be extremely noticeable. The bottom line is that you can always remove the information if you don't want it but you can't add back what isn't there."
"Let me phrase this another way. An IR can consist of the "raw" speaker response plus none, one, some or all of the following: mic, preamp, room, power amp (e.g. you want to capture the response of a tube amp driving the speaker), etc. If you only care about the raw response then a short IR is all that is required. However if you want any of the other elements as part of the IR then a longer IR may be necessary. Ultra-Res gives you the OPTION of processing longer IRs."
"If the .wav is only 40ms long there is no sense in converting to Ultra-Res as you won't gain anything. Over 80 ms is desirable. The maximum length supported is 170 ms or so. Anything longer than that is truncated to 170 ms." source
"To get the optimum results the length should be 170 ms or more. As the length gets shorter you'll lose information. However there may not be any information to lose. It all depends on the IR. I've seen long IRs where only the first 100 ms or so is actual information and the rest is silence. OTOH I've seen 100 ms IRs where there is obviously more information but it got truncated. You lose nothing with Ultra-Res except the ability to change the size of the cabinet. You gain better sound and less CPU." "You can't mix Ultra-Res IRs as the data is not compatible. However... we foresaw that and the UltraRes conversion process produces two files: a .ir file and a .syx file. The .ir file is the raw IR data that can be imported into CabLab for mixing purposes. So CabLab can take .wav, non-Ultra-Res .syx and .ir files as input to the mixer section and product Ultra-Res .syx files." "The .ir files are included with our cabinet packs. We will not be offering .wav files. If you have the .wav file you don't need the .ir file. A .ir file can ONLY be used with CabLab. If you use the Axe-Fx II to capture IRs it will only generate .ir and/or .syx files. No .wav files are generated. The resulting data can only be used on Fractal Audio products." source
"It depends on the IR. Ultra-Res improves low-frequency resolution. It is very apparent with some IRs and virtually inaudible with others. It all depends on the low-frequency formants in the original IR. If there are significant, high-Q formants Ultra-Res will preserve those whereas conventional, short IRs will not. Audibility also varies with the amp being used. The difference is more audible with high gain as this will excite the formants more. Low-frequency formants vary with the type of cabinet and speaker. Some cabinets have a smooth low frequency response. Others have prominent formants. The mic also has an impact. Some mics will accentuate the formants. The room also contributes if it has strong LF modes. Furthermore some people like to capture an IR using a tube power amp. In this case you WILL get a significant formant at the low-frequency resonance of the speaker. A conventional IR will not capture that as the Q of the formant will exceed the resolution of the IR. Ultra-Res will capture that formant as Ultra-Res has 8 times the low-frequency resolution. Those who claim they can't hear a difference are correct. They can't. It's nothing to be ashamed of. But because they can't doesn't mean others also cannot. I can clearly hear the difference but I've trained myself on what to listen for. I vastly prefer Ultra-Res and only use Ultra-Res IRs in my personal patches (aside from the TV Mix, which is just a magical IR)." source
"The length of the sweep only determines the signal-to-noise ratio. If the room is completely silent the sweep can be infinitely short (an impulse). To overcome ambient noise you need more energy in the applied stimulus. With an impulse you can only increase the power so much before the amplifier or the speaker or the mic or the preamp, etc. distort. However, if you spread that power out over a longer period of time you can increase the energy and therefore increase the SNR. Think of it this way: a 1 ms pulse at 1000W has the same energy as a 1 second pulse at 1W. Now you can't just put a 1 second pulse into a system because the pulse has little frequency content. A 1 second sweep over the band of interest allows the transfer function (IR) of the system to be obtained via deconvolution. There are other signals you can use like pseudo-noise and MLS sequences but a "chirp" has the best characteristics. In the early days of room IR capturing they used impulses generated by popping a ballon, firing a starting pistol or clapping two boards together. The results were poor due to low SNR. This lead to the development of signals that have higher energy. To get the IR of a room long sweeps are typically used because there is a lot of ambient noise and the "returned signal" is weak (the reverb portion of the response is very low compared to the direct signal). When close mic'ing a speaker the ambient noise is low and the signal strength is very high so a short sweep is adequate. In fact you could probably get away with 100 ms or less in a studio environment." source
"I've never seen a cabinet IR (and I've examined thousands) that has any significant content beyond 150 ms or so. Most cab IRs are under 40 ms. The exception to this would be a "room IR" where the mic is very far from the speaker and the room is significantly reverberant. But one wouldn't normally use that as the primary tone, instead to add a little ambience to the tone and the loss of information would be imperceptible in context. Modeling products typically use IR lengths of 1K samples as this covers 90% of IRs ever captured. We support 2K and Ultra-Res (which is equivalent to 8K) which covers 99% of IRs. The amount of CPU power required to process an IR is proportional to the length of the IR. To support a 500 ms IR (24,000 samples) would require over ten times the CPU power of a 2K sample IR. It also requires over ten times the memory for storage. Given that that vast majority of IRs do not have any information beyond 40 ms it is wasteful of CPU and memory resources to support IRs longer than 2K." source
"The length of time you hold a chord is irrelevant. The impulse response of a speaker cab is typically much less than 100 ms. Only when there is significant room reflections is the length greater. Then you get into the whole argument of whether the IR should contain any room information." source
(about Tone Matching and Ultra-Res) "In Realtime mode the raw internal IR length is 8K which you can dump." source "You can export the Tone Match to CabLab and create and Ultra-Res IR." source
(source)
"I'm a huge advocate of longer IRs. In fact I think I was the first to advocate it despite all the naysayers. I pushed OwnHammer (and others) to increase their IR lengths and they were the only ones who acted on that advice (so far, maybe the other guys will start to follow suit). Ultra-Res was born out of the desire for longer IRs.
For recording you don't need to use the cab block in the Axe-Fx though. Record the raw amp sound and then "re-cab" it later. This way you can try different cabs. Cab-Lab is great for this. Cab-Lab does not do UltraRes processing. It creates UltraRes files for the Axe-Fx but it does all processing at the full IR length up to 8K samples. You can use other convolution plug-ins as well.
The reason for UltraRes is that long IRs have several drawbacks:
- They require lots of storage space. Not an issue on a computer but on a hardware product that means expensive non-volatile memory.
- They require lots of processing power if you don't want any latency. On a computer it doesn't matter since latency is a non-factor if you are processing prerecorded tracks. On a hardware product we must have zero latency.
So UltraRes was devised as a way to exploit the statistics of the data to give the benefits of longer IRs without the usual hardware drawbacks.
In my tests I've found that 8K samples (170 ms) is more than enough. I think 500 ms (24K samples) is overkill and if an IR has significant energy out that far then it has too much room in it. The speaker and cab itself are never more than 100 ms, usually much less. Anything beyond that is the room. I personally don't like IRs with lots of room in them. A little bit of early reflections are nice and make things sound less direct but too much room makes the sound get lost in the mix.
There's no meaningful data beyond 150 ms and if there is, it's the room and you don't want that much room."
(Ultra-Res 2.0) "No big deal, just some improved processing algorithms. The UltraRes cabs in Quantum 2.0 were all reprocessed with UltraRes". source
"UltraRes 2.0 is the next level of evolution for our patent-pending speaker simulation technology, with even greater accuracy than the original version. UltraRes 2.0 cab files are backwards compatible with previous Axe-Fx and AX8 firmwares supporting UltraRes 1.0." source
"1" from the speaker is the near field. The response of a speaker in the near field is very different than the response in the far field. In the near field the response changes (drastically) across the face of the transducer. Even moving the mic a fraction of an inch will result in a very different sound. 10 ft. from the speaker is the far field and the response changes smoothly as you move across the field. If the near field were the same as the far field then the sound wouldn't change as you moved the microphone and you could place the microphone anywhere on the face of the speaker. Anyone who has mic'd a speaker knows that this isn't the case." source
"The energy of the speaker itself is contained in less than 50 ms. Anything beyond that is room reflections. Therefore any differences that you may hear are room reflections. The question then becomes do you want room reflections. Some say yes, some say no. One approach is absolutely no room reflections and then you add them with room simulation. The other approach is to use longer IRs with reflections in them. Both are valid approaches. Close mic'ing minimizes room reflections as the direct path is much shorter than the reflections path and sound pressure decreases by the square of the distance." source
IR sampling rate
IRs are tied to the sampling rate of the hardware. This is 48 kHz (fixed) in Fractal Audio devices.
Different IRs for different processors
The Axe-Fx and AX8 models have different sysex IDs and therefore require different impulse responses..
IRs for the Axe-Fx Standard/Ultra must be converted to be able to use these with the later generation Axe-Fx. It's no use converting 1024-point IRs to 2040 points because they don't contain the necessary data. You need an original WAVE file of sufficient length to create a 2040 point IR.
Near-field and far-field IRs
Most speaker IRs represent the tone of a speaker that was recorded with the microphone close to the speaker, aka "near-field" or "close-miked".
"Far-field" IRs represent the sound of a speaker that was captured at a longer distance. There are a couple of far-field IRs among the stock cabs, created by Jay Mitchell ("JM"). These IRs are better suited for recreating the "amp-in-the-room" sound of a traditional guitar speaker.
More information about close-miking
Minimum Phase Transformation (MPT) and Auto Trim
An IR can be manually or automatically edited.
Auto Trim – Removing superfluous silence from the start of the IR. Most commercial IRs do not require this. But it might come in handy when shooting your own IRs.
"There is no wrong place to trim. It's impossible to know where the data starts because of noise. So we find where the data starts to increase, back up a few samples and trim there." source
Minimum Phase Transformation (MPT) – Verifies that the phase of the IR causes no issues when mixed with other (MPT) IRs, by time-aligning the IR. This is especially important when you mix multiple IRs and you don't want to align them manually in Cab-Lab, in the DAW or on the Axe-Fx III hardware. All stock cabs are MPT (Minimum Phase Transformed) to make them mix-compatible.
When neither Min Phase or Trim has been applied, the impulse response is considered "raw", containing the original phase details.
Cab-Lab and Axe-Edit let you apply MPT or Auto Trim when importing files, manually or automatically.
"If it's a cab IR the difference is basically nil because a speaker is a minimum-phase device. All minimum-phase does in this case is automatically remove the leading silence. A room IR is not minimum phase so you should not use MPT when processing a room IR." source
"The factory cabs are minimum phase for precisely the reason that mixing non-minimum phase leads to phase problems." source
"All the factory cabs are minimum-phase transformed so they are, by definition, "in phase" with each other." source
"So I've been doing a lot of critical listening the last couple months and have come to the conclusion I like non-minimum phase IRs better. The difference is subtle. They don't really sound that different but there are differences in the attack and in the feel. They just sound/feel a little more open and realistic. Another thing is that they mix very differently. It's less predictable but more natural. The caveat is that it's like mixing real mics, you need to experiment moving each mic in and out whereas with minimum phase you can usually just leave one mic at zero and move the other in and out. So here's a zip file of my favorite IR session, the Wellspring session, in non-min phase format for use with the Axe-Fx III. My suggestion is to put them in one of the user banks and compare with the factory min-phase versions. Note that names are a bit different but you should be able to figure it out." source
"Minimum vs. non-minimum phase changes the "delay" of the individual frequency components of the waveform. In a minimum phase system the individual sine waves have the least phase possible which concentrates the energy near the start of the waveform. For example consider a sine wave with an isosceles triangle envelope. The energy is concentrated at the center of the waveform (at the apex). The Fourier transform of that is mostly the primary frequency with a bunch of other sine waves at various amplitudes added. We can phase shift the component sine waves and the magnitude (frequency response) will not change but the waveform will. If we make it minimum phase the sine waves will add up so that the energy is concentrated at the beginning of the waveform and the waveform will then look something like a sine wave with a right triangle (ramp down) envelope." source
IR Capture: create your own IRs
Read this: IR Capture
IRs versus Tone Matching
"The IR is vastly more important. Tone Matching is a nifty feature and certainly useful but you'll get far more satisfaction by concentrating on capturing good IRs. The single most important aspect of recording guitar amps is micing the amp. Therefore the single most important aspect of using your Axe-Fx is the IR. People are too hung up on "matching" or "profiling" an amp but fail to realize that when you are doing that you are basically capturing an IR. If you capture the IR separately now you have an IR that is fully separated from the amp and therefore can be used with all models. Matching and profiling cannot mathematically separate the amp's frequency response from the cabinet frequency response. Once you do this you'll be surprised at how accurate the amp models are. I do this all the time and find Tone Matching is unnecessary now (in fact many of the amp models have had their built-in matching data removed in the latest firmware). Any differences between the model and the real amp are so minuscule as to be immaterial. A little tweak of the tone stack or EQ is usually enough to remove and differences. Besides, once you get into mixing you'll realize that you'll be applying EQ anyways so tiny differences in EQ are irrelevant. Moving the mic just a small amount drastically changes the sound. The best producers have mastered micing. You can only fix so much via EQ since EQ is essentially painting with a broad brush where mic technique is akin to using a fine-point brush." source
Acoustic sounds
To emulate acoustic instruments (acoustic guitar, cello, violin etc.), an IR of an acoustic body improve the sound. You can find some on Axe-Change.
Acoustic sounds benefit from long IRs with some room ambience, so UltraRes IRs are preferred.
Future of IRs
"The Axe-Fx III (and II) actually capture the Volterra kernels when doing an IR capture (it's hidden in the firmware for possible future use). I've studied dozens upon dozens of them and the kernels above first order (the first order kernel is the linear IR) are so small as to be inaudible. The distortion from an amp is orders of magnitude greater even when using a clean amp. The only significant nonlinear thing I've measured that speakers do is thermal compression (that we model already) and "cone cry" which sounds like sh*t. Jay Mitchell is probably the leading authority on speaker design and he has stated pretty much the same thing. I'm all for improvements but they need to be real improvements. I've sat here countless times comparing an IR to the actual speaker with a mic on it doing blind A/B tests and can NEVER tell the difference and I think my ears are pretty good. I dug through my Matlab stuff and found this. (graph) This is an IR of a speaker taken twice. The first time the drive level is around 1W (in red). The second time the amp was turned way up, I would estimate at least 50W (in blue). As you can see the difference is extremely small. There's a small difference from 10 Hz and down which is way below the reproduction range of any system and a difference way up at Nyquist (24 kHz) but that's 100 dB down (!). Furthermore we don't know if the tiny differences are from the speaker or from the amp or the mic or the mic preamp. I should add that speakers can and do distort (when Xmax is exceeded) but it's not a pleasant sound. Since the displacement of the cone is the inverse of the frequency the low frequencies are distorted which is the opposite of what you want when creating "pleasing distortion". Speaker distortion is flubby, flabby and farty. The Axe-Fx II and III can simulate that, if desired, using the Speaker Drive parameter in the Amp block. I always set it to zero. There are probably some other modes that cause distortion but, again, these are dwarfed by the distortion of the amp. The only other significant one I've experienced is cone cry. Manufacturers go to great lengths to prevent it from happening. I have a speaker here that does it. Whenever I play a high F it cries and it's annoying." source
Commercial and free IRs
The processors contain a lot of factory cabs. More information here.
You can also create your own IRs or get additional ones from sources such as listed below:
- Axe-Change
- Fractal Audio Cab Packs: here and here
- OwnHammer
- ML Sound Lab
- Red Wirez
- CabIR.eu
- Dr. Bonkers Soundlab
- Valhallir.at
- forum discussion
- free UltraRes IRs
Fractal Audio provides a couple of free IR libraries:
- Wellspring Sound (Vol. 1)
- Wellspring in non-MPT format (source)
- Mad Oak Studio
- IRs from a session with Larry Mitchell
Some commercial manufacturers provide free impulse responses. There are also quite a few popular impulse responses available in the public domain. Some examples:
- "gtrbody": adds the resonant sound of an acoustic guitar body (Axe-Change).
- "cello_body": adds the resonant sound of a cello (Axe-Change).
- GuitarHack ThisOne V30 (Axe-Change).