Jump to content
  • Pure Vinyl Club
    Pure Vinyl Club

    Digital Vinyl: Temporal Domain

    Note : The following article contains information that has been deemed incorrect by leading digital audio engineers. I attempted to corroborate the findings of this article by asking several digital audio experts. I was unable to find anyone who could back up the statements made, with any scientific data or theory. Consider the following article retracted.

     

    I am leaving the text of this article up on CA because it has enabled a good discussion to take place. By leaving it up, people can read what was claimed and read the followup arguments that the prove it incorrect. To remove the article completely only opens up a space for this to happen again, and again, and again.

     

     

    I take full responsibility for the publishing of this article. I should have had a technical editor check it before publication. I apologize to the CA Community for the error in judgement.

     

    - CC.

     

     

     

    1-Pixel.png

    Temporal Domain of Signal, or What is More Important for Listening to Music, Static or Dynamic Characteristics of the Sound Signal?

     

    Every time my audiophile friends, who do not have an analog setup (TT), come to me and see huge piles of expensive, rare LPs, they get puzzled. They wonder, how can it be that LP lovers spend huge amounts of money on their "analog" hobby, while suffering such discomforts when listening to music. They say this method of listening in the 21st century is absolutely impractical. In addition, there are signal distortion and limitations in many of the technical aspects of vinyl.

     

    In response, I always say the same thing in support of analog - it's mainly because of the time domain signal. We (fans of analog audio) are willing to make these sacrifices and inconveniences for much better performance in a time aspect, the so-called dynamic characteristics. Static characteristics, those belonging to the spectral and dynamic domains (Dynamic Range, THD + N, Frequency Response, etc.) certainly are important for high-quality sound, but when it comes to listening to music in real time, in my opinion, it is the dynamic characteristics that matter most

     

     

    Often, in response to my comments, people react with skepticism. They say they are used to trusting technical information that can be measured and compared and what I say is very subjective and ephemeral.

     

    Also viewing comments here on СA, especially those connected with the current topics such as MQA, I have noticed that some members react rather skeptically to the arguments about MQA's improvements of characteristics in the time-domain. And, some even question the very existence of such improvements.

     

    Here it is shown that "High-resolution in temporal, spatial, spectral, and dynamic domains together determine the quality value of perceived music and ,sound and that temporal resolution may be the most important domain perceptually". Temporal resolution, is actually what I would like to briefly discuss with you.

     

    There's a deeply rooted opinion that frequency above 10 kHz, and moreover above 20 kHz, contains a small amount of music information. And yet research shows that, for example transients from cymbals contain significant frequency components extending even above 60 kHz. The trumpet playing fortissimo has transients components up through 40 kHz, and in the case of the violin even temporary frequency of 100 kHz occurs.

     

    As you can see quite a lot of music information is contained in frequencies above 20 kHz. Of course, immediately a question is raised: "Are we able to hear it?". To answer this, it is worth mentioning some rarely discussed issues. Commonly cited audibility up to 20 kHz frequency is derived from conventional hearing tests, which are based on the audibility of simple sounds. But there is an alternative look at the issue from the more "dynamic" side. This is the temporal resolution of the ear, not the "static" harmonic content and audibility of pure sinusoidal tones.

     

    This may be more appropriate in a case of music signals than the prospect of simple tones. The actual music signals have a very complex structure as a result of the imposition of the attack and decay of many instruments. More importantly, their frequency spectrum is very different between the short period of the initial attack, or the rise of sound, eg. as a result of pulling a string or striking a key of a piano, and the subsequent, much longer sound decay.

     

    There is a large group of instruments, which are characterized by a very "transient," dynamic nature of the initial attack phase of the sound. Xylophone, trumpet, cymbals and striking a drum achieve dynamic levels in between 120 and 130 dB within 10 ms or less. One thing we can say for sure, it is not possible for a CD-quality sample scattered at 22.7ms to have an opportunity to correct the commissioning attack phase of musical instruments, which are half the distance between two consecutive samples.

     

    And the attack phase is very important for audio reception. In experiments, in which the samples were of wind instruments dissected in a way that combined the short attack phase of one instrument with a longer sound decay of another one, listeners identified the sound that of the instrument with the short fragment attack, not the longer decay sound.

     

     

    image2.png

     

    The sound wave graph from a cymbal being struck by a stick. The sound increase is nearly instantaneous, followed by a long sustain of a rather uniform nature. - from highfidelity.pl

     

     

    When viewed from the hearing mechanism perspective, you can find information indicating that the signals which have pulsing character (i.e., generally transients), in contrast to simple tones, activate significantly larger areas of hearing cells than pure sinusoidal tones (which, in nature are almost non existent). In the case of pulses, the possible temporal resolution of the human ear may be up to 10 microseconds, corresponding to frequencies of 100 kHz.

     

    This information is also confirmed in the opinion of recognized practitioners. Art Dudley from "Stereophile" magazine, in an interesting interview from The Editors cycle, is of the opinion that the Nyquist frequency does not apply while there are working decimation and reconstruction filters of complex music signals. In his opinion, two samples may be used to describe a single frequency, but do not provide sufficient density samples to describe the speed at which the signal increases or decreases. This is crucial to distinguishing between music and ordinary sound.

     

    Also I would like to quote, in the context of the above information, an excerpt from my correspondence with Dr. Rob Robinson:

     

    "My thoughts are that with extended frequency response you are not capturing "audible" frequencies but rather preserving the critical time relationships in the music at all frequencies. Human hearing might not be able to "detect" sounds above 15 - 20 kHz or so, but on the other hand hearing, in conjunction with the brain, is very sensitive to temporal information. It's been reported that the human auditory system is capable of discerning temporal differences of tens of microseconds or less (and note, at 192 kHz the time between samples is 5 microseconds). This temporal discrimination is the reason we are able to accurately discern directional / spatial cues. Hearing evolved so that the location of threats, e.g., the cougar about to pounce, could be determined accurately, as key to survival. The spatial information comes not only from amplitude, but the time difference between the same sound arriving at each ear. And the more sensitive hearing is to temporal information, the more accurately that spatial cues can be located.

     

    A CD format brickwall filter will affect time relationships, part of the reason that CD format digital audio may sound less "natural" than analog (or live sound). Preserving temporal information is key to preserving lifelike sound and imaging. While all digital audio will affect temporal information, the influence diminishes the higher the sample rate, because the antialiasing and reconstruction filters are operating at ultrasonic frequencies. So, by using higher sample rates, even though we may be recording sounds that are inaudible, we have better preservation of the temporal information in the signal, which conveys a more lifelike presentation of the music. Besides using a high sample rate to capture the signal, we also have the ultra wide 5,000 kHz bandwidth (five thousand kilohertz, as contrasted with "just" 20 kilohertz as the generally accepted audible upper frequency limit) of the Seta preamplifier which again faithfully preserves temporal relationships in the music signal (internally, the front end circuitry has a risetime of less than 50 nanoseconds)." - Dr. Rob Robinson

     

    If we take into consideration the typical technical parameters of audio, which is mainly bandwidth and dynamics (signal-to-noise ratio), we can easily come to the conclusion that, omitting the variables associated with the physiology of hearing, audiophile devices should not differ from each other, and moreover sonically stand out in relation to the audio devices from the mass market.

     

    And yet, there are people willing to pay much higher prices for equipment and the typical specifications are often similar or even slightly worse than the cheaper devices of the mass segment.

     

    Most importantly, in many cases audiophiles agree on the description of the main attributes of the sound of the given device, although expressed in a specific descriptive dictionary, and not in strict technical parameters.

     

    This raises a difficult to challenge conclusion that if some audiophile characteristics are consistently perceived by a large number of people there's a good chance that behind this stands specific physical phenomena, though their nature can be complicated and can be difficult to express in simple numerical parameters, eg. dynamic range or frequency response.

     

    What may these phenomena be? If the key to the mystery lies not in the parameters of the frequency domain (frequency response) or dynamics (noise at a low level), then a single area remains, and that's phase issues, or timing aspects of the sound. In fact, these are the most fundamental parameters of the sound signal, because they underlie its creation, what a sound wave actually looks like in the time domain. The question is how much of the sound wave graph corresponds to the wave reaching the microphone registering this recording.

     

    The nuances of the tonal colors, to the greatest extent, are shaped by the sound wave characteristic from each instrument. And, it's not just a simple analysis of the contents of the so-called harmonics but more of dynamic aspects, mainly the so-called attacks, or the rising of sound at the moment of its creation. It is not difficult to imagine that the course of the rise in amplitude of the sound will be quite different for wind, string and plucked instruments. It's a very fine structure of transients, which over a very short period time, this new tone of a musical instrument provides the bulk of information about its color and texture. Studies show that the human ear is most sensitive to the initial part of the pulse of a new musical sound.

     

    Any disturbance or contamination of this sensitive time structure leads to a noticeable loss of sound quality from the perspective of people sensitive to audiophile aspects, such as nuances in fidelity transmission of all the colors of musical instruments.

     

    In other words, the time domain signal (issues phase, or timing aspects of the sound). In fact, these are the most fundamental parameters of the sound signal, because they lie in its creation - thus what a sound wave in the time domain actually looks like.

     

    So, one of the main advantages of vinyl is the lack of restrictions of temporal resolution in LP. One of the key challenges for us in the Pure Vinyl Club was to find a way (technology, a method of recording) the equipment to maintain a maximum level of temporal resolution from the LP while recording in digital. This does not mean that we were going to compromise or neglect other characteristics which are also important for the sound.

     

    Paweł Piwowarski in his article "PLIKI HI-RES - niezbędny krok do nirwany czy nadmiarowy gadżet?" on High Fidelity.pl in the October 2016 issue to which I referred above, noted that "The trumpet playing fortissimo contains transients of 40 kHz". I invite you to watch this little video using our LP rip, which clearly shows that transients of the trombone can get higher than 50 kHz, and trumpet reaches almost 70kHz!

     

    Later, in one of the following articles, which might be called "What is actually recorded on LP" I will showcase many interesting videos and screenshots, which clearly show that in many musical instruments transients exceed the 40-50 kHz threshold, and among them will be some unexpected ones (contrabass and sibilance of the human voice).

     

    Also, many audiophiles have prejudices about the LPs Dynamic Range. Here's a screenshot of the DR of an album's full side (Duration: 24:07, RAW Record).

     

     

     

    screnshot-DR.jpg

     

     

     

    I will focus on these and other interesting LP aspects in more detail in in the next articles of the Digital Vinyl series.

     

     

    Thank you,

     

    Igor

     

     

     

     

     

    Sound Samples

     

     

    Trippin (Kenny Drew – Trippin (1984, Japan) Promo WL, Baystate (RJL-8101))

    Official DR Value: DR13, Gain Output Levels (Pure Vinyl) – 14.00dB, Edit “Click Repair” – yes

     

    192 kHz / 24 bit (103MB)

     

     

     

    Play Fiddle Play (Isao Suzuki Quartet + 1 – Blue City (1974, Japan) Three Blind Mice (TBM-24))

    Official DR Value: DR13, Gain Output Levels (Pure Vinyl) – 8.02dB, Edit “Click Repair” – yes

     

    192 kHz / 24 bit (113MB)

     

     

     

    Make Someone Happy (Carmen McRae – Live At Sugar Hill San Francisco (1964, USA) Time Records (S/2104))

    Official DR Value: DR14, Gain Output Levels (Pure Vinyl) – 7.23dB, Edit “Click Repair” – yes

     

    192 kHz / 24 bit (99MB)

     

     

     

    Early In The Morning (John Henry Barbee, 1963

    VA – The Best Of The Blues (Compilation) (RE 1973, West Germany) Storyville (671188))

    Official DR Value: DR14, Gain Output Levels (Pure Vinyl) – 10.63dB, Edit “Click Repair” – yes

     

    192 kHz / 24 bit (75MB)

     

     

     

    La Cumparsita (Werner Müller And His Orchestra – Tango! (1967, USA) London Records (SP 44098))

    Official DR Value: DR11, Gain Output Levels (Pure Vinyl) – 0.00dB, Edit “Click Repair” – yes

     

    192 kHz / 24 bit (104MB)

     

     

     

    Wild Is The Wind (The Dave Pike Quartet Featuring Bill Evans – Pike’s Peak 1962 (RE 1981, USA) Columbia (PC 37011))

    Official DR Value: DR12, Gain Output Levels (Pure Vinyl) – 10.31dB, Edit “Click Repair” – yes

     

    192 kHz / 24 bit (109MB)

     

     

     

    People Are Strange (The Doors – 13 (1970, USA) Elektra (EKS-74079))

    Official DR Value: DR11, Gain Output Levels (Pure Vinyl) – 0.00dB, Edit “Click Repair” – yes

     

    192 kHz / 24 bit (82MB)

     

     

     

    Let’s Groove (Earth, Wind and Fire – Raise! (1981, Japan) CBS/Sony (25AP 2210))

    Official DR Value: DR15, Gain Output Levels (Pure Vinyl) – 7.89dB, Edit “Click Repair” – yes

     

    192 kHz / 24 bit (108MB)

     

     

     

    Smooth Operator (Sade – Smooth Operator (1984, Single, 45rpm, Japan) Epic (12・3P-581))

    Official DR Value: DR13, Gain Output Levels (Pure Vinyl) – 7.15dB, Edit “Click Repair” – yes

     

    192 kHz / 24 bit (114MB)

     

     

     

    Fernando (Paul Mauriat – Feelings (1977, 45rpm, Japan) Philips (45S-14))

    Official DR Value: DR14, Gain Output Levels (Pure Vinyl) – 5.76dB, Edit “Click Repair” – yes

     

    192 kHz / 24 bit (112MB)

     

     

     

    1-Pixel.png




    User Feedback

    Recommended Comments



    Would you say "various forms of distortion" are characteristic of the DSD recordings on offer? What about audibility of such distortions if/where they do exist?

     

    With a decent modulator, the distortion is well below audible levels, at least with realistic inputs.

    Share this comment


    Link to comment
    Share on other sites

    Sorry I was referring to the Grimm white paper.

     

    My impression has been that Lipshitz was referring to the ADC and storage stages at least primarily. Hard to know how many chip based DACs use 3rd order modulators as they don't typically publish their internals.

    Why do you bring up the Grimm paper when everybody else is talking about Lipshitz? Or did I misunderstand something?

     

    The conclusions of Lipshitz are applicable to any SDM regardless of position in the production chain.

    Share this comment


    Link to comment
    Share on other sites

    With a decent modulator, the distortion is well below audible levels, at least with realistic inputs.

     

    Out of curiosity, how many bits are the Sox modulators? (I know they're open source, and I've looked at the source, but since I'm not a programmer I may as well not have.)

    Share this comment


    Link to comment
    Share on other sites

    Out of curiosity, how many bits are the Sox modulators? (I know they're open source, and I've looked at the source, but since I'm not a programmer I may as well not have.)

    1 bit, obviously. That's what DSD is.

    Share this comment


    Link to comment
    Share on other sites

    1 bit, obviously. That's what DSD is.

     

    You are remarkably sanguine for someone who has loosed on the world's Sox, Daphile, and A+ users "1-bit modulators of 3rd order or higher [that] are inherently unstable and exhibit various forms of distortion." ;)

    Share this comment


    Link to comment
    Share on other sites

    You are remarkably sanguine for someone who has loosed on the world's Sox, Daphile, and A+ users "1-bit modulators of 3rd order or higher [that] are inherently unstable and exhibit various forms of distortion." ;)

    Such is the nature of the beast. Blame Sony and Philips.

    Share this comment


    Link to comment
    Share on other sites

    Why do you bring up the Grimm paper when everybody else is talking about Lipshitz? Or did I misunderstand something?

     

    The conclusions of Lipshitz are applicable to any SDM regardless of position in the production chain.

     

    There were several issues being discussed and different papers. I'm responding while waiting for some timing analyses to be done i.e. multitasking -- oh well..

     

    You stated that DACs are mostly multibit "for a reason" -- a NOS DAC which accepts DSD input generally has no reason to do SDM, and the inputs I know of are 1-bit ... one could transmit multibit over the network or USB using a proprietary protocol but not one known to ALSA for example ... so I am not understanding why the Lipshitz issue generally affects the DAC? (unless the DAC is upsampling or converting from PCM)

    Share this comment


    Link to comment
    Share on other sites

    There were several issues being discussed and different papers. I'm responding while waiting for some timing analyses to be done i.e. multitasking -- oh well..

     

    Timing analyses, something related to this thread?

     

    You stated that DACs are mostly multibit "for a reason" --

     

    The vast majority of audio DACs use multi-bit SDM internally for the simple reason that it gives the best results.

     

    a NOS DAC which accepts DSD input generally has no reason to do SDM, and the inputs I know of are 1-bit ... one could transmit multibit over the network or USB using a proprietary protocol but not one known to ALSA for example ... so I am not understanding why the Lipshitz issue generally affects the DAC? (unless the DAC is upsampling or converting from PCM)

     

    I'm not sure what you're trying to say here. NOS PCM DACs are entirely outside this discussion. If the signal is at any point in 1-bit format, something must have created it, most likely a sigma-delta modulator subject to the issues raised by Lipshitz.

    Share this comment


    Link to comment
    Share on other sites

    There were several issues being discussed and different papers. I'm responding while waiting for some timing analyses to be done i.e. multitasking -- oh well..

     

    You stated that DACs are mostly multibit "for a reason"

     

    The vast majority of audio DACs use multi-bit SDM internally for the simple reason that it gives the best results.

     

     

    The central point of confusion. mansr is not referring to "multibit" in the Schiit PCM DAC sense. He's talking about the internal SDM stage.

     

     

    Here's Wikipedia about a similar thing on the A/D side:

     

     

    Because it has been extremely difficult to carry out DSP operations (for example performing EQ, balance, panning and other changes in the digital domain) in a one-bit environment, and because of the prevalence of solely PCM studio equipment such as Pro Tools, the vast majority of SACDs—especially rock and contemporary music, which rely on multitrack techniques—are in fact mixed in PCM (or mixed analog and recorded on PCM recorders) and then converted to DSD for SACD mastering.

     

    To address some of these issues, a new studio format has been developed, usually referred to as DSD-wide, which retains the high sample rate of standard DSD, but uses an 8-bit, rather than single-bit digital word length, yet still relies heavily on the noise shaping principle. DSD-wide is PCM with noise shaping—and is sometimes disparagingly referred to as "PCM-narrow"—but has the added benefit of making DSP operations in the studio a great deal more practical. The main difference is that "DSD-wide" still retains 2.8224 MHz (64Fs) sampling frequency while the highest frequency in which PCM is being edited is 384 kHz (8Fs). The "DSD-wide" signal is down-converted to regular DSD for SACD mastering. As a result of this technique and other developments there are now a few digital audio workstations (DAWs) that operate, or can operate, in the DSD domain, notably Pyramix and some SADiE systems.

    Share this comment


    Link to comment
    Share on other sites

    I'm pretty sure, though, that given the nature of sound they are a bunch of nice sine waves

    One thing further to what jabbr said, for our masked vegetable friend ;) :

     

    "Periodic" in the sense of sine wave-like signals is also not necessary. The Sampling Theorem also works fine for stuff that doesn't produce nice sine waves, like percussion or inharmonic attacks, such as the pluck of a string or (for brass players) "tonguing."

     

     

    Sent from my iPhone using Computer Audiophile

    Share this comment


    Link to comment
    Share on other sites

    "assuming the signal is bandwidth limited to 20 kHz* 44 kHz is adequate to capture any transients that exist *in that signal*, to answer your question. Not silly." Indeed. And you can go up to 22K and still be kosher per sampling theorem. But if there are cymbals and a up to 44K bandwidth you need a 88K sampling, don't you ?SN theorem certainly is guaranteed theorem quality ; that actual music can perfectly fit a 22K bandwidth is much less certain. Thus the need for larger bandwidth hence sampling rate to capture reality.

     

    What I'm not sure of is the equivalence of a limited to start with 20K bandwidth capture with a 2O/20K bandwidth loudspeakers output

    Let me try to explain. The Fourier Transform itself is defined for continuous signals (without sampling). It states that for *any* continuous time dependent signal, there exists a transform into the frequency/phase dimension such that the signal can be described *only* by its constituent frequency/phase coordinates -- that means that *any* time dependent signal can be described entirely by a set of sinusoidal waves having a defined frequency, amplitude and phase.

     

    This is mathematically true for every signal. *Why* its true involves some math: https://en.wikipedia.org/wiki/Fourier_transform but trust that this is true.

     

    Now there is the discrete fourier transform (DCT) which is defined for discrete samples. This is equivalent to the general fourier transform when the sampling rate is infinite. So what to do? In practice suppose we take samples in the Ghz range. We can then examine the discrete fourier transform and suppose we see that *for that specific signal, e.g. a jazz band* that the amplitudes of the frequency components >20 kHz are *zero*. That tells us that *that particular signal* is bandwidth limited to 20 kHz. Now suppose we hit the cymbal ? is there a component > 20 kHz ... let's say there is, now for the sake of discussion up to 40 kHz and then zero above 40 kHz. That means that we can capture the transients in that signal by sampling at say 88 kHz. Or if there are components at 70 kHz then 176 kHz will contain those transients.

     

    Do you follow?

     

    So you can just look at the discrete fourier transform of the recording and see what the minimal sampling rate would be to capture any transients or *anything else* in that signal. You can determine the actual bandwidth of that signal.

     

    Shannon-Nyquist https://en.wikipedia.org/wiki/Nyquist%E2%80%93Shannon_sampling_theorem just tells us what the sampling rate *needs to be* in order to capture the full fidelity of any particular signal (given its particular bandwidth).

     

    So again *assuming the signal is bandwidth limited to 20 kHz* 44 kHz is adequate to capture any transients that exist *in that signal*, to answer your question. Not silly.

    Share this comment


    Link to comment
    Share on other sites

    Timing analyses, something related to this thread?

     

    FPGA, clocks, network, DAC so peripherally.

     

    I'm not sure what you're trying to say here. NOS PCM DACs are entirely outside this discussion. If the signal is at any point in 1-bit format, something must have created it, most likely a sigma-delta modulator subject to the issues raised by Lipshitz.

     

    I meant NOS DSD DAC and the DAC just converts what it's given both good and bad ;)

    Share this comment


    Link to comment
    Share on other sites

    I meant NOS DSD DAC and the DAC just converts what it's given both good and bad ;)

     

    The term NOS typically refers to PCM DACs. DSD is already oversampled.

    Share this comment


    Link to comment
    Share on other sites

    I'm pretty sure, though, that given the nature of sound they are a bunch of nice sine waves

     

    Nope. Please read about what "inharmonic" means. Inharmonicity is characteristic of various instrumental attacks, and of a lot of percussion instruments. If you do Fourier analysis, it shows that all these sounds are built from sine waves, but without the analysis that isn't at all apparent - on a scope they don't look like "nice sine waves."

     

     

    Sent from my iPhone using Computer Audiophile

    Share this comment


    Link to comment
    Share on other sites

    "assuming the signal is bandwidth limited to 20 kHz* 44 kHz is adequate to capture any transients that exist *in that signal*, to answer your question. Not silly." Indeed. And you can go up to 22K and still be kosher per sampling theorem. But if there are cymbals and a up to 44K bandwidth you need a 88K sampling, don't you ?SN theorem certainly is guaranteed theorem quality ; that actual music can perfectly fit a 22K bandwidth is much less certain. Thus the need for larger bandwidth hence sampling rate to capture reality.

     

    What I'm not sure of is the equivalence of a limited to start with 20K bandwidth capture with a 2O/20K bandwidth loudspeakers output

     

    There are (at least) two possibilities:

     

    - All the effects of ultrasonic frequencies are already "baked into" what we can hear. That is, intermodulation - interference - between ultrasonics and what we can hear, and among the ultrasonics themselves, if the result is audible, is in the recording already, and therefore 20KHz is enough.

     

    - Or, in order to have the most faithful rendition, perhaps the interference between ultrasonics and the audible range should be recreated in your room. Note that this would require speakers to be relatively flat to ultrasonic range, the room and speaker drivers to be able to reproduce the interference as it occurred live.... None of this is trivial stuff, if it is even necessary (i.e., if the first alternative above isn't good enough).

     

     

    Sent from my iPhone using Computer Audiophile

    Share this comment


    Link to comment
    Share on other sites

    Glad to see Chris add the note to this article.

     

    I wonder however, is this a good time for one of the engineering advisers to write the proper article. One that lays out the basics of how digital works, what the parameters are. One that compares it to LP, and reel tape. One that shows the ways in which digital PCM is superior. That one can prefer or enjoy these other formats. Just don't resort to myths or fallacies about why that might be the case. It isn't about superior fidelity. An article that illuminates rather than obfuscates.

     

    Wouldn't that be a good article one likely to draw equal interest to this article? One which is doing a service to make clear how this digital audio really works. A light in the darkness of myth and superstition which is all too pervasive. Seems like such an article would be feather in the cap of Computer Audiophile if you ask me.

    Share this comment


    Link to comment
    Share on other sites

    ?? "If you do Fourier analysis, it shows that all these sounds are built from sine waves" Yes

    Nope. Please read about what "inharmonic" means. Inharmonicity is characteristic of various instrumental attacks, and of a lot of percussion instruments. If you do Fourier analysis, it shows that all these sounds are built from sine waves, but without the analysis that isn't at all apparent - on a scope they don't look like "nice sine waves."

     

     

    Sent from my iPhone using Computer Audiophile

    Share this comment


    Link to comment
    Share on other sites

    There's a trivial hypothesis : does a DAC ouputing at say 192 is outputing the same stuff with the exact same timing when it's been fed 44 , that is when some events have not been sampled, rather than 192?

     

    BTW Fremer impaired the US career of the wonderful Cabasse Sphere with considerations that what he was hearing was too good so had to be kind of rolled of sweetened by the (then limited) 16/44 filtering of the Sphere

     

    I use old active studio monitors that deliver 20/20K and are top guns from vinyl/experimental PCM age

    Note that my TEAC 501 I feed DSD 128 and can be 384K fed as well starts to roll off at 20K

    Is it really the same to filter 20/20K at speakers level and to not sample beyond a 20K bandwidth ? Is it the same reconstruction of the actual musical event? at DAC level?

     

     

     

    There are (at least) two possibilities:

     

    - All the effects of ultrasonic frequencies are already "baked into" what we can hear. That is, intermodulation - interference - between ultrasonics and what we can hear, and among the ultrasonics themselves, if the result is audible, is in the recording already, and therefore 20KHz is enough.

     

    - Or, in order to have the most faithful rendition, perhaps the interference between ultrasonics and the audible range should be recreated in your room. Note that this would require speakers to be relatively flat to ultrasonic range, the room and speaker drivers to be able to reproduce the interference as it occurred live.... None of this is trivial stuff, if it is even necessary (i.e., if the first alternative above isn't good enough).

     

     

    Sent from my iPhone using Computer Audiophile

    Share this comment


    Link to comment
    Share on other sites

    There's a trivial hypothesis : does a DAC ouputing at say 192 is outputing the same stuff with the exact same timing when it's been fed 44 , that is when some events have not been sampled, rather than 192?

     

    BTW Fremer impaired the US career of the wonderful Cabasse Sphere with considerations that what he was hearing was too good so had to be kind of rolled of sweetened by the (then limited) 16/44 filtering of the Sphere

     

    I use old active studio monitors that deliver 20/20K and are top guns from vinyl/experimental PCM age

    Note that my TEAC 501 I feed DSD 128 and can be 384K fed as well starts to roll off at 20K

    Is it really the same to filter 20/20K at speakers level and to not sample beyond a 20K bandwidth ? Is it the same reconstruction of the actual musical event? at DAC level?

     

     

    Probably helpful to use numbers instead of abstract concepts, so just for our present purposes, let's say we have some hypothetical instrument that has strong harmonics at 38kHz and 40kHz, and due to interference between the two we get audible energy from the instrument at 2kHz.

     

     

    In order to get that 38 and 40kHz energy through the DAC, we need a sample rate more than double those frequencies, so at least 88.2 or 96kHz, right from the A/D at the recording end through the DAC, which nicely provides the same 38 and 40kHz output that was recorded. Fine, so let's say we've done that, and your electronics pass all those frequencies. But your speakers only have useful response to 30kHz. Whoops, no more 38 and 40kHz energy, and so there's nothing at 2kHz, right? But wait - didn't the DAC also pick up the 2kHz energy along with everything else up to 44 or just under 48 kHz? And your speakers can reproduce that 2kHz just fine. So have we lost anything? I don't know the answer.

     

     

    Now let's say instead of being recorded at an 88.2 or 96kHz sample rate, the input from our hypothetical instrument is band-limited by filtering it to capture just the components up to 20kHz, so we can do A/D with a 44.1kHz sample rate. No 38 or 40kHz energy. But we have captured the result of that interference in the audible band at 2kHz, so the question is exactly the same: Have we lost anything by capturing only the audible result of the interference (2kHz), rather than the energy that created the interference in the first place (38 and 40kHz)? Again, I don't know the answer.

     

     

    The other thing I think you have asked is that assuming we have recorded a signal limited just to the audible band, just up to 20kHz, will sampling at 44.1kHz give us the same timing as sampling at 192kHz? The answer is yes - that's just math, the Sampling Theorem we've been talking about. The interesting issues here don't have anything to do with capturing events that happen between samples. That may not be intuitive, but it's mathematical fact. Rather, the interesting issues have to do with how easily and cheaply you can make an excellent filter for a signal if you sample it at 44.1kHz versus sampling at a higher rate. Sampling at a higher rate makes good filtering easier and cheaper, which is why the vast, vast majority of DACs oversample (in fact, oversampling became pretty well the standard thing shortly after the introduction of CD players, before separate DACs even existed).

    Share this comment


    Link to comment
    Share on other sites

    Your last paragraph is sheer interpretation of my question

    Probably helpful to use numbers instead of abstract concepts, so just for our present purposes, let's say we have some hypothetical instrument that has strong harmonics at 38kHz and 40kHz, and due to interference between the two we get audible energy from the instrument at 2kHz.

     

     

    In order to get that 38 and 40kHz energy through the DAC, we need a sample rate more than double those frequencies, so at least 88.2 or 96kHz, right from the A/D at the recording end through the DAC, which nicely provides the same 38 and 40kHz output that was recorded. Fine, so let's say we've done that, and your electronics pass all those frequencies. But your speakers only have useful response to 30kHz. Whoops, no more 38 and 40kHz energy, and so there's nothing at 2kHz, right? But wait - didn't the DAC also pick up the 2kHz energy along with everything else up to 44 or just under 48 kHz? And your speakers can reproduce that 2kHz just fine. So have we lost anything? I don't know the answer.

     

     

    Now let's say instead of being recorded at an 88.2 or 96kHz sample rate, the input from our hypothetical instrument is band-limited by filtering it to capture just the components up to 20kHz, so we can do A/D with a 44.1kHz sample rate. No 38 or 40kHz energy. But we have captured the result of that interference in the audible band at 2kHz, so the question is exactly the same: Have we lost anything by capturing only the audible result of the interference (2kHz), rather than the energy that created the interference in the first place (38 and 40kHz)? Again, I don't know the answer.

     

     

    The other thing I think you have asked is that assuming we have recorded a signal limited just to the audible band, just up to 20kHz, will sampling at 44.1kHz give us the same timing as sampling at 192kHz? The answer is yes - that's just math, the Sampling Theorem we've been talking about. The interesting issues here don't have anything to do with capturing events that happen between samples. That may not be intuitive, but it's mathematical fact. Rather, the interesting issues have to do with how easily and cheaply you can make an excellent filter for a signal if you sample it at 44.1kHz versus sampling at a higher rate. Sampling at a higher rate makes good filtering easier and cheaper, which is why the vast, vast majority of DACs oversample (in fact, oversampling became pretty well the standard thing shortly after the introduction of CD players, before separate DACs even existed).

    Share this comment


    Link to comment
    Share on other sites

     

    My basic thought is your premise of the question isn't true a majority of the time. It isn't generally true that CDs of analog recordings sound like crap. For one thing I have digitally recorded both reel tape and LP with the result sounding like reel tape or LP. So the digital CD isn't a bottleneck.

     

     

    Me too. Very good digital recordings of needle drops are indistinguishable on playback from the original vinyl - played back on the same system. I've personally experienced this and demonstrated it with others.

     

    A properly done digital recording of vinyl has all those "vinyl like" qualities that vinyl lovers love. So the digital medium itself isn't the culprit or the limitation. I refer all of you to John Atkinson's review of the Ayre ADC, where he said he compared digital conversions of his own analog recordings to the original- in fact compared them "till his ears bled" - and said he couldn't tell them apart.

    Share this comment


    Link to comment
    Share on other sites

    Your last paragraph is sheer interpretation of my question

     

     

    I guessed at what you meant incorrectly, then.

    Share this comment


    Link to comment
    Share on other sites

    Probably helpful to use numbers instead of abstract concepts, so just for our present purposes, let's say we have some hypothetical instrument that has strong harmonics at 38kHz and 40kHz, and due to interference between the two we get audible energy from the instrument at 2kHz.

     

     

    In order to get that 38 and 40kHz energy through the DAC, we need a sample rate more than double those frequencies, so at least 88.2 or 96kHz, right from the A/D at the recording end through the DAC, which nicely provides the same 38 and 40kHz output that was recorded. Fine, so let's say we've done that, and your electronics pass all those frequencies. But your speakers only have useful response to 30kHz. Whoops, no more 38 and 40kHz energy, and so there's nothing at 2kHz, right? But wait - didn't the DAC also pick up the 2kHz energy along with everything else up to 44 or just under 48 kHz? And your speakers can reproduce that 2kHz just fine. So have we lost anything? I don't know the answer.

     

     

    Now let's say instead of being recorded at an 88.2 or 96kHz sample rate, the input from our hypothetical instrument is band-limited by filtering it to capture just the components up to 20kHz, so we can do A/D with a 44.1kHz sample rate. No 38 or 40kHz energy. But we have captured the result of that interference in the audible band at 2kHz, so the question is exactly the same: Have we lost anything by capturing only the audible result of the interference (2kHz), rather than the energy that created the interference in the first place (38 and 40kHz)? Again, I don't know the answer.

     

    The answer depends on where the intermodulation takes place. If the 2 kHz product is registered by the microphone, and there are no further effects of the ultrasonic tones inside our ears, then removing them will have no audible effect.

    Share this comment


    Link to comment
    Share on other sites

    The answer depends on where the intermodulation takes place. If the 2 kHz product is registered by the microphone, and there are no further effects of the ultrasonic tones inside our ears, then removing them will have no audible effect.

     

    Right. To address @Jud, there should be no assumption that nonlinear effects are the same. Intermodulation is a type of nonlinear effect. Air, bone, various tissues each have unique nonlinear transmission qualities. The nervous system also has memory and computation capabilities that enable nonlinearities that go beyond what is being discussed hear eg "synesthesia" The reason I bring this up is to make the point that many linear assumptions are not necessarily valid.

    Share this comment


    Link to comment
    Share on other sites

    Me too. Very good digital recordings of needle drops are indistinguishable on playback from the original vinyl - played back on the same system. I've personally experienced this and demonstrated it with others.

     

    A properly done digital recording of vinyl has all those "vinyl like" qualities that vinyl lovers love. So the digital medium itself isn't the culprit or the limitation. I refer all of you to John Atkinson's review of the Ayre ADC, where he said he compared digital conversions of his own analog recordings to the original- in fact compared them "till his ears bled" - and said he couldn't tell them apart.

     

    I'm not disputing that a digital copy can and should sound as good as the original analog recording but in reality if you listen to the 16/44 content of artist's like Frank Sinatra on Tidal, it does sound like utter crap compared to the analog original.

    Share this comment


    Link to comment
    Share on other sites




    Create an account or sign in to comment

    You need to be a member in order to leave a comment

    Create an account

    Sign up for a new account in our community. It's easy!

    Register a new account

    Sign in

    Already have an account? Sign in here.

    Sign In Now




×
×
  • Create New...