Jump to content
  • Pure Vinyl Club
    Pure Vinyl Club

    Digital Vinyl: Temporal Domain

    Note : The following article contains information that has been deemed incorrect by leading digital audio engineers. I attempted to corroborate the findings of this article by asking several digital audio experts. I was unable to find anyone who could back up the statements made, with any scientific data or theory. Consider the following article retracted.

     

    I am leaving the text of this article up on CA because it has enabled a good discussion to take place. By leaving it up, people can read what was claimed and read the followup arguments that the prove it incorrect. To remove the article completely only opens up a space for this to happen again, and again, and again.

     

     

    I take full responsibility for the publishing of this article. I should have had a technical editor check it before publication. I apologize to the CA Community for the error in judgement.

     

    - CC.

     

     

     

    1-Pixel.png

    Temporal Domain of Signal, or What is More Important for Listening to Music, Static or Dynamic Characteristics of the Sound Signal?

     

    Every time my audiophile friends, who do not have an analog setup (TT), come to me and see huge piles of expensive, rare LPs, they get puzzled. They wonder, how can it be that LP lovers spend huge amounts of money on their "analog" hobby, while suffering such discomforts when listening to music. They say this method of listening in the 21st century is absolutely impractical. In addition, there are signal distortion and limitations in many of the technical aspects of vinyl.

     

    In response, I always say the same thing in support of analog - it's mainly because of the time domain signal. We (fans of analog audio) are willing to make these sacrifices and inconveniences for much better performance in a time aspect, the so-called dynamic characteristics. Static characteristics, those belonging to the spectral and dynamic domains (Dynamic Range, THD + N, Frequency Response, etc.) certainly are important for high-quality sound, but when it comes to listening to music in real time, in my opinion, it is the dynamic characteristics that matter most

     

     

    Often, in response to my comments, people react with skepticism. They say they are used to trusting technical information that can be measured and compared and what I say is very subjective and ephemeral.

     

    Also viewing comments here on СA, especially those connected with the current topics such as MQA, I have noticed that some members react rather skeptically to the arguments about MQA's improvements of characteristics in the time-domain. And, some even question the very existence of such improvements.

     

    Here it is shown that "High-resolution in temporal, spatial, spectral, and dynamic domains together determine the quality value of perceived music and ,sound and that temporal resolution may be the most important domain perceptually". Temporal resolution, is actually what I would like to briefly discuss with you.

     

    There's a deeply rooted opinion that frequency above 10 kHz, and moreover above 20 kHz, contains a small amount of music information. And yet research shows that, for example transients from cymbals contain significant frequency components extending even above 60 kHz. The trumpet playing fortissimo has transients components up through 40 kHz, and in the case of the violin even temporary frequency of 100 kHz occurs.

     

    As you can see quite a lot of music information is contained in frequencies above 20 kHz. Of course, immediately a question is raised: "Are we able to hear it?". To answer this, it is worth mentioning some rarely discussed issues. Commonly cited audibility up to 20 kHz frequency is derived from conventional hearing tests, which are based on the audibility of simple sounds. But there is an alternative look at the issue from the more "dynamic" side. This is the temporal resolution of the ear, not the "static" harmonic content and audibility of pure sinusoidal tones.

     

    This may be more appropriate in a case of music signals than the prospect of simple tones. The actual music signals have a very complex structure as a result of the imposition of the attack and decay of many instruments. More importantly, their frequency spectrum is very different between the short period of the initial attack, or the rise of sound, eg. as a result of pulling a string or striking a key of a piano, and the subsequent, much longer sound decay.

     

    There is a large group of instruments, which are characterized by a very "transient," dynamic nature of the initial attack phase of the sound. Xylophone, trumpet, cymbals and striking a drum achieve dynamic levels in between 120 and 130 dB within 10 ms or less. One thing we can say for sure, it is not possible for a CD-quality sample scattered at 22.7ms to have an opportunity to correct the commissioning attack phase of musical instruments, which are half the distance between two consecutive samples.

     

    And the attack phase is very important for audio reception. In experiments, in which the samples were of wind instruments dissected in a way that combined the short attack phase of one instrument with a longer sound decay of another one, listeners identified the sound that of the instrument with the short fragment attack, not the longer decay sound.

     

     

    image2.png

     

    The sound wave graph from a cymbal being struck by a stick. The sound increase is nearly instantaneous, followed by a long sustain of a rather uniform nature. - from highfidelity.pl

     

     

    When viewed from the hearing mechanism perspective, you can find information indicating that the signals which have pulsing character (i.e., generally transients), in contrast to simple tones, activate significantly larger areas of hearing cells than pure sinusoidal tones (which, in nature are almost non existent). In the case of pulses, the possible temporal resolution of the human ear may be up to 10 microseconds, corresponding to frequencies of 100 kHz.

     

    This information is also confirmed in the opinion of recognized practitioners. Art Dudley from "Stereophile" magazine, in an interesting interview from The Editors cycle, is of the opinion that the Nyquist frequency does not apply while there are working decimation and reconstruction filters of complex music signals. In his opinion, two samples may be used to describe a single frequency, but do not provide sufficient density samples to describe the speed at which the signal increases or decreases. This is crucial to distinguishing between music and ordinary sound.

     

    Also I would like to quote, in the context of the above information, an excerpt from my correspondence with Dr. Rob Robinson:

     

    "My thoughts are that with extended frequency response you are not capturing "audible" frequencies but rather preserving the critical time relationships in the music at all frequencies. Human hearing might not be able to "detect" sounds above 15 - 20 kHz or so, but on the other hand hearing, in conjunction with the brain, is very sensitive to temporal information. It's been reported that the human auditory system is capable of discerning temporal differences of tens of microseconds or less (and note, at 192 kHz the time between samples is 5 microseconds). This temporal discrimination is the reason we are able to accurately discern directional / spatial cues. Hearing evolved so that the location of threats, e.g., the cougar about to pounce, could be determined accurately, as key to survival. The spatial information comes not only from amplitude, but the time difference between the same sound arriving at each ear. And the more sensitive hearing is to temporal information, the more accurately that spatial cues can be located.

     

    A CD format brickwall filter will affect time relationships, part of the reason that CD format digital audio may sound less "natural" than analog (or live sound). Preserving temporal information is key to preserving lifelike sound and imaging. While all digital audio will affect temporal information, the influence diminishes the higher the sample rate, because the antialiasing and reconstruction filters are operating at ultrasonic frequencies. So, by using higher sample rates, even though we may be recording sounds that are inaudible, we have better preservation of the temporal information in the signal, which conveys a more lifelike presentation of the music. Besides using a high sample rate to capture the signal, we also have the ultra wide 5,000 kHz bandwidth (five thousand kilohertz, as contrasted with "just" 20 kilohertz as the generally accepted audible upper frequency limit) of the Seta preamplifier which again faithfully preserves temporal relationships in the music signal (internally, the front end circuitry has a risetime of less than 50 nanoseconds)." - Dr. Rob Robinson

     

    If we take into consideration the typical technical parameters of audio, which is mainly bandwidth and dynamics (signal-to-noise ratio), we can easily come to the conclusion that, omitting the variables associated with the physiology of hearing, audiophile devices should not differ from each other, and moreover sonically stand out in relation to the audio devices from the mass market.

     

    And yet, there are people willing to pay much higher prices for equipment and the typical specifications are often similar or even slightly worse than the cheaper devices of the mass segment.

     

    Most importantly, in many cases audiophiles agree on the description of the main attributes of the sound of the given device, although expressed in a specific descriptive dictionary, and not in strict technical parameters.

     

    This raises a difficult to challenge conclusion that if some audiophile characteristics are consistently perceived by a large number of people there's a good chance that behind this stands specific physical phenomena, though their nature can be complicated and can be difficult to express in simple numerical parameters, eg. dynamic range or frequency response.

     

    What may these phenomena be? If the key to the mystery lies not in the parameters of the frequency domain (frequency response) or dynamics (noise at a low level), then a single area remains, and that's phase issues, or timing aspects of the sound. In fact, these are the most fundamental parameters of the sound signal, because they underlie its creation, what a sound wave actually looks like in the time domain. The question is how much of the sound wave graph corresponds to the wave reaching the microphone registering this recording.

     

    The nuances of the tonal colors, to the greatest extent, are shaped by the sound wave characteristic from each instrument. And, it's not just a simple analysis of the contents of the so-called harmonics but more of dynamic aspects, mainly the so-called attacks, or the rising of sound at the moment of its creation. It is not difficult to imagine that the course of the rise in amplitude of the sound will be quite different for wind, string and plucked instruments. It's a very fine structure of transients, which over a very short period time, this new tone of a musical instrument provides the bulk of information about its color and texture. Studies show that the human ear is most sensitive to the initial part of the pulse of a new musical sound.

     

    Any disturbance or contamination of this sensitive time structure leads to a noticeable loss of sound quality from the perspective of people sensitive to audiophile aspects, such as nuances in fidelity transmission of all the colors of musical instruments.

     

    In other words, the time domain signal (issues phase, or timing aspects of the sound). In fact, these are the most fundamental parameters of the sound signal, because they lie in its creation - thus what a sound wave in the time domain actually looks like.

     

    So, one of the main advantages of vinyl is the lack of restrictions of temporal resolution in LP. One of the key challenges for us in the Pure Vinyl Club was to find a way (technology, a method of recording) the equipment to maintain a maximum level of temporal resolution from the LP while recording in digital. This does not mean that we were going to compromise or neglect other characteristics which are also important for the sound.

     

    Paweł Piwowarski in his article "PLIKI HI-RES - niezbędny krok do nirwany czy nadmiarowy gadżet?" on High Fidelity.pl in the October 2016 issue to which I referred above, noted that "The trumpet playing fortissimo contains transients of 40 kHz". I invite you to watch this little video using our LP rip, which clearly shows that transients of the trombone can get higher than 50 kHz, and trumpet reaches almost 70kHz!

     

    Later, in one of the following articles, which might be called "What is actually recorded on LP" I will showcase many interesting videos and screenshots, which clearly show that in many musical instruments transients exceed the 40-50 kHz threshold, and among them will be some unexpected ones (contrabass and sibilance of the human voice).

     

    Also, many audiophiles have prejudices about the LPs Dynamic Range. Here's a screenshot of the DR of an album's full side (Duration: 24:07, RAW Record).

     

     

     

    screnshot-DR.jpg

     

     

     

    I will focus on these and other interesting LP aspects in more detail in in the next articles of the Digital Vinyl series.

     

     

    Thank you,

     

    Igor

     

     

     

     

     

    Sound Samples

     

     

    Trippin (Kenny Drew – Trippin (1984, Japan) Promo WL, Baystate (RJL-8101))

    Official DR Value: DR13, Gain Output Levels (Pure Vinyl) – 14.00dB, Edit “Click Repair” – yes

     

    192 kHz / 24 bit (103MB)

     

     

     

    Play Fiddle Play (Isao Suzuki Quartet + 1 – Blue City (1974, Japan) Three Blind Mice (TBM-24))

    Official DR Value: DR13, Gain Output Levels (Pure Vinyl) – 8.02dB, Edit “Click Repair” – yes

     

    192 kHz / 24 bit (113MB)

     

     

     

    Make Someone Happy (Carmen McRae – Live At Sugar Hill San Francisco (1964, USA) Time Records (S/2104))

    Official DR Value: DR14, Gain Output Levels (Pure Vinyl) – 7.23dB, Edit “Click Repair” – yes

     

    192 kHz / 24 bit (99MB)

     

     

     

    Early In The Morning (John Henry Barbee, 1963

    VA – The Best Of The Blues (Compilation) (RE 1973, West Germany) Storyville (671188))

    Official DR Value: DR14, Gain Output Levels (Pure Vinyl) – 10.63dB, Edit “Click Repair” – yes

     

    192 kHz / 24 bit (75MB)

     

     

     

    La Cumparsita (Werner Müller And His Orchestra – Tango! (1967, USA) London Records (SP 44098))

    Official DR Value: DR11, Gain Output Levels (Pure Vinyl) – 0.00dB, Edit “Click Repair” – yes

     

    192 kHz / 24 bit (104MB)

     

     

     

    Wild Is The Wind (The Dave Pike Quartet Featuring Bill Evans – Pike’s Peak 1962 (RE 1981, USA) Columbia (PC 37011))

    Official DR Value: DR12, Gain Output Levels (Pure Vinyl) – 10.31dB, Edit “Click Repair” – yes

     

    192 kHz / 24 bit (109MB)

     

     

     

    People Are Strange (The Doors – 13 (1970, USA) Elektra (EKS-74079))

    Official DR Value: DR11, Gain Output Levels (Pure Vinyl) – 0.00dB, Edit “Click Repair” – yes

     

    192 kHz / 24 bit (82MB)

     

     

     

    Let’s Groove (Earth, Wind and Fire – Raise! (1981, Japan) CBS/Sony (25AP 2210))

    Official DR Value: DR15, Gain Output Levels (Pure Vinyl) – 7.89dB, Edit “Click Repair” – yes

     

    192 kHz / 24 bit (108MB)

     

     

     

    Smooth Operator (Sade – Smooth Operator (1984, Single, 45rpm, Japan) Epic (12・3P-581))

    Official DR Value: DR13, Gain Output Levels (Pure Vinyl) – 7.15dB, Edit “Click Repair” – yes

     

    192 kHz / 24 bit (114MB)

     

     

     

    Fernando (Paul Mauriat – Feelings (1977, 45rpm, Japan) Philips (45S-14))

    Official DR Value: DR14, Gain Output Levels (Pure Vinyl) – 5.76dB, Edit “Click Repair” – yes

     

    192 kHz / 24 bit (112MB)

     

     

     

    1-Pixel.png




    User Feedback

    Recommended Comments



    There is a little mistake I think. Timing accuracy of digitally sampled audio is 1/(44,100*2*pi*65536) Works out to about 56 picoseconds. This is for 44,100 at 16 bit.

    .

    The 20,000 is the bandwidth available on a CD, although it should probably be 21,000. As far as I know, the limit for how small an increment can encode is 1 nanosecond.

     

    It's also worth remembering that the the quantization errors performed by the A/D converter are completely eliminated and turned into random noise by a process called dithering, which is contained in every A/D converter on the market. The early A/D converters from the 80s didn't contain dithering though, as this wasn't introduced until the mid 80s. So, bit depth is only signal-to-noise ratio. In other words, 24 bit doesn't offer higher resolution, just higher dynamic range (144 dB for 24 bit, 96 dB for 16 bit).

    Share this comment


    Link to comment
    Share on other sites

    So then you would agree that demonstration of a consistent ability to distinguish between PCM44 and let's say DSD256 recordings would be proof of the ability to perceive ultrasonic content?

     

     

    Room treatments for headphone users

     

    Yes and no.

    DSD is a bit tricky, as it's a different technology that has certain issues. If you haven't heard about it, Stanley Lipshitz and John Vanderkooy did a paper on 1-bit technology in DSD and basically concluded that it was broken:

    http://www.essentialaudio.com/1-Bit-Is-Bad.pdf

     

    I've heard about studies that concluded that people couldn't tell DSD and PCM apart anyways. I believe our ears are a lot more forgiving than most audiophiles like to think.

    Share this comment


    Link to comment
    Share on other sites

    Why are you recording at 176 ? would you buy your 176 source stuff as hi-res or be perfectly happy streaming it "CD Quality" ? MQA ?

     

    I didn't record the cymbals. John E. Johnson Jr who writes for one of the Home Theater mags is a drummer and did a series of measurements and reviews of dozens of cymbals. His recordings.

     

    I do a little recording. I have a few times recorded at 96 khz and also at 192 khz. Why? The ability to do so was there. If there were a difference, a recording done by me directly off the mikes with zero in between processing should reveal it. While I didn't expect a difference I did try and see if there was one. I also sent some tracks to the younger folks in the band (one was 12 years old) who might have better hearing than I. I told them nothing other than I thought these are a bit different tell me which is better. The fly in the ointment is they don't have great high end gear. They did use some studio monitors and a DAC that could play the high res file. They all said they heard no difference.

     

    So generally I record at 48 khz. Why 48 khz? Well 48 khz does have a wider transition band between the cut-off at 24 khz and response at 20 khz. That should allow the filters to work slightly better and have a little lower aliasing.

     

    As for buying recordings I am in favor of as few transitions as possible. If I knew a recording was done originally at 176 khz then that is the one I would prefer. Although I have violated that idea when the price difference was ridiculous. Pricing by the bit for instance. I'll take the 48/24 version when the 176 or 192 version costs 4 times as much.

    Share this comment


    Link to comment
    Share on other sites

    So far this article that we're debating has spawned 13 PAGES OF COMMENTS!

    If the people writing the article had actually consulted people who understood how digital audio works, they wouldn't have written this article to begin with.

    If someone who understood how digital audio works had posted the first comment, debunking everything in the article, the debate would have stopped right there.

    The authors are just repeating what Kunchur wrote several years ago. Back then it was disproven, and he was discredited. The same should have happened here. For anybody who wants to understand how the technology works, I provide links at the bottom of this post :-).

     

    But surprise, surprise! The authors didn't consult people who understood the technology but instead went to people who love vinyl and hate CDs but don't understand the technology. That has, unfortunately, become standard procedures for vinyl lovers, because the truth seems less important than propaganda that hails vinyl and sheds bad light on digital audio – CDs, in particular. Unfortunately, some of these people, like Dudley and Fremer, are in positions of power: Most audiophiles only have experience with listening to music, but they look up to Fremer and Dudley et al as figures of authority, so if those people spread misunderstandings and lies, those beliefs are rarely questioned by audiophiles.

    Unfortunately, this has created a lot of fighting between the audiophiles and the scientists :-(.

     

    There's nothing wrong with having a preference for something – we all do. Many albums I would never, ever buy on CD. But I would have a lot more respect for the vinyl lovers if they just said "yes, I know vinyl is less transparent, but I simply like the coloration it gives to music", instead of spreading bullshit and lies. If they said that, there would be no arguments. Saying the Nyquist theorem is wrong is simply downright stupid!

    Luckily, other people who understand the technology have chimed in here, trying to sort things out, but an article like this does a lot of damage to the people who don't understand the technology and can't stomach reading through 12 pages of comments.

     

    As mentioned before, the ONLY difference there can POSSIBLY BE between CD and hi-res (unless something has gone wrong) is that the hi-res file has information above 22 kHz and that it may have more relaxed filters. Nothing else. Zip. Zero. Nada.

    192 kHz or 384 kHz sampling rates do NOT produce an output where the raggedy steps more closely approximates the original analogue soundwave, because the output of a DAC are not stair-steps to begin with (see more about this further down). All they do is include higher frequencies. Anybody who claims otherwise don't understand the Nyquist sampling theorem.

    And yes, if the original analogue signal contains energy above 22 kHz, then a higher sample rate could strictly speaking be said to be "closer to the source". If you want to retain that information for intellectual stimulation, then fair enough. As long as you state that. But as it's inaudible anyway IMO it's pointless to preserve those frequencies. But that's my opinion :-).

     

    If you insist on preserving supersonic frequencies, hi-res digital is the way to go – NOT vinyl. Vinyl records are mostly cut using a low-pass filter that rolls off high frequencies, usually above 15 or 18 kHz. Any supersonic frequencies are so low in level to be inaudible – which they would be in any case, as they are above 20 kHz (masking also comes into play here). Yet, many spectrograms of hi-res recordings of vinyl records show energy above 22 kHz, but most of this is distortion or noise. Some vinyl records are cut off at even lower frequencies. An extreme example would be the original copy of Grand Funk's "We're an American Band" where anything above 5 kHz seems to be cut off!

    When the Beatles remasters vinyl records were released, someone presented a spectogram of a hi-res recording of one, which showed content above 22 kHz, and people were raving "Yes! Yes! Yes! I knew it! They are cut from analogue tape or hi-res digital!". The mastering engineer then stepped in and said that they were cut from 44.1 kHz files, so the content above 22 kHz could simply not be anything else than noise or distortion from the vinyl material or the turntable rig.

     

    Art Dudley's claims are fantasies and just fancy and romantic sounding jargon that proves nothing. He's free to believe anything he wants, but when he makes his fantasies public, and they're factually wrong, he will be corrected.

    Granted, digital audio is a very complicated subject, and it took me a long time to understand. There are still issues I struggle with, and I don't fashion myself to be an expert on the matter, but apparently I know more about it than most people working for Stereophile!

    But then again, how could I expect that someone who's willing to pay $3000 for a power cord could possibly understand anything about technology?

     

    So, Art Dudley, and the authors of this article, please provide the results of the level-matched blind-tests you have passed between vinyl or analogue tape vs. said tapes/records properly transferred to 16/44.1 digital. Then please also provide the results of the blind-tests you have passed between hi-res and the same material properly downsampled to 16/44.1.

    Until you do that, the claims are just empty claims.

     

    A small side-note on something that very few people bring up: I was vinyl only for 15 years, but after upgrading my stereo I have compared almost 800 albums on vinyl and CD.

    If there's one category of CDs that are the easiest to generalize about it's CD from the 80s. I'm not a fan of 80s music, but I did compare a reasonable amount of CDs from that period. So far I have only found two CDs from the 80s that were better than their vinyl counterpart (as well as one album where the two medias were practically identical). The rest sounded thin, shrill, cold, clinical and hard.

    I still have several hundred albums that either simply sound better on vinyl, or that I simply prefer. Unfortunately, there are many CDs, both old and new, that sound bad, and many albums from the analogue era sound lovely, as they are well-produced, unlike a lot of music nowadays.

    Art Dudley, Michael Fremer and so many other CD haters probably heard CDs back in the 80s and concluded they were crap (I know Fremer did). I would have done the same! And whenever you hear of comparisons being done, they are usually done with CDs from the 80s.

    I attribute these issues to the poor A/D converters and the poor mastering practices (for CDs) in the 80s. Even pro-digital people (Stanley Lipshitz, Bob Ludwig, Bob Clearmountain, etc.) have also mentioned these issues in public.

     

    Nowadays, converters have become cheap, small and well-functioning. I ripped a CD, then took the CD and recorded the output from my CD player with a very long (at least 5 meters) cheap standard analogue cable into the line-in on my cheap Soundblaster soundcard, adjusted the volume levels and did a blind test: 4 out of 12 correct. Worse than flipping a coin!

    You can try this yourself, but if you're too lazy you don't even have to record anything on your own: You can go to Ethan Winer's website and download three different songs for his loop-back test. They all come in four versions: The original, then recorded back into his $250 Focusrite soundcard (one generation). Then after 5, 10 and 20 generations. See if you can hear a difference.

    Also, if you take a vinyl record and record it to CD, you will most likely not be able to hear a difference. If you transfer a CD onto a vinyl disc you will most likely be able to hear a difference (although some, and only some, vinyl records will of course be transparent within the limits of audibility, minus the surface noise). If you take an analogue master tape and copy it to another analogue tape, most people can hear the difference.

    That just goes to show that digital audio is transparent and analogue is far from. Claiming vinyl or analogue tape is closer to the source than CDs is ludicrous. But when these "figures of authority" say so, it becomes gospel. Let me use Michael Fremer's own favourite phrase against him: "You're talking out of your ass!"

     

    I hope everybody will take 1 minute and 30 seconds to watch the beginning of this following video by Sam Harris. The rest of the video is mostly about religion, and you don't need to watch more than the first 1:30 minutes. I link to this, as I feel this is how it is debating with vinyl-lovers who don't trust facts nor science.

     

     

    For anybody who wishes to understand how digital audio works, here are a couple of links:

     

    First, watch the video by Monty above. It's absolutely crucial!

     

    Then a lot of answers to myths about vinyl being technically superior to CDs:

    Myths (Vinyl) - Hydrogenaudio Knowledgebase

     

    Then an easy to understand article about the real difference between 16 and 24 bit (spoiler: it's not resolution):

    24bit vs 16bit, the myth exploded!

     

    Then an article from the same person about the real difference between CDs and hi-res (spoiler: it's not resolution either):

    Hi-Rez - Another Myth Exploded!

     

     

    Then lastly, a quote by Christopher Hitchens:

    "What can be asserted without proof can be dismissed without proof."

     

    This article here has shown no proof and can be dismissed.

    Share this comment


    Link to comment
    Share on other sites

    The 20,000 is the bandwidth available on a CD, although it should probably be 21,000. As far as I know, the limit for how small an increment can encode is 1 nanosecond.

     

    It's also worth remembering that the the quantization errors performed by the A/D converter are completely eliminated and turned into random noise by a process called dithering, which is contained in every A/D converter on the market. The early A/D converters from the 80s didn't contain dithering though, as this wasn't introduced until the mid 80s. So, bit depth is only signal-to-noise ratio. In other words, 24 bit doesn't offer higher resolution, just higher dynamic range (144 dB for 24 bit, 96 dB for 16 bit).

     

    The pertinent number however is rate of sampling not response. Yes dither in theory might make timing resolution as low as you want. In practice in some sense it increases timing resolution though not to infinity I think. In any case it is a non-problem for fidelity to human ears.

    Share this comment


    Link to comment
    Share on other sites

    May we listen to Stephen Colbert. He said last night that millenials had to stop pushing vinyl. Don't they know we would have given anything to get free music whenever we wished streaming over our rotary phones?

     

    I may have garbled the quote, but you get the idea.

    Share this comment


    Link to comment
    Share on other sites

    I sample rate converted to 44.1 and then back to 176 so the scale would match on both. Upper is the original 176 khz and lower is the 44 khz.

     

    [ATTACH=CONFIG]34044[/ATTACH]

     

     

    [ATTACH=CONFIG]34045[/ATTACH]

     

     

    You can see in the latter image that the very beginning of the transient wasn't slow to start on 44 khz. You do see some peaks are rounded and lower in level due to the energy in ultrasonic frequencies that were not captured.

     

    It is hard to say where the real fundamental modes of the cymbal are. The decaying tail has strong output at 6700 hz and 8 khz with a big drop off after that. The harmonics do extend to maybe 50 khz prior to 100 milliseconds. The microphone used is said flat to 40 khz and I am sure has some output beyond that. After about 100 milliseconds the strong harmonics beyond 10 khz suddenly die out leaving pretty much only below 10 khz sound.

     

    Here is the spectrogram. It goes to the gray background at -80 db and uses 1024 bin FFT as I have it set here.

     

    [ATTACH=CONFIG]34046[/ATTACH]

     

    Thanks. What's interesting is that (at least as my eyes are seeing it - would be nice to get an overlay view if that's possible) that as you say the rise to peak doesn't seem slow; the differences I think I'm seeing are in the decay. (And these don't look to be large differences.)

     

    Edit: What conversion filter did you use?

    Share this comment


    Link to comment
    Share on other sites

    There's nothing wrong with having a preference for something – we all do. Many albums I would never, ever buy on CD. But I would have a lot more respect for the vinyl lovers if they just said "yes, I know vinyl is less transparent, but I simply like the coloration it gives to music", instead of spreading bullshit and lies.

     

     

    * * *

     

    If there's one category of CDs that are the easiest to generalize about it's CD from the 80s. I'm not a fan of 80s music, but I did compare a reasonable amount of CDs from that period. So far I have only found two CDs from the 80s that were better than their vinyl counterpart (as well as one album where the two medias were practically identical). The rest sounded thin, shrill, cold, clinical and hard.

    I still have several hundred albums that either simply sound better on vinyl, or that I simply prefer. Unfortunately, there are many CDs, both old and new, that sound bad, and many albums from the analogue era sound lovely, as they are well-produced, unlike a lot of music nowadays.

     

    So many vinyl lovers' preferences may come to some extent from the fact that at least part of the time they're listening to better mastering. I completely agree with the proposition that mastering trumps resolution or even vinyl vs. digital.

     

     

    And then folks get sucked in by plausible-sounding hypotheses that are incorrect, but they don't examine too closely since these hypotheses are in line with their preconceptions. I'm still in agreement, in fact perhaps more so than you: that Lipshitz and Vanderkooy article is steadfastly cited by lots of people whose DACs are doing sigma-delta modulation internally (yourself as well, perhaps?), and they don't seem to mind.

    Share this comment


    Link to comment
    Share on other sites

    ie Van Gelder's remasterings sound great at 24/44.1

    I didn't record the cymbals. John E. Johnson Jr who writes for one of the Home Theater mags is a drummer and did a series of measurements and reviews of dozens of cymbals. His recordings.

     

    I do a little recording. I have a few times recorded at 96 khz and also at 192 khz. Why? The ability to do so was there. If there were a difference, a recording done by me directly off the mikes with zero in between processing should reveal it. While I didn't expect a difference I did try and see if there was one. I also sent some tracks to the younger folks in the band (one was 12 years old) who might have better hearing than I. I told them nothing other than I thought these are a bit different tell me which is better. The fly in the ointment is they don't have great high end gear. They did use some studio monitors and a DAC that could play the high res file. They all said they heard no difference.

     

    So generally I record at 48 khz. Why 48 khz? Well 48 khz does have a wider transition band between the cut-off at 24 khz and response at 20 khz. That should allow the filters to work slightly better and have a little lower aliasing.

     

    As for buying recordings I am in favor of as few transitions as possible. If I knew a recording was done originally at 176 khz then that is the one I would prefer. Although I have violated that idea when the price difference was ridiculous. Pricing by the bit for instance. I'll take the 48/24 version when the 176 or 192 version costs 4 times as much.

    Share this comment


    Link to comment
    Share on other sites

    Hope I won't ruin your patience but I'm getting more and more confused :

     

    to my understanding every demo and thus SN theorem relies on both the band limitation and the periodical nature of the signal, the 2 (X2, twice bandwidth) factor being related to the fact that, taking into account the given sinusoidal shape by nature of the signal, what was up will be mirrored down, that there's only one solution for a given position at a determined timing in a limited bandwidth.

     

    This is why I tried to break the image by bursts at rates higher than sampling.

     

    Led to out of bandwidth.

    We now are looking at signals that when recorded at 176 yield degraded versions at 44.

    So, at the end of the day (actually very beginning here !) is 44 perfectly enough as per Monty etc or do actual sound emissions require larger bandwidth for better capture and can this capture be accurately be reproduced at 44? To my eyes you demonstrate Yes they require larger bandwidth, No 44 can not be faithful.

    I'm confused for it would thus appear that if it's not a matter of number of samples it's a matter of bandwidth ; isn't it silly ?

    Let me try to explain. The Fourier Transform itself is defined for continuous signals (without sampling). It states that for *any* continuous time dependent signal, there exists a transform into the frequency/phase dimension such that the signal can be described *only* by its constituent frequency/phase coordinates -- that means that *any* time dependent signal can be described entirely by a set of sinusoidal waves having a defined frequency, amplitude and phase.

     

    This is mathematically true for every signal. *Why* its true involves some math: https://en.wikipedia.org/wiki/Fourier_transform but trust that this is true.

     

    Now there is the discrete fourier transform (DCT) which is defined for discrete samples. This is equivalent to the general fourier transform when the sampling rate is infinite. So what to do? In practice suppose we take samples in the Ghz range. We can then examine the discrete fourier transform and suppose we see that *for that specific signal, e.g. a jazz band* that the amplitudes of the frequency components >20 kHz are *zero*. That tells us that *that particular signal* is bandwidth limited to 20 kHz. Now suppose we hit the cymbal ? is there a component > 20 kHz ... let's say there is, now for the sake of discussion up to 40 kHz and then zero above 40 kHz. That means that we can capture the transients in that signal by sampling at say 88 kHz. Or if there are components at 70 kHz then 176 kHz will contain those transients.

     

    Do you follow?

     

    So you can just look at the discrete fourier transform of the recording and see what the minimal sampling rate would be to capture any transients or *anything else* in that signal. You can determine the actual bandwidth of that signal.

     

    Shannon-Nyquist https://en.wikipedia.org/wiki/Nyquist%E2%80%93Shannon_sampling_theorem just tells us what the sampling rate *needs to be* in order to capture the full fidelity of any particular signal (given its particular bandwidth).

     

    So again *assuming the signal is bandwidth limited to 20 kHz* 44 kHz is adequate to capture any transients that exist *in that signal*, to answer your question. Not silly.

    Share this comment


    Link to comment
    Share on other sites

    Thanks. What's interesting is that (at least as my eyes are seeing it - would be nice to get an overlay view if that's possible) that as you say the rise to peak doesn't seem slow; the differences I think I'm seeing are in the decay. (And these don't look to be large differences.)

     

    Edit: What conversion filter did you use?

     

    Don't have a simple quick way to overlay them precisely. I instead inverted one and mixed them to show the difference. The FFT at the bottom is for the first 100 milliseconds. The top view is spectrogram and the lower one of the zoomed in part where you can see the samples. I used Sox for the resampling to 44 and back via Audacity. Below 20 khz their is some noise at -160 db. When I play this I hear nothing. When I play it at 25% normal speed it sounds like if you tap the bottom of a 12 ounce soda can with a pencil.

     

    difference cymbal.png

    Share this comment


    Link to comment
    Share on other sites

    Let me try to explain. The Fourier Transform itself is defined for continuous signals (without sampling). It states that for *any* continuous time dependent signal, there exists a transform into the frequency/phase dimension such that the signal can be described *only* by its constituent frequency/phase coordinates -- that means that *any* time dependent signal can be described entirely by a set of sinusoidal waves having a defined frequency, amplitude and phase.

     

    This is mathematically true for every signal. *Why* its true involves some math: https://en.wikipedia.org/wiki/Fourier_transform but trust that this is true.

     

    Now there is the discrete fourier transform (DCT) which is defined for discrete samples. This is equivalent to the general fourier transform when the sampling rate is infinite. So what to do? In practice suppose we take samples in the Ghz range. We can then examine the discrete fourier transform and suppose we see that *for that specific signal, e.g. a jazz band* that the amplitudes of the frequency components >20 kHz are *zero*. That tells us that *that particular signal* is bandwidth limited to 20 kHz. Now suppose we hit the cymbal ? is there a component > 20 kHz ... let's say there is, now for the sake of discussion up to 40 kHz and then zero above 40 kHz. That means that we can capture the transients in that signal by sampling at say 88 kHz. Or if there are components at 70 kHz then 176 kHz will contain those transients.

     

    Do you follow?

     

    So you can just look at the discrete fourier transform of the recording and see what the minimal sampling rate would be to capture any transients or *anything else* in that signal. You can determine the actual bandwidth of that signal.

     

    Shannon-Nyquist https://en.wikipedia.org/wiki/Nyquist%E2%80%93Shannon_sampling_theorem just tells us what the sampling rate *needs to be* in order to capture the full fidelity of any particular signal (given its particular bandwidth).

     

    So again *assuming the signal is bandwidth limited to 20 kHz* 44 kHz is adequate to capture any transients that exist *in that signal*, to answer your question. Not silly.

     

    One thing further to what jabbr said, for our masked vegetable friend ;) :

     

    "Periodic" in the sense of sine wave-like signals is also not necessary. The Sampling Theorem also works fine for stuff that doesn't produce nice sine waves, like percussion or inharmonic attacks, such as the pluck of a string or (for brass players) "tonguing."

     

     

    Sent from my iPhone using Computer Audiophile

    Share this comment


    Link to comment
    Share on other sites

    One thing further to what jabbr said, for our masked vegetable friend ;) :

     

    "Periodic" in the sense of sine wave-like signals is also not necessary. The Sampling Theorem also works fine for stuff that doesn't produce nice sine waves, like percussion or inharmonic attacks, such as the pluck of a string or (for brass players) "tonguing."

    Yes, absolutely true. The fourier transform is applicable to all time dependent signals including transients. The classic transient is called the "impulse" and the "impulse response" of a system is extensively studied and discussed. https://en.wikipedia.org/wiki/Impulse_response

     

    The response of a system to an impulse or transient is one of the best ways to study a system, be that an audio system e.g. when measuring rooms for correction kernels, or even to study what happens to the Twitter space when Trump issues an impulse tweet about Obama. ***

     

    *** I'm entirely serious.

    Share this comment


    Link to comment
    Share on other sites

    One thing further to what jabbr said, for our masked vegetable friend ;) :

     

    "Periodic" in the sense of sine wave-like signals is also not necessary. The Sampling Theorem also works fine for stuff that doesn't produce nice sine waves, like percussion or inharmonic attacks, such as the pluck of a string or (for brass players) "tonguing."

     

    One way of looking at it is that all signals are periodic, but some have infinite period time.

    Share this comment


    Link to comment
    Share on other sites

    So many vinyl lovers' preferences may come to some extent from the fact that at least part of the time they're listening to better mastering. I completely agree with the proposition that mastering trumps resolution or even vinyl vs. digital.

     

     

    And then folks get sucked in by plausible-sounding hypotheses that are incorrect, but they don't examine too closely since these hypotheses are in line with their preconceptions. I'm still in agreement, in fact perhaps more so than you: that Lipshitz and Vanderkooy article is steadfastly cited by lots of people whose DACs are doing sigma-delta modulation internally (yourself as well, perhaps?), and they don't seem to mind.

     

    That paper talks about inherent problems with 1-bit sigma-delta converters, and it is quite correct. It explicitly states that multi-bit sigma-delta is perfectly fine. Almost all modern DACs use multi-bit converters. Guess why.

    Share this comment


    Link to comment
    Share on other sites

    Hi Guys - I just edited the article with the following statement at the top.

     

     

    The following article contains information that has been deemed incorrect by leading digital audio engineers. I attempted to corroborate the findings of this article by asking several digital audio experts. I was unable to find anyone who could back up the statements made, with any scientific data or theory. Consider the following article retracted.

     

    I am leaving the text of this article up on CA because it has enabled a good discussion to take place. By leaving it up, people can read what was claimed and read the followup arguments that the prove it incorrect. To remove the article completely only opens up a space for this to happen again, and again, and again.

     

    I take full responsibility for the publishing of this article. I should have had a technical editor check it before publication. I apologize to the CA Community for the error in judgement.

     

    - CC.

    Share this comment


    Link to comment
    Share on other sites

    That paper talks about inherent problems with 1-bit sigma-delta converters, and it is quite correct. It explicitly states that multi-bit sigma-delta is perfectly fine. Almost all modern DACs use multi-bit converters. Guess why.

     

    Yes, as Miska is fond of saying, it shows you get bad results when you use crappy SDM. :)

    Share this comment


    Link to comment
    Share on other sites

    Hi Guys - I just edited the article with the following statement at the top.

     

     

     

    The following article contains information that has been deemed incorrect by leading digital audio engineers. I attempted to corroborate the findings of this article by asking several digital audio experts. I was unable to find anyone who could back up the statements made, with any scientific data or theory. Consider the following article retracted.

     

    I am leaving the text of this article up on CA because it has enabled a good discussion to take place. By leaving it up, people can read what was claimed and read the followup arguments that the prove it incorrect. To remove the article completely only opens up a space for this to happen again, and again, and again.

     

    I take full responsibility for the publishing of this article. I should have had a technical editor check it before publication. I apologize to the CA Community for the error in judgement.

     

    - CC.

     

     

    I think that's great, frankly. On how many other sites would the editorial content (1) take account of the comments, (2) ever bother to say "whoops, we goofed," and (3) bother to leave (2) in a position of some prominence?

    Share this comment


    Link to comment
    Share on other sites

    Hi Guys - I just edited the article with the following statement at the top.

     

     

    The following article contains information that has been deemed incorrect by leading digital audio engineers. I attempted to corroborate the findings of this article by asking several digital audio experts. I was unable to find anyone who could back up the statements made, with any scientific data or theory. Consider the following article retracted.

     

    I am leaving the text of this article up on CA because it has enabled a good discussion to take place. By leaving it up, people can read what was claimed and read the followup arguments that the prove it incorrect. To remove the article completely only opens up a space for this to happen again, and again, and again.

     

    I take full responsibility for the publishing of this article. I should have had a technical editor check it before publication. I apologize to the CA Community for the error in judgement.

     

    - CC.

     

    Thanks for leaving the discussion on CA. Please, be sure not to delete it because I have learnt a lot and I will be able to learn a lot more, reading again the thread and giving myself time to go to the linked articles. I think it is very valuable.

    And also many thanks to everybody that has given their time and efforts to describe technical questions in an understandable way (besides I am not an English speaking person)

    I do not post in CA but I usually follow it and really enjoy discussions like this.

    Share this comment


    Link to comment
    Share on other sites

    That paper talks about inherent problems with 1-bit sigma-delta converters, and it is quite correct. It explicitly states that multi-bit sigma-delta is perfectly fine. Almost all modern DACs use multi-bit converters. Guess why.

     

    Ha ha, well if you don't believe anything over 20 kHz is important, and according to their graph the noise level roughly -150dB with DSD64 then what's the problem? Oh right...

     

    In any case if we upsample then that rising noise level get's pushed out farther and farther. An excellent single bit sigma delta converter which can be entirely analyzed because the circuit is entirely published is the Signalyst DSC1. You would see that the digital FIR filter suppresses noise > 80 kHz or so and then the 4 stage SK filter with a corner at 100 kHz suppresses the generated noise which at DSD512 is pushed into the MHz range.

     

    To my reading however, the Grimm article is comparing PCM 24/192 to DSD 64 and if we are talking purely NOS DAC, that is a reasonable comparison.

    Share this comment


    Link to comment
    Share on other sites

    Ha ha, well if you don't believe anything over 20 kHz is important, and according to their graph the noise level roughly -150dB with DSD64 then what's the problem? Oh right...

     

    In any case if we upsample then that rising noise level get's pushed out farther and farther. An excellent single bit sigma delta converter which can be entirely analyzed because the circuit is entirely published is the Signalyst DSC1. You would see that the digital FIR filter suppresses noise > 80 kHz or so and then the 4 stage SK filter with a corner at 100 kHz suppresses the generated noise which at DSD512 is pushed into the MHz range.

     

    To my reading however, the Grimm article is comparing PCM 24/192 to DSD 64 and if we are talking purely NOS DAC, that is a reasonable comparison.

    The Lipshitz/Vanderkooy paper is about sigma-delta modulators, not the D/A stage. While excellent performance is possible, there is no denying that 1-bit modulators of 3rd order or higher are inherently unstable and exhibit various forms of distortion. A multi-bit modulator does not (necessarily) have these issues. Given the choice, a multi-bit architecture is therefore preferable.

    Share this comment


    Link to comment
    Share on other sites

    Hi Guys - I just edited the article with the following statement at the top.

     

    Well done. This is appreciated.

    Share this comment


    Link to comment
    Share on other sites

    The Lipshitz/Vanderkooy paper is about sigma-delta modulators, not the D/A stage. While excellent performance is possible, there is no denying that 1-bit modulators of 3rd order or higher are inherently unstable and exhibit various forms of distortion. A multi-bit modulator does not (necessarily) have these issues. Given the choice, a multi-bit architecture is therefore preferable.

     

    Would you say "various forms of distortion" are characteristic of the DSD recordings on offer? What about audibility of such distortions if/where they do exist?

     

     

    Sent from my iPhone using Computer Audiophile

    Share this comment


    Link to comment
    Share on other sites

    The Lipshitz/Vanderkooy paper is about sigma-delta modulators, not the D/A stage. While excellent performance is possible, there is no denying that 1-bit modulators of 3rd order or higher are inherently unstable and exhibit various forms of distortion. A multi-bit modulator does not (necessarily) have these issues. Given the choice, a multi-bit architecture is therefore preferable.

     

    Sorry I was referring to the Grimm white paper.

     

    My impression has been that Lipshitz was referring to the ADC and storage stages at least primarily. Hard to know how many chip based DACs use 3rd order modulators as they don't typically publish their internals.

    Share this comment


    Link to comment
    Share on other sites




    Create an account or sign in to comment

    You need to be a member in order to leave a comment

    Create an account

    Sign up for a new account in our community. It's easy!

    Register a new account

    Sign in

    Already have an account? Sign in here.

    Sign In Now




×
×
  • Create New...