El Guapo Posted March 17, 2022
@Miska Hi Miska,
After months of running MCH DSD256 upsampling, I found an issue that might be related to your filters' CUDA API calls (or the EC modulators?). I can reproduce it on both builds (i.e. x86 and AMD). Here's the log:
My system is a 12th Gen i9-12900K with an EVGA 3080 Ti, Ubuntu Server 20.04 HWE, NVIDIA driver 510.47.03. Music stalls at random times when using the gauss-* series filters with any EC modulator for MCH DSD256 (non-EC can run 24x7, though). I checked the syslog, which showed an Xid 31 error message, indicating a GPU memory page fault possibly caused by the user application.
Two channels are fine: the gauss-* series filters plus DSD5EC or AMSDM7EC can do DSD1024 with GPU 24x7 on my system. The error only happens in multichannel. Would you help me verify this issue? Thank you very much!
Guapo
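For anyone who wants to check their own syslog for the same kind of Xid message, here is a minimal sketch in Python. The message shape in the regular expression and the two sample lines are assumptions based on how the NVIDIA kernel driver typically logs these events; they are not taken from the log in the post above, and `find_xid_events` is just an illustrative helper name.

```python
import re

# Assumed message shape: "NVRM: Xid (PCI:<bus id>): <code>, ..."
XID_PATTERN = re.compile(r"NVRM: Xid \((?P<device>[^)]+)\): (?P<code>\d+)")


def find_xid_events(lines):
    """Return a list of (device, xid_code) tuples found in the given log lines."""
    events = []
    for line in lines:
        match = XID_PATTERN.search(line)
        if match:
            events.append((match.group("device"), int(match.group("code"))))
    return events


# Hypothetical sample lines, for illustration only.
SAMPLE_LOG = [
    "Mar 17 10:02:11 server kernel: NVRM: Xid (PCI:0000:01:00): 31, pid=1234, Ch 00000008",
    "Mar 17 10:02:12 server kernel: usb 1-2: new high-speed USB device number 5",
]

for device, code in find_xid_events(SAMPLE_LOG):
    print(f"Xid {code} reported on {device}")
```

On a real system the same function could be fed the lines of `/var/log/syslog` (the path may differ by distribution), and the results filtered for code 31.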
El Guapo Posted March 17, 2022
41 minutes ago, Miska said: I suspect you are running out of GPU RAM... Since cudaMalloc() fails...
Here's a recent screenshot from testing the x86 build's AVX512 performance, so I had disabled all E-cores. 8ch DSD256 from a 48 kHz source, using gauss-hires-ip + 5EC with IR WAV room corrections. It took less than 600 MB of GPU memory...
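GPU memory use like the figure above can also be logged from a script rather than read off a screenshot. The sketch below assumes `nvidia-smi` is on the PATH and uses its `--query-gpu` CSV output, which reports memory in MiB when `nounits` is set. The helper names and the sample numbers in the comments are illustrative assumptions, not measurements from this thread.

```python
import subprocess

# nvidia-smi CSV query for used and total memory, one line per GPU.
QUERY = [
    "nvidia-smi",
    "--query-gpu=memory.used,memory.total",
    "--format=csv,noheader,nounits",
]


def parse_memory_csv(text):
    """Parse 'used, total' lines (MiB) into a list of (used, total) integer tuples."""
    readings = []
    for line in text.strip().splitlines():
        used, total = (int(value.strip()) for value in line.split(","))
        readings.append((used, total))
    return readings


def usage_percent(used_mib, total_mib):
    """Share of GPU memory in use, as a percentage."""
    return 100.0 * used_mib / total_mib


def read_gpu_memory():
    """Run nvidia-smi and return one (used, total) tuple per GPU. Requires an NVIDIA driver."""
    output = subprocess.run(QUERY, capture_output=True, text=True, check=True).stdout
    return parse_memory_csv(output)
```

Calling `read_gpu_memory()` every few seconds during playback would show whether memory use climbs before a stall or stays flat.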
El Guapo Posted March 17, 2022
6 minutes ago, Miska said: I'm a bit lost on what case fails then? What is the difference between failing and non-failing cases?
The previous screenshot was one of the failing cases; I posted it just to show the GPU usage. Music played for a couple of minutes, then stalled. I have to restart the hqplayer daemon to recover.
El Guapo Posted March 17, 2022
10 minutes ago, Miska said: OK, a bit of mystery. From the CUDA code point of view, it doesn't know if there are two channels or 8. But memory consumption is certainly 4x higher on 8 channels than on 2 channels.
Understood. Here's a screenshot from using a non-EC modulator for 12ch DSD256 without any issues. It could run 24x7, and GPU memory utilization was still low. The issue only happens when using any EC modulator for 6 or 8ch DSD256 (beyond that channel count, only non-EC can do DSD256 on my system).
El Guapo Posted March 17, 2022
8 minutes ago, Miska said: That is even more strange, because that doesn't affect GPU work at all. (modulators don't use GPU)
I understand the GPU is not involved in the modulators, which is why I feel so lost with this issue. 😅 I really have no idea which part or step I missed. I've used non-EC for MCH music playback so far, but I really like 5EC's SQ on the Anubis...
El Guapo Posted March 17, 2022
40 minutes ago, StreamFidelity said: Have you enabled AVX512 in the BIOS?
Yes, I have.
40 minutes ago, StreamFidelity said: How high is the clock frequency?
The screenshot includes i7z's output: roughly 2.4 GHz per physical core.
El Guapo Posted March 17, 2022
58 minutes ago, bobflood said: You need to get it up to about 4 Ghz to get DSD256.
It wasn't a CPU frequency issue in my case… it's about the GPU's API calls. The AMD build does need 3~4 GHz for 8-channel DSD256, but 2~3 GHz is sufficient for the x86 build in the same scenario on the 12th Gen i9-12900K. AVX512 really shines here. For maximum 2-channel performance, the AMD build is the winner: at 5.1 GHz it can do DSD1024 or x48 with the DSD5EC or AMSDM7EC modulators. 😄
El Guapo Posted March 17, 2022
4 hours ago, StreamFidelity said: Why so little?
GPU helps. 😁
El Guapo Posted March 23, 2022
On 3/17/2022 at 12:57 PM, Miska said: That is even more strange, because that doesn't affect GPU work at all.
Hi Miska,
Just FYI, the GPU memory problem is gone. What I did: I disassembled my i9 server to install a new liquid cooling system (pulled the CPU, GPU and DDR5 modules), re-assembled it, then updated to the most current kernel and NVIDIA driver. Now EC multichannel DSD upsampling runs very smoothly <3
Guapo
El Guapo Posted March 27, 2022
57 minutes ago, wanta911 said: if someone can point out what I have wrong in the settings:
El Guapo Posted March 27, 2022
6 minutes ago, The Computer Audiophile said: Perhaps a 32 channel Holo DAC is coming
Before replying, I really did check Holo's website to make sure I hadn't missed anything… 😅
El Guapo Posted April 1, 2022
1 hour ago, Quokka_61 said: I have tried to solve this problem by making processing in HQPlayer Desktop as light as possible
It's worth trying "Short buf" + NAA delay 5ms + *-mp (minimum phase) filters. That should reduce latency somewhat (there is still a lip-sync issue, though, just within a tolerable range for me).
El Guapo Posted April 3, 2022
19 minutes ago, Fredc said: Quite a few dacs out there have galvanically isolated usb input (e. g. Chord Qutest, ifi pro idsd). Wouldn't that do?
You need the USB cable to carry 5V to your DAC because the DAC's USB controller chip needs it. It's not just about galvanic isolation.
El Guapo Posted April 5, 2022
25 minutes ago, Mark Labbett said: what causes these dropouts?
The CPU utilization rate you saw was an average across all cores. Please check the utilization of each individual physical core.
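To illustrate why the average can hide a saturated core, here is a small Python sketch that computes per-core utilization from two snapshots of Linux's `/proc/stat`. It counts the first eight tick fields (user through steal) and treats idle plus iowait as idle time. The function names and the two sample snapshots are hypothetical, made up for illustration.

```python
def parse_proc_stat(text):
    """Return {core_name: (busy_ticks, total_ticks)} for each per-core 'cpuN' line."""
    cores = {}
    for line in text.splitlines():
        fields = line.split()
        # Skip non-CPU lines and the aggregate "cpu" line (the all-core average).
        if not fields or not fields[0].startswith("cpu") or fields[0] == "cpu":
            continue
        ticks = [int(value) for value in fields[1:9]]  # user .. steal
        idle = ticks[3] + (ticks[4] if len(ticks) > 4 else 0)  # idle + iowait
        total = sum(ticks)
        cores[fields[0]] = (total - idle, total)
    return cores


def per_core_utilization(before, after):
    """Percentage of busy time per core between two /proc/stat snapshots."""
    first = parse_proc_stat(before)
    second = parse_proc_stat(after)
    result = {}
    for core, (busy_now, total_now) in second.items():
        busy_then, total_then = first.get(core, (0, 0))
        elapsed = total_now - total_then
        result[core] = 100.0 * (busy_now - busy_then) / elapsed if elapsed > 0 else 0.0
    return result


# Hypothetical snapshots: one core nearly saturated, one nearly idle.
BEFORE = "cpu 0 0 0 0 0 0 0 0\ncpu0 0 0 0 0 0 0 0 0\ncpu1 0 0 0 0 0 0 0 0\n"
AFTER = "cpu 100 0 0 100 0 0 0 0\ncpu0 95 0 0 5 0 0 0 0\ncpu1 5 0 0 95 0 0 0 0\n"

for core, percent in per_core_utilization(BEFORE, AFTER).items():
    print(f"{core}: {percent:.1f}% busy")
```

In this made-up example the all-core average is 50%, yet one core is 95% busy, which is the situation where dropouts can occur even though the overall figure looks comfortable. On a real machine the two snapshots would come from reading `/proc/stat` twice with a short sleep in between.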
El Guapo Posted April 6, 2022
35 minutes ago, Mark Labbett said: the audio had dropouts while taking these screen shots, the peaks on the GPU were at the time of the screen shots
I think you've hit the performance limit of your i7-4790K.
El Guapo Posted April 11, 2022
24 minutes ago, Miska said: According to information I have found, it shouldn't. But the information is scarce so it is hard to say for sure until someone tests and reports.
My previous RTX 3070 / 8GB GPU was an LHR version. I didn't see a large performance penalty compared to my current 3080 Ti / 12GB. The limitation was VRAM, not the GPU's computing power.
Here's my old test data, 3070 LHR doing AMSDM7EC -> DSD1024
Here's my current 3080 Ti, no LHR
El Guapo Posted April 12, 2022
2 hours ago, Nagi said: this has got to be done via HQP Pro right?
That's HQPlayer Embedded doing real-time upsampling. 😊
El Guapo Posted April 13, 2022
26 minutes ago, Hammer said: Does o mean it can do it and x means no, it can't?
Correct. O means able to run 24x7; X means unable.
El Guapo Posted April 23, 2022
56 minutes ago, pis99 said: I have 12900K running all cores at 5.3, DDR5 at 5200 with CUDA off load from RTX-A6000. I can only do this with DSD5EC module.
Just curious... Which brand/model of Z690 motherboard are you using? Your spec is better than mine, but I can upsample to DSD1024 / x48 with DSD5EC using all filters except 1x sinc-L (due to the GPU's 12GB memory limit), and to DSD1024 / x48 with AMSDM7EC using about 80% of the filters...
Glad I left a message on page 1000!
El Guapo Posted May 13, 2022
2 hours ago, Andrey Boldakov said: Maybe you know, is it possible to go for DSD1024 with those settings on this planet?
I just finished my initial testing of Ubuntu Server 22.04 "Jammy Jellyfish" with HQPlayer Embedded 4.32. The good news is that ASDM7ECv2 is finally usable for DSD1024. My current listening combo is 1x gauss-long / Nx gauss with ASDM7ECv2 and the rate fixed at 45MHz. All P-cores are fixed at 5.3GHz and the ring at 4.4GHz.
44.1 -> gauss-long + ASDM7ECv2 -> DSD1024
192 -> gauss + ASDM7ECv2 -> DSD1024
Performance result
BTW I personally still don't think the 12900KS would help. Waiting for the 13th gen i9 is more practical.
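For readers unfamiliar with the rate names, the arithmetic behind them can be sketched as follows. DSD rates are conventionally named as multiples of a 44.1 kHz or 48 kHz base, so the "45MHz" above presumably corresponds to 1024 × 44.1 kHz, and the "x48" variant mentioned elsewhere in the thread to 1024 × 48 kHz. The function name is illustrative, and that reading of "45MHz" is an assumption.

```python
def dsd_rate_hz(multiple, base_hz=44100):
    """Bit rate in Hz for a DSD rate named as a multiple of the base sample rate."""
    return multiple * base_hz


# 44.1 kHz family, as in DSD256 and DSD1024.
print(dsd_rate_hz(256) / 1e6, "MHz")
print(dsd_rate_hz(1024) / 1e6, "MHz")

# 48 kHz family ("x48").
print(dsd_rate_hz(1024, base_hz=48000) / 1e6, "MHz")
```

So DSD1024 runs at four times the bit rate of DSD256, which is one reason the per-core clock speed matters so much more at that rate.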
El Guapo Posted May 13, 2022
3 minutes ago, sledwards said: Is the initial Jammy release of desktop CUDA compatible?
I'm not sure about 22.04 Desktop, but Server is fully CUDA compatible.
4 minutes ago, sledwards said: Any reason I should not use 22.04 LTS instead of 20.04?
I personally suggest 22.04. It has better support for Z690.
El Guapo Posted May 14, 2022
4 hours ago, sledwards said: Does the jammy version of embedded (hqplayerd_4.32.0-134avx2_amd64.deb) require Rocm to function with CUDA?
This initial release does not include ROCm (Jammy is not supported yet; there is not even PGP signing data), so it's not required for HQPe installation.
El Guapo Posted May 14, 2022
Accidentally sent twice. Deleted.
El Guapo Posted May 14, 2022
45 minutes ago, AudioDoctor said: Do you enter all that data into a spreadsheet manually?
Yes. I change the filter, listen for more than 3 minutes, then enter the data. 😆
El Guapo Posted May 25, 2022
42 minutes ago, GMG said: Any benefit or is the running the additional thread for the NAA just taking up resources?
That's how I run it, because USB through ALSA increases utilization on the primary cores. Letting NAA handle USB on the least-used cores squeezes more MHz out of the primary cores for higher DSD rates.