After last 2 articles, I have gather more questions from audiophiles and hopefully this document can discuss more on these areas.

(1) I saw there is USB sound card, what is the different between USB and a normal sound card with digital output? Which one is better?

It's hard to determine which one is better because the details lies on the design. USB is user friendly connection, but in general USB sound devices produces higher jitter than normal digital output. USB relies on 'slave' mode. It does not has its own crystal and buffering. Sound card is like a master. It has its own pool to store data from anything ahead and generate its own timing signal for SPDIF, TOSLINK, AES etc with its on board oscillators.


(2) There are great reviews of USB DACs, if your theory is correct, how can that be?

Everything should based on the design. In general, USB connection is more jittery than normal digital audio connection, but jitter can be cleaned/recovered. As long as the jitter is recovered before it gets to the DAC section, the jitter will not be a problem. The most widely used de-jittering in today market is ASRC (asynchronous sample rate conversion). The incoming signal re-clocks with the DAC internal crystal to form a new sample rate. This method isolates the incoming jitter. Anagram Technologies licenses its ASRC to many consumer DACs from Nagra, Manley to Cambridge Audio. Benchmark Media uses ASRC method with very good results also. Another reason that ASRC is widely used in hifi marketing term - "upsampler" is an important selling point.


(3) What is the benefit of upsampling?

A huge audiophiles misunderstanding is "higher upsampling" rate is "better". This is entirely wrong. Almost all DAC chipsets are worked in Mhz and provides 8x or above oversampling. The reason why upsampling improves quality is based on the custom filter design. It has been proven that this filter design (around Nyquist freq.) affects our hearing the most.


(4) We love high sampling is not because of the extra bandwidth.

Human ears cannot hear frequency above 20khz. High sampling rate recording sounds better because the more relax filter design in the Nyquist frequency. The first famous filter design in high end audio history is Wadia Digimaster. Weiss custom algorithm on Sharc DSP with 40bit floating point produces industry leading performance, which is hard to compare with usual chipset solution.


(5) Weiss Minerva/DAC2 uses firewire IEEE1394 as computer connection, does it works better than USB?

Yes. Weiss High End website has a very clear document for its reasons:

"Why Firewire? Firewire is a peer-to-peer protocol, meaning that every device on a Firewire network is equally capable of talking to every other device. Two video cameras on a Firewire network can share data with each other. A Firewire audio interface could save sound data directly to a Firewire hard drive. Your computer is just another peer on this network, and has no inherent special status.

Firewire is always implemented in hardware, with a special controller chip on every device. So the load it puts on your CPU is much lighter than USB communications load, and you're much less likely to lose any sound data just because you're running fifteen things at once. Specialized hardware usually makes things faster and more reliable, and this is one of those times.

But the real reason Firewire is more reliable than USB is more fundamental than that. It's because Firewire allows two operating modes. One is asynchronous, similar to what USB uses. The other is isochronous mode, and it lets a device carve out a certain dedicated amount of bandwidth that other devices can't touch. It gets a certain number of time slices each second all its own. The advantages for audio should be obvious: that stream of data can just keep on flowing, and as long as there isn't more bandwidth demand than the wire can handle (not very likely) nothing will interfere with it. No collisions, no glitches.

From a practical perspective, this also makes it safer to send a lot more audio via Firewire. That's why most of the multichannel interfaces (16 channels, 24 channels, etc.) are Firewire devices, and USB devices usually just send a two-channel stereo signal.

For hooking up your mouse, keyboard or thumb drive, USB is plenty fast and plenty cheap. For hard drives, either one will do (although Firewire is somewhat more reliable). For audio devices, USB will do fine if no other devices are competing with it and if you have processor room to spare. But Firewire will always be able to handle more load with lower latency and no glitches, because it has resources it can set aside to make sure your audio gets where it needs to go."


(6) What about WordClock connection?

Some soundcards have wordclock input. It can improve the performance if you slave the soundcard with a high quality master clock. Remember that every digital audio setup should contain 1 reference clock only.

For example, there is a CD transport, an upsampler and a DAC digital playback combo. The CD transport reads the CD data and outputs with 44.1kHz timing information. The upsampler will be slave to this incoming clock signal, and generate a new sampling rate with a new clock signal. The DAC will again locks to this timing information. In this system, the CD transport is the master.

DAC master mode: Some devices allow DAC to work as master clock. The DAC outputs reference clock signal via Wordclock output. The CD transports/upsampler sync with this timing information. This mode produces better quality because of the DAC crystal is used for reference. It is closest to the DAC section (shorter signal path), hence a better result compares with multiply locking stages.

Master Clock mode: If you have a master clock, you can hook it up with all 3 devices. Every device works under this sync should produce more accurate timing.

These are general comments for various connections. It really depends on different design and approach. For example the original Weiss DAC1 has wordclock I/O. But after in depth researches, the wordclock input will never works better than its own DSP reclocking PLL performances. So the workclock input will always produce inferior result. Therefore Weiss decided to take this feature off.



(7) What about atomic rubidium clock? The precision is so many times more accurate than the normal oscillators.

Atomic clock is highly accurate. It takes 1000 years to shifts a second. There are indeed high end manufacturers use this method. If you look at these atomic clock internal structure, you will find there are quite some heavy PLL sections around. Atomic clock works in 10Mhz output. The master clock that needs to pull this reference frequency back to usable range, is certainly needs some extra devices. In another meaning, the PLL that may clean or pollute the clock source is in the signal path. We can hardly say atomic accuracy provides better result. Actually the result may even worse after those multiply PLLs.

As you may read from these articles, audiophiles are easily agree on certain audio theory with their traditional audio knowledge. Many of these wrong thoughts have restricted their own potential for higher quality playback.

The computer technologies has gone mature with digital audio. As of this writing, Apple just launched iTunes 8, which gives much better album display and GUI. As an audiophile, I do wish technologies can bring us higher quality result. For example high resolution 24bit and high sample rate recordings and playbacks.

It does not has to be more affordable because people who seek for "better" sound should always willing to give "more". But I wish audiophiles can spend their money on something that really improves their sounding, otherwise our new generation audiophiles will never exist because of the contradictive theory.


(1) Chapter One Review:

Last time we talked about how digital audio works. In every second of CD playback, there are 44,100 digital samples, which contain 16bit word-length (00000 to 65535). A lot of data is read from the disc, and these data can be perfectly transfer from one media to another. For example, we just post one of our upcoming album's song "Lush Life" for audiophile download and preview free of charge. The DXD version is over 600MB. Every second of this particular file carries 384,000 digital samples, that contains 24bit word-length each. A Canada audiophile took 2 hours to download this file to his harddisk, When we compare the file in his harddisk, with our server, they are 100% identical.

(2) Okay, if things are such simple and perfect, every digital source should has the same sounding?
Things are more complex than this, however we shall not forget the principal of digital audio existence. If the data is kept in digital domain, you can PERFECTLY clone, compare and verify. When you extract the data and playback through a sound interface, the interface will more or less affect the playback quality.

(3) That's starting more likely an audiophile article rather than plain pure boring digital theory.
When digital data is read and send from a soundcard through normal digital audio standard connections (AES/EBU, SPDIF, TOSLINK) it is necessary to include some extra information (how many bits, what sample rate) with the pure binary code, otherwise the DAC will not know how to process the incoming data. The timing information error becomes a very well know audiophile term "jitter".

(4) Jitter = timing error
Timing information for digital audio is like a music conductor in musical term. It controls the timing between different sections. Let's say all members in an orchestra play perfectly with their own parts, but a bad conductor completely messes up with their timing, the orchestra performance will obviously poor.

(5) How can this timing information create?
Another name of this timing information is called "Clock signal'. In most digital devices, crystals are installed to generate this timing (clock) signal. A better crystal can provide more accurate timing information. However, we must be careful that the accuracy of crystal does not show the whole picture. The problem most of the time lies on other areas such as power supply, temperature, clock signal path etc.

(5) Will that show a soundcard can never be good source compares to my ultra expansive CD transport?

That depends on how we define what is a "good source". Moreover, a lucky thing that jitter can be recovered and cleaned by good de-jittering devices and DACs. In all DACs, after it received the incoming digital data and timing information, it will match the clock input with its own reference timing, to form a pool. This is a basic PLL (Phase Lock Loop).

(6) Does it mean there is another clock inside the DAC?
Yes, DAC actually has 2 clocks (crystals). One is based on 44.1kHz, the other is based on 48kHz. If you meet a problematic DAC, try lock it with different sampling rates. If the problem exists on sampling rate 44.1kHz, 88.2kHz, 176.4kHz, and working fine with 48kHz, 96kHz, 192kHz, you may have a bad crystal.

(7) Would you suggest a better DAC, or a better CD Transport?
Good DAC can clean up incoming jitter from bad sources. It is hard to define the term "good" Transport. First of all it's a pure digital output device, the data should always be the same. If we talk about the jitter output, even if it provides perfect shape, it will(can) be alternate by the DAC clock/pool. Jitter can be cleaned/polluted in between the whole signal path. What's matter is how accurate this timing information reaches the digital audio conversion point. (That's means the clock input of DAC chipset) There were audiophiles who chained multi de-jittering boxes together a head of a DAC. If the DAC PLL circuit is poorly designed, jitter can be even higher than before it takes any de-jittering stage.

(8) Does transport mechanism vibration, power supply affect the jitter performance? If they do, then computer must not be a good thing for audio playback.
Yes they can affect the jitter performance. Again a good DAC can clean the incoming jitter. Computer is standard machine with standard power supply. The soundcard design is rather more directly related issue. Please do not custom made MIT caps for your 450Watt ATX power supply. This can not improve the audio performance.

(9) Can a faster CPU, newer graphic card, more ram or faster harddisk affects audio performance?
They do not affect the standard stereo playback quality. Stereo audio playback demands very little resources with today computer standard. Audiophile should not worry about the motherboard, CPU and computer performances. In recording, this case is rather different. The channel limitation of how one digital audio workstation (DAW) can recorded/playback is determine by the computer power (or specialized DSP soundcard such as Protools/Pyramix/Sadie).

(10) What about digital dropout that I heard from computer playback?
When a soundcard is playing/recording audio, it needs buffer to store a small amount of data. The larger the buffer size, the longer the latency. The buffer prevents audio dropout. The higher performance computer can reaches smaller buffer size, hence shorter latency. This lightly affects audiophile 2 channels playback. In studio recording/mixing world, multichannel playback/record latency becomes a critical elements. On Windows PC domain, there is a DPC latency checker that analyses the capabilities of a computer system to handle real-time data streams properly. It may help to find the cause for interruptions in real-time audio and video streams, also known as drop-outs.

(11) Is that Ram buffering? There are programs which store the entire audio song(s) in the computer ram and playback from there. Can this improve audio quality because of its isolation and no mechanical movement?
The audio data will be sent out by the soundcard, which determines the clock signal accuracy. The data storage location has no effect on audio quality. The latest SSD ram drive will not improve audio quality over SATA hard disk.

That's it for now.
Next time we will focus on newly development computer interface, USB, firewire IEEE1394, and take a closer look to various clocking methods and playback software/processing. If you have any question would like to discuss, please feel free to tell.
I learnt this term CAS (Computer As Source) recently from various local hifi websites and forums. While I enjoy more people are using computer technology for media playback, I read too many mysterious complicated methods. Some of them are even non-related to audio quality. A lot of discussions has shown lacking basic knowledge in digital audio. I decided to give some comments and build FAQ in this topic. This is not a technical paper, but aimed to explain some facts to audiophile in easy to understand perspective.

(1) Why use CAS?
Computer can be a "PERFECT" digital media playback machine. The word "PERFECT" means that the digital data output from computer is 100% same as what originally recorded on the media. I'm sure audiophile are skeptical about digital perfect idea. Before we go on, let's review some basic digital audio theory.

(2) Sample rate
Define the number of samples per second. Use CD as an example, CD has 16bit/44.1kHz. which means 44,100 samples in a second (88,200 stereo samples).

(3) Digital Word-length
Digital world uses binary numbers. It includes 1 and 0 only. Wordlength means that how many digits are used to represent a number. A word-length of 4 bits (4 digits) can have values from 0000 to 1111 (in decimal, 0-15).

(4) What about CD?
CD has 16 bits wordlength, which means in every sample, it carries 16 bits of information (in decimal, the possible value range between 00000 to 65535). As mentioned on the above, CD 44.1kHz sample rate equals to 88,200 stereo samples, each with 16bits word-length. Well, it all means tons of data are represent within one second of music.

(5) A "Perfect" Ripping?
In CAS, audiophiles rip their CDs to computer harddisk and playback via harddisk. There are many programs to perform the ripping process. A perfect ripped disc means whatever data/music is stored on the CD, is identical to what is stored on the hard disk. There is NO loss of data. The same idea like you copy a digital photo from your DC to computer. They look the same, 100% the same.

(6) How can a cheap CD-Rom can perform this task?
Maybe it cannot in 52x real times, but the perfect ripping program should read the disc in multiply times and verify of all data are exactly the same. The most well known CD ripping program is called EAC (Exact Audio Copy). There is no 30kg weight Esoteric CD-Rom in this world for a reason, although I do love and respect its strong and ultra smooth mechanism.

(7) What about perfect digital audio playback?
A perfect digital audio playback means the digital audio at the output (no matter AES/EBU, SPDIF, TOSLINK or HDMI) carries the identical digital data.

(8) How can you be so sure of this?

We can test it. The procedure is simple. Connect your computer digital output to digital input and forms a loop. Record/Playback the same test signal, compare the recorded music file with the playback signal, they should be identical.

(9) I want to know a little bit more about how to compare both files? A Null test?


Import both files (File A & B) to a computer audio editor. Align both files (A & B) in sync (identical starting point) as you never can hit "PLAY" & "RECORD" buttons at the exact same time, and delay between the digital I/O.

Reverse L/R channels phase on either File A, or File B, but not both files. Playback both files together. Since the digital audio editor mixes (sum) 2 files together, they cancel each other out and there should have no signal on the playback. With serious professional digital audio workstation (DAW), this null out perfectly down to -144dBFS (24bit).


(10) What about if they are not identical?


This means there is something wrong with either your digital input or digital output. Consult your soundcard manufacturer for further investigation.