This version of this document is no longer maintained. For the latest documentation, see http://www.qnx.com/developers/docs.

Audio Architecture

This chapter includes:

QNX Sound Architecture
Cards and devices
Control device
Mixer devices
Pulse Code Modulation (PCM) devices

QNX Sound Architecture

In order for an application to produce sound, the system must have:

hardware in the form of a sound card or sound chip
a device driver for the hardware
a well-defined way for the application to talk to the driver, in the form of an Application Programming Interface (API).

This whole system is referred to as the QNX Sound Architecture (QSA). QSA has a rich heritage and owes a large part of its design to version 0.5.2 of the Advanced Linux Sound Architecture (ALSA), but as both systems continued to develop and expand, direct compatibility between the two was lost.

This document concentrates on defining the API and providing examples of how to use it. But before defining the API calls themselves, you need a little background on the architecture itself. For those who want to jump in right away, full source for examples of a “wav” player and “wav” recorder are included in the appendix.

Cards and devices

The basic piece of hardware needed to produce or capture (i.e. record) sound is an audio chip or sound card, referred to simply as a card. QSA can support more than one card at a time, and can even mount and unmount cards “on the fly” (more about this later). All the sound devices are attached to a card, so in order to reach a device, you must first know what card it's attached to.

Cards

Cards and devices.

The devices include:

Control
Mixer
Pulse Code Modulation (PCM)

You can list the devices that are on your system by typing:

ls /dev/snd

The resulting list includes one control device for every sound card, starting from card 0, as well as the PCM and mixer devices for each card.

Control device

There's one control device for each sound card in the system. This device is special because it doesn't directly control any real hardware. It's a concentration point for information about its card and the other devices attached to its card. The primary information kept by the control device includes the type and number of additional devices attached to the card.

Mixer devices

Mixer devices are responsible for combining or mixing the various analog signals on the sound card. A mixer may also provide a series of controls for selecting which signals are mixed and how they're mixed together, adjusting the gain or attenuation of signals, and/or the muting of signals.

For more information, see the Mixer Architecture chapter.

Pulse Code Modulation (PCM) devices

PCM devices are responsible for converting digital sound sequences to analog waveforms, or analog waveforms to digital sound sequences.

Each device operates only in one mode or the other. If it converts digital to analog, it's a playback channel device; if it converts analog to digital, it's a capture channel device.

The attributes of PCM devices include:

the data formats that the device supports (16-bit signed little endian, 32-bit unsigned big endian, etc.) For more information, see “Data formats,” below.
the data rates that the device can run at (48KHz, 44.1kHz etc.)
the number of streams that the device can support (e.g. 2-channel stereo, mono, and 4-channel surround)
the number of simultaneous clients that the device can support, referred to as the number of subchannels the device has. Most sound cards support only 1 subchannel, but some cards can support more; for example, the Soundblaster Live! supports 32 subchannels).
The maximum number of subchannels supported is a hardware limitation. On single-subchannel cards, this limitation is artificially surpassed through a software solution: the software subchannel mixer. This allows 8 software subchannels to exist on top of the single hardware subchannel.
The number of subchannels that a device advertises as supporting is defined for the best-case scenario; in the real world, the device might support fewer. For example, a device might support 32 simultaneous clients if they all run at 48 kHz, but might support only 8 clients if the rate is 44.1 kHz. In this case, the device advertises 32 subchannels.

Data formats

The QNX Sound Architecture supports a variety of data formats. The <asound.h> header file defines two sets of constants for the data formats. The two sets are related (and easily converted between) but serve different purposes:

SND_PCM_SFMT_*: A single selection from the set of data formats. For a list of the supported formats, see snd_pcm_get_format_name() in the Audio Library chapter.
SND_PCM_FMT_*: A group of (one or more) formats within a single variable. This is useful for specifying the format capabilities of a device, for example.

Generally, the SND_PCM_FMT_* constants are used to convey information about raw potential, and the SND_PCM_SFMT_* constants are used to select and report a specific configuration.

You can build a format from its width and other attributes, by calling snd_pcm_build_linear_format().

You can use these functions to check the characteristics of a format:

PCM state machine

A PCM device is, at its simplest, a data buffer that's converted, one sample at a time, by either a Digital Analog Converter (DAC) or an Analog Digital Converter (ADC), depending on direction. This simple idea becomes a little more complicated in QSA because of the concept that the PCM subchannel is in a state at any given moment. These states are defined as follows:

SND_PCM_STATUS_NOTREADY: The initial state of the device.
SND_PCM_STATUS_READY: The device has its parameters set for the data it will operate on.
SND_PCM_STATUS_PREPARED: The device has been prepared for operation and is able to run.
SND_PCM_STATUS_RUNNING: The device is running, transferring data to or from the buffer.
SND_PCM_STATUS_UNDERRUN: This state happens only to a playback device and is entered when the buffer has no more data to be played.
SND_PCM_STATUS_OVERRUN: This state happens only to a capture device and is entered when the buffer has no room for data.
SND_PCM_STATUS_PAUSED: Not supported by QSA.

PCM states

General state diagram for PCM devices.

The transition between states is the result of executing an API call, or the result of conditions that occur in the hardware. For more details, see the Playing and Capturing Audio Data chapter.

Software PCM mixing

In the case where the sound card has a playback PCM device with only one subchannel, the device driver writer can choose to include a PCM software mixing device. This device simply appears as a new PCM playback device that supports many subchannels, but it has a few differences from a true hardware device:

The mixing of the PCM streams is done in software using the CPU. Even with only one stream, the CPU is used more than if the hardware device is used.
When the PCM software mixer is started, it opens a connection to the real hardware device. If the real hardware device is already in use, the PCM software mixer can't run. Likewise, if the PCM software mixer is running, the real hardware device is in use and is unavailable.

The PCM software mixer is specifically attached to a single hardware PCM device. This one-to-one mapping allows for an API call to identify the PCM software-mixing device associated with its hardware device.

PCM plugin converters

In some cases, an application has data in one form, and the PCM device is capable of accepting data only in another format. Clearly this won't work unless something is done. The application — like some MPG decoders — could reformat its data “on the fly” to a format that the device accepts. Alternatively, the application can ask QSA to do the conversion for it.

The conversation is accomplished by invoking a series of plugin converters, each capable of doing a very specific job. As an example, the rate converter converts a stream from one sampling frequency to another. There are plugin converters for bit conversions (8-to-16-bit, etc.), endian conversion (little endian to big endian and vice versa), channel conversions (stereo to mono, etc.) and so on.

The minimum number of converters is invoked to translate the input format to the output format so as to minimize CPU usage. An application signals its willingness to use the plugin converter interface by using the PCM plugin API functions. These API functions all have plugin in their names. For more information, see the Audio Library chapter.

Don't mix the plugin API functions with the nonplugin functions.