CS4 — Data Representation: Sound | N5 Computing Science

Learning intentions

Understand that sound is an analogue signal that must be converted to digital for computer storage
Describe the process of sampling an analogue sound wave
Explain what sampling frequency (sample rate) means and how it affects quality
Explain what bit depth means in the context of sound and how it affects quality
Calculate the file size of a digital audio recording
Compare WAV and MP3 audio file formats in terms of compression, quality, and file size

Success criteria

I can explain what analogue and digital signals are, and why conversion is needed
I can describe what sampling frequency means and give the standard CD sampling rate
I can explain how higher sampling frequency and bit depth improve quality but increase file size
I can calculate the file size of a digital audio file given sample rate, bit depth, and duration
I can compare WAV and MP3, stating that WAV is uncompressed and MP3 is lossy compressed

Key vocabulary

Analogue signal

A continuously varying signal (like a sound wave in air) that can take any value within a range.

Digital signal

A signal represented by discrete (step-by-step) values — in computing, always 1s and 0s.

ADC

Analogue-to-Digital Converter; the hardware that measures (samples) an analogue signal and converts it to binary numbers.

Sampling

The process of measuring the amplitude of an analogue sound wave at regular intervals.

Sampling frequency

The number of times per second a sound is sampled; measured in Hertz (Hz) or kilohertz (kHz). CD quality is 44,100 Hz (44.1 kHz).

Bit depth

The number of bits used to store each sample; determines how accurately the amplitude is recorded. Higher bit depth = more accurate = larger file.

File size (audio)

The size in bits is calculated as: sampling frequency × bit depth × duration (in seconds). For stereo, multiply by 2.

WAV

Waveform Audio File Format; typically uncompressed, very high quality, large file size.

MP3

A lossy compressed audio format; reduces file size to ~10% of WAV by removing sounds humans are less likely to notice.

Data Representation: Sound

Sound is analogue — computers are digital

Sound is produced by vibrations in the air — pressure waves that travel to your ears and vibrate the eardrum. This is an analogue signal: it varies continuously and smoothly, taking any value along a continuous range. A microphone converts these pressure waves into an electrical signal that mirrors the shape of the original wave.

Computers, however, can only work with digital data — discrete values stored as binary numbers. To store or process sound on a computer, the continuous analogue signal must be converted into a series of numbers. This is done by a hardware component called an Analogue-to-Digital Converter (ADC).

Sampling — measuring the wave

The ADC measures (samples) the amplitude (height) of the sound wave at regular intervals. Each measurement is stored as a binary number. The more frequently the wave is measured, and the more precisely each measurement is recorded, the more accurately the digital version recreates the original sound.

Figure: an analogue sound wave (green curve) measured at regular intervals. Each dot is one sample, stored as a binary number.
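
The measuring step can be sketched in Python. This is only an illustration, not how an ADC is built: the function name `sample_wave`, the pure sine wave, and the tiny rates (a 1 Hz wave sampled 8 times per second) are assumptions chosen to keep the output readable.

```python
import math


def sample_wave(wave_frequency_hz, sampling_frequency_hz, duration_s):
    """Measure the amplitude of a sine wave at regular intervals."""
    sample_count = int(sampling_frequency_hz * duration_s)
    interval_s = 1 / sampling_frequency_hz  # time between two samples
    samples = []
    for n in range(sample_count):
        t = n * interval_s  # the moment this sample is taken
        amplitude = math.sin(2 * math.pi * wave_frequency_hz * t)
        samples.append(round(amplitude, 3))
    return samples


# Assumed values: a 1 Hz wave sampled 8 times per second for 1 second.
samples = sample_wave(wave_frequency_hz=1, sampling_frequency_hz=8, duration_s=1)
print(len(samples))  # 8
print(samples)
```

Raising `sampling_frequency_hz` produces more samples for the same second of sound, which is why a higher sampling frequency follows the wave more closely and takes more storage.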

Sampling frequency is the number of samples taken per second, measured in Hertz (Hz) or kilohertz (kHz). Common values:

CD audio: 44,100 Hz (44.1 kHz) — the industry standard for music
DVD / studio audio: 48,000 Hz or 96,000 Hz
Telephone / voice: 8,000 Hz — adequate for speech, noticeably poor for music

The Nyquist theorem states that sampling frequency must be at least twice the highest frequency you want to capture. Humans can hear up to roughly 20,000 Hz, so a minimum sample rate of 40,000 Hz is needed — CD audio at 44.1 kHz satisfies this with a small margin.
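
The Nyquist rule is simple arithmetic, so it can be checked with a short Python sketch. The function names here are hypothetical, and the rates are the ones listed above.

```python
def highest_capturable_frequency(sampling_frequency_hz):
    """Highest sound frequency a given sample rate can capture (half the rate)."""
    return sampling_frequency_hz / 2


def minimum_sampling_frequency(highest_frequency_hz):
    """Lowest sample rate needed to capture a given sound frequency (twice it)."""
    return 2 * highest_frequency_hz


print(minimum_sampling_frequency(20_000))    # 40000
print(highest_capturable_frequency(44_100))  # 22050.0
print(highest_capturable_frequency(8_000))   # 4000.0
```

The last line suggests why telephone audio is adequate for speech but poor for music: at 8,000 Hz nothing above 4,000 Hz can be captured.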

Bit depth — accuracy per sample

Each sample is stored as a binary number. The number of bits used per sample is called the bit depth. A higher bit depth means each amplitude measurement can be recorded more precisely:

8-bit audio: 2⁸ = 256 possible amplitude values — audible "stepping" artefacts
16-bit audio (CD standard): 2¹⁶ = 65,536 possible values — the steps are imperceptibly small
24-bit audio (studio recording): 2²⁴ ≈ 16.7 million values

Increasing bit depth increases accuracy and therefore sound quality — but also increases file size, since more bits are stored per sample.
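
As a rough sketch of how bit depth limits precision, the Python below counts the available levels and rounds an amplitude to the nearest one. The function names, the amplitude range of -1.0 to 1.0, and the simple linear mapping are assumptions made for illustration. Real audio hardware usually stores samples in other integer formats.

```python
def amplitude_levels(bit_depth):
    """Number of different values a sample of this bit depth can store."""
    return 2 ** bit_depth


def quantise(amplitude, bit_depth):
    """Map an amplitude between -1.0 and 1.0 onto the nearest stored level.

    Assumed scheme: level 0 is the lowest amplitude and the top level
    is the highest amplitude.
    """
    top_level = amplitude_levels(bit_depth) - 1
    scaled = (amplitude + 1) / 2 * top_level
    return round(scaled)


print(amplitude_levels(8))   # 256
print(amplitude_levels(16))  # 65536
print(amplitude_levels(24))  # 16777216

print(quantise(1.0, 8))   # 255
print(quantise(-1.0, 8))  # 0
```

With only 256 levels, many slightly different amplitudes are rounded to the same stored number. With 65,536 levels the rounding error is far smaller.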

Calculating audio file size

The file size of an uncompressed audio file is calculated with the formula:

File size (bits) = sampling frequency × bit depth × duration (seconds)

For stereo (two independent audio channels — left and right speaker), multiply the result by 2.

Example: a 3-minute (180-second) stereo WAV file at 44,100 Hz and 16-bit depth:
= 44,100 × 16 × 180 × 2 = 254,016,000 bits = 31,752,000 bytes ≈ 30.3 MB (divide bits by 8 to get bytes, then divide bytes by 1,024 twice to get MB)

This is why uncompressed audio files are large, and why compression formats like MP3 were developed.
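
The formula and the unit conversions can be written as a short Python sketch. The function and parameter names are hypothetical. The conversion assumes 1 MB = 1,024 × 1,024 bytes, which matches the figures used in this lesson.

```python
def audio_file_size_bits(sampling_frequency_hz, bit_depth, duration_s, channels=1):
    """Uncompressed size in bits. Use channels=2 for stereo."""
    return sampling_frequency_hz * bit_depth * duration_s * channels


def bits_to_megabytes(bits):
    """Convert bits to megabytes, assuming 1 MB = 1,024 x 1,024 bytes."""
    size_bytes = bits / 8
    return size_bytes / 1024 / 1024


# The 3-minute stereo example from above.
bits = audio_file_size_bits(44_100, 16, 180, channels=2)
print(bits)                               # 254016000
print(bits // 8)                          # 31752000
print(round(bits_to_megabytes(bits), 1))  # 30.3
```

Making `channels` a parameter with a default of 1 means the stereo multiplier has to be stated explicitly, which mirrors checking the question for "mono" or "stereo".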

File formats — WAV and MP3

WAV (Waveform Audio File Format) stores audio with no compression applied. Every sample is saved exactly as recorded. This gives very high quality but large file sizes — suitable for studio recording and professional audio editing, but impractical for streaming or storing large music libraries.

MP3 is a lossy compressed format. It reduces file size to roughly 10% of the equivalent WAV by permanently removing audio data that humans are statistically unlikely to notice:

Sounds at the very high and low extremes of human hearing are removed
When two sounds occur simultaneously and one is much louder, the quieter sound may be discarded (the masking effect)

The result is a much smaller file with acceptable quality for most listeners. A 30 MB WAV file might compress to about 3 MB as MP3. The critical trade-off is that the removed data is permanently lost: converting an MP3 back to WAV does not restore the original audio.

Property	WAV	MP3
Compression	None (uncompressed)	Lossy compression
Quality	Very high — no data removed	Good — some perceptual data lost
File size	Large (∼30 MB per 3 min)	Small (∼3 MB per 3 min)
Typical use	Studio recording, editing	Music streaming, portable devices
Data loss	None	Permanent — cannot be undone

Worked examples

Example 1 — File size calculation (mono)

A 10-second mono audio recording is made at a sampling frequency of 8,000 Hz and a bit depth of 8. Calculate the file size in bits and bytes.

Write the formula: File size (bits) = sampling frequency × bit depth × duration

Substitute values: 8,000 × 8 × 10 = 640,000 bits

Convert to bytes: 640,000 ÷ 8 = 80,000 bytes (≈ 78.1 KB, using 1 KB = 1,024 bytes)

Example 2 — File size calculation (stereo)

A 1-minute stereo recording is made at 44,100 Hz and 16-bit depth. Calculate the file size in bits.

Convert duration to seconds: 1 minute = 60 seconds

Calculate mono file size: 44,100 × 16 × 60 = 42,336,000 bits

Multiply by 2 for stereo (two channels): 42,336,000 × 2 = 84,672,000 bits (= 10,584,000 bytes ≈ 10.1 MB)

Example 3 — Effect of changing sample rate

A recording is made at 44,100 Hz. A second identical-length recording is made at 22,050 Hz (half the sample rate). How does this affect file size and quality?

File size: Halving the sampling frequency halves the number of samples stored per second — so the file size is halved.

Quality: Fewer samples means the reconstructed wave is less accurate, and detail is lost at higher frequencies in particular. At 22,050 Hz, frequencies above about 11,025 Hz (half the sample rate) cannot be captured, so quality decreases noticeably.

Trade-off: Lower sample rate → smaller file + lower quality. Higher sample rate → larger file + higher quality. Neither change is free.
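
The halving effect can be checked with a small Python sketch. The example does not state a bit depth or duration, so the 16-bit depth and 60-second length below are assumptions. Any values give the same ratio as long as both recordings use the same ones.

```python
def audio_file_size_bits(sampling_frequency_hz, bit_depth, duration_s, channels=1):
    """Uncompressed size in bits. Use channels=2 for stereo."""
    return sampling_frequency_hz * bit_depth * duration_s * channels


# Assumed for illustration: 16-bit mono, 60 seconds for both recordings.
full_rate = audio_file_size_bits(44_100, 16, 60)
half_rate = audio_file_size_bits(22_050, 16, 60)

print(full_rate)              # 42336000
print(half_rate)              # 21168000
print(half_rate / full_rate)  # 0.5
```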

Example 4 — Comparing WAV and MP3 file sizes

A 5-minute song is saved as a WAV file (44,100 Hz, 16-bit, stereo). Estimate the MP3 file size at 10% of the WAV size.

Convert duration: 5 minutes = 300 seconds

WAV file size: 44,100 × 16 × 300 × 2 = 423,360,000 bits = 52,920,000 bytes ≈ 50.5 MB

MP3 estimate: 50.5 MB × 10% ≈ 5 MB. The MP3 is about one tenth of the size, at the cost of permanently lost audio data.
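
The same estimate as a Python sketch. The 10% ratio is this lesson's rule of thumb, not a fixed property of MP3, because the real ratio depends on the export settings. The names used are hypothetical.

```python
MP3_RATIO = 0.10  # rule of thumb from this lesson; an assumption, not a constant


def wav_size_megabytes(sampling_frequency_hz, bit_depth, duration_s, channels=1):
    """Uncompressed size in MB, assuming 1 MB = 1,024 x 1,024 bytes."""
    bits = sampling_frequency_hz * bit_depth * duration_s * channels
    return bits / 8 / 1024 / 1024


def estimate_mp3_megabytes(wav_megabytes, ratio=MP3_RATIO):
    """Rough MP3 size: a fixed fraction of the WAV size."""
    return wav_megabytes * ratio


wav_mb = wav_size_megabytes(44_100, 16, 5 * 60, channels=2)
print(round(wav_mb, 1))                          # 50.5
print(round(estimate_mp3_megabytes(wav_mb), 1))  # 5.0
```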

Now you try

A 30-second mono audio clip is recorded at a sampling frequency of 22,050 Hz with a bit depth of 16.

Calculate the file size in bits and in bytes. Show your working.

Write the formula: File size (bits) = sampling frequency × bit depth × duration
Substitute values: 22,050 × 16 × 30 = 10,584,000 bits
Convert to bytes: 10,584,000 ÷ 8 = 1,323,000 bytes ≈ 1.26 MB
Note: The question specifies mono — no ×2 needed.

Common mistakes

✗ Forgetting to multiply by 2 for stereo. Stereo recordings have TWO channels: left and right. Always check whether a question specifies mono or stereo before applying the formula.

✗ Confusing sampling frequency with bit depth. Sampling frequency = how often you measure (samples per second). Bit depth = how precisely you measure (bits per sample). Both affect quality and file size, but through different parts of the formula.

✗ Saying "MP3 is better quality than WAV." MP3 is smaller, not better quality. MP3 permanently discards audio data. WAV retains every sample exactly as recorded. The right choice depends on the use case.

✗ Leaving a value out of the formula. The formula is: sampling frequency × bit depth × duration (× 2 for stereo). The order of multiplication does not change the answer, but a missing or misplaced value does, so always write the formula out before substituting values.

✗ Using minutes instead of seconds. Duration in the formula must be in seconds. Always convert before calculating, e.g. 3 minutes = 180 seconds.
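
To see how far off two of these mistakes put the answer, the Python sketch below compares a correct calculation with the two most common slips. The 3-minute stereo recording reuses the earlier example, and the function name is hypothetical.

```python
def audio_file_size_bits(sampling_frequency_hz, bit_depth, duration_s, channels=1):
    """Uncompressed size in bits. Use channels=2 for stereo."""
    return sampling_frequency_hz * bit_depth * duration_s * channels


duration_minutes = 3

# Correct: duration converted to seconds, two channels for stereo.
correct = audio_file_size_bits(44_100, 16, duration_minutes * 60, channels=2)

# Mistake 1: the stereo multiplier is forgotten.
forgot_stereo = audio_file_size_bits(44_100, 16, duration_minutes * 60)

# Mistake 2: minutes are used where seconds are needed.
used_minutes = audio_file_size_bits(44_100, 16, duration_minutes, channels=2)

print(correct // forgot_stereo)  # 2
print(correct // used_minutes)   # 60
```

Forgetting stereo gives an answer that is half the correct size. Using minutes gives an answer that is 60 times too small.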

Exam tip

This is an extension topic: it does not appear in the mandatory N5 SQA Computer Systems exam specification. However, audio concepts (sampling, bit depth, WAV vs MP3) do appear in the Web Design and Development (WDD) unit specification under "Factors affecting file size and quality". If you choose the WDD question in the exam, this content is directly relevant.

The file size calculation has the same structure as the one for bitmap graphics: both multiply the number of items stored (samples for sound, pixels for a bitmap) by the number of bits used for each item. Recognising this pattern makes both topics easier to remember and apply.
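
The shared pattern can be sketched with a single Python function. The name `storage_bits` and the 100 × 100 pixel, 8-bit image are hypothetical values chosen for illustration. The audio figures are from Example 1.

```python
def storage_bits(item_count, bits_per_item):
    """How many things multiplied by how many bits each."""
    return item_count * bits_per_item


# Sound: the items are samples (8,000 per second for 10 seconds, 8 bits each).
sample_count = 8_000 * 10
print(storage_bits(sample_count, 8))  # 640000

# Bitmap: the items are pixels (assumed 100 x 100 image, 8 bits per pixel).
pixel_count = 100 * 100
print(storage_bits(pixel_count, 8))   # 80000
```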

Teacher notes

Suggested timing: ~40 minutes. Warm up 5 min; notes + diagram 15 min; worked examples 8 min; now you try 4 min; task set 8 min.

Extension status: This lesson is not assessed in the SQA N5 Computer Systems exam. Deliver it if time allows after CS3 (Graphics). It is worthwhile because audio file concepts (sampling frequency, bit depth, WAV vs MP3) appear in the WDD unit specification — pupils choosing WDD in the exam benefit directly.

Key connection to reinforce: The file size formula is structurally identical to the bitmap formula (pixels × bit depth). Making this explicit is powerful — it shows pupils the same underlying principle at work in two different contexts: "how many things × how many bits each". This cross-unit connection strengthens retention.

Key misconception to address: Pupils often say "MP3 is better quality than WAV" because it is more familiar. Correct this clearly: MP3 is smaller, not better. WAV retains all data; MP3 permanently discards some. Also watch for pupils forgetting the ×2 for stereo.

Live demo suggestion: Play the same audio clip as high-quality WAV vs heavily compressed MP3 — pupils can hear the difference, which motivates the theory. Audacity (free) lets you export at different quality settings side by side.

The Nyquist theorem: Background knowledge appropriate at this level; will not be examined at N5. Keep brief. Telephone quality at 8 kHz is a useful hook — pupils can relate to voice calls sounding different from music.

SQA command words covered: "Describe" (sampling process), "Explain" (WAV vs MP3, effect of parameters on quality and file size), "Calculate" (file size formula).