Section 1: basic terms
Mechanical Recording
Audio is produced by the air being alternately compressed and released by something vibrating or moving. The first recordings were made by taking this motion and using it to move a needle that was placed over a rotating wax cylinder. The needle’s motions made grooves in the wax.
In order to play the recording, the needle was placed in the wax groove and it’s motion was directed to a large horn that amplified the sound. No electricity was needed, as the cylinders rotation was spring powered. The early audio recordings were mono only, a single audio channel.
Vinyl Records eliminated the cylinder, and AC power eliminated the spring and added an amplifier for louder sound. Copies were made by making literal casts of the recording and pressing it onto warm vinyl.
Single microphones were used at first, and the audio was often recorded directly to vinyl recording as seen in “Brother where art thou?”
Multiple microphones allowed an audio engineer to control the volume of the various instruments or voices during a performance.
Electronic recording began with Magnetic wire recorders, which were created and used in an office environments for use by executives. A fluctuating magnetic field magnetized the wire as it moved past the recording head. Music was a secondary concern.
Steel wire was a challenge to use and store, so eventually a long strip of plastic, coated with iron oxide was introduced. Recording tape has been gradually improved since its introduction.
Stereo recording added 2 channels, left and right. This was a vast improvement on the original mono recordings, a path which eventually led to the now abandoned QUAD recordings, and eventually to the 6 channel 5.1 recording format used for DVD now. Other multi channel formats exist.
My favorite is 10.2, marketed as “Twice as good as 5.1!”
Mixing boards expanded to match the track count of the recorders. Outboard effects such as reverb, delay, compressors, equalizers, and more could easily to used on individual tracks. This added a LOT of creative control to the musicians and the audio engineers.
Cassette players gave consumers the ability to record their own music at last, and many a budget minded musician recorded their own performances, or mix tapes of their favorite music.
Multi track cassettes added a multi track capability for consumers, although their limited bandwidth made them more of a vehicle for creating demos, to be re-recorded in a “real” studio.
Section2: Getting Digital
Instead of mechanically recording sound, or recording a fluctuating magnetic field, digital recording slices a sound into thousands of sections and represents each slice with a number.
The bit rate is the number of slices per second. Standard CD audio is sliced into 44.1 thousand numbers per second. This is the bit rate. The range of numbers is the bit depth. CDs have a 16 bit depth, which translates to 65,536 possible values. Generally this standard range is perfectly adequate, and using this standard ensures your music is compatible with most uses. In studios engineers can use higher rates.
An analogue signal (in red) encoded to 4-bit PCM digital samples (in blue); the bit depth is four, so each sample’s amplitude is one of 16 possible values.
A graphic that showed the 44.1 bit rate would have 44,100 vertical lines, and 65,536 horizontal lines. It would be hard to see the audio through the lines, so this graphic is a very simplified visualization.
File formats
Once audio is recorded it can be saved in a bewildering variety of digital file formats. In addition to various bit rates and depths, audio can be mono, stereo or even multi channel. The file type is always placed at the end of the file name. Its important to know what kind of audio files you will need for your specific purpose.
1. Wave files are very high quality, and end in .WAV.
2. AIF is mostly used for Apple systems, and is a very high quality. AIF files end in .AIF.
3. The Ogg Vorbis format is completely lossless, but partly because of the large files sizes is seldom used. These end in .OGG.
4. MP3 the MP3 format has a reduced bit rate, depth and size. It is considered a “lossly” format, (not lousy!) because it loses some of the higher frequency audio considered by some to unnoticeable. This is still hotly debated. The MP3 format reduces the bit rate and consequently the range of audio, literally throwing away the higher frequencies. This greatly reduces the size of the MP3 files, allowing larger quantities of audio to be stored on devices, and eases transmission on the internet.
This was more important in the early days of the internet before high speed and larger storage devices were common. Early conversions to MP3 were not of a high quality, giving MP3 a bad name. This has been largely resolved, but I still recommend using a higher quality format for mix down and distribution wherever possible. Youtube, and other distribution mediums reduce the bit rate, so its best to give them a higher resolution to start with, lest the quality suffers.
5. MP4 are used for audio and video together. These end in .MP4.
6. M4V files are MP4 files with added copy protection. This is the format the files are in when purchased from Apple via iTunes. The file ends in .MV4. Most of the software I will be talking about ignores this copy protection, but this format generally prevents basic file sharing.
For our purposes, we will be sticking to the 44.1, 16 bit .WAV files.
MIDI
MIDI stands for Musical Instrument Digital Interface and was developed in the 1980s by a group of synthesizer manufacturers so that their equipment could communicate with each other. It uses a special round 5 pin connector or a USB connector.
MIDI stands for Musical Instrument Digital Interface and was developed in the 1980s by a group of synthesizer manufacturers so that their equipment could communicate with each other. It uses a special round 5 pin connector or a USB connector.
With MIDI, pressing a key on one synth can control another synthesizer. A computer using MIDI can control a large number of synths, each playing a different part, making a whole ensemble of musical instruments. If you are not a keyboard player, MIDI is not as much use to you, however it is quite powerful and can vastly expand your sonic palette. Notes can be entered one at a time in the editing mode, so it is not necessary to be a skilled keyboard player to create music with MIDI. Guitar, percussion, woodwind and other MIDI controllers exist.
VERY IMPORTANT: MIDI only records which note is played, when, and how hard the note is struck. After a MIDI sequence is recorded it can be further manipulated, allowing control over playback speed, instrument type, and more.
Let me be clear, and repetitive here: MIDI is NOT actual audio, but information that can be used to control musical instruments, similar to a piece of written music. MIDI files are literally list of numbers.
Here is a sample of raw MIDI data.
+9120 9120: 90 39 57
(Note A3, played at 42.3.88 with a velocity of 73, held for 2.1.04 and with an off velocity of 64)
To make it even more fun, it is in Hexadecimal, AKA base 16. Don’t worry, the computer handles the tough math!
Fortunately, the software we will be using can display the notes in a MUCH friendlier piano roll or musical notes.
This example shows some MIDI notes in the “Piano Roll” format. The upper of this illustration section shows the notes, and the lower shows the velocity that the notes were struck with. All can be edited further.
You can also view MIDI files as conventional music notation. There are program to control music notation very precisely such as Sibelius and Logic.
Since it is not audio yet, a MIDI file can be quickly changed to make play a different type of instrument. MIDI modules can have built in keyboards, or be controlled by a separate keyboard. This can save a lot of space!
MIDI is also used to control software synths which are recreations of existing synths, and even creating sound not based on existing sounds at all.
MIDI files can usually be played with a computers built in sounds, although this can be disappointing in the case of stock sound cards. Additional hardware or software can vastly improve the quality of MIDI file sounds.
Section 2: D.A.W – Digital Audio Workstations
A DAW is your software tool for recording. There are lots of choices ranging from free to thousands of dollars. Some of them feature basic video editing also. All record MIDI also.
Protools, Cubase, Logic, Sonar, Garage Band, Reaper, Ableton Live, Fruity Loops, are all very popular.
Garage Band comes pre-installed on most Apple computers. It has a large number of built in instruments and can control external synthesizers.
It is very easy to use, and Apple has regular classes teaching it.
FREE (on Apple only)
Logic is Apple only and is very solid, slightly more challenging than Garage Band, but more powerful. It has a powerful collection of additional instruments and effects.
$200.
Protools is the professional standard, very commonly used in high end studios, and can be used with very expensive hardware. They have switched to a subscription based model, and it is currently $300 PER year for the basic version. I used this for years and loved it.
One nice feature is because it is used in the larger studios, a project created in Pro Tools can be taken from studio to studio easily, or from a home studio to a larger studio. There are ways to do this with other software too.
$300 to $1000 PER year.
Fruity Loops, now called FL Studios is often used in hip hop and other beat or pattern based music types. It has a large number of add-ons, and features lifetime upgrades. It is available for Apple or PC.
$199.00 basic, $900 for the pro version.
Cubase is very useful, but the basic model is limited in adding in additional instruments and tracks. Add ons are extra. An introductory version is often included in hardware bundles.
Apple or PC.
$580 for the base version.
Ableton Live is quite powerful, with support for external instruments, MIDI, software synths and more. It appears to be aimed at loop based composers, but can be used for more traditional recording methods. Perfect if beat making is your goal. There is hardware that compliments this software well.
$99 to $749.
Reaper offers all the features of audio, MIDI and even basic video editing. Runs on both Mac or PC. regular updates for a number of years. The demo version has no limitations, except for a “nag notice”. You can customize it extensively.
This is my favorite DAW, and it’s purely coincidental that it’s the most inexpensive one!
$60
So many choices! I’ve used most of them, and seen them all improve through the years.
How do you choose one over the other?
Other than the Apple only Garage Band and Logic, all run on either platform.
There are financial, musical and social reasons to choose one over the other. Groups of similarly minded musicians tend to use certain programs. For instance, many hip hop and rap artists choose FL Studio. If you are a techno musician I recommend Ableton for its repetitive loop based work flow. It can, of course, be used to create other kinds of music. Other than that, let your budget and personal opinion decide.
Most have free demo versions limited in one way or another so you can download it and try it out.
FYI: I receive nothing for endorsement.
I LOVE reaper. It does 99% of what Pro Tools did, and I don’t need to synchronise with SMPTE time code and big machinery anymore, so I don’t even need those options.
Reaper has a great media explorer, video editing integration, extensive options for customization, can support many many soft synths and plug-ins simultaniously, and will run on a basic/old computer.
There is a great community for learning and asking questions too.
Jon Tidey’s very helpful instructional blog
I will focus on Reaper in the remaining section, although all of the concepts can be applied to any other DAW. The others can have specific features that might appeal to you more though.
I have included the latest version of Reaper for Mac and PC in the CD, and the links above will take you to the various download sites for the other DAWs.
Section 4: Hardware – The Audio Interface
While some audio can be recorded without additional hardware by using the tiny stereo 1/8 inch connectors built into all computers, adding an audio interface dramatically adds to the capability and quality. Advances in computer speed and storage capacity allow modern systems to record multiple audio sources at once, not possible on the 1/8 connector.
There are 2 main types of audio connectors. Microphones use the XLR type. Guitars and keyboards use ¼ connectors. Some types(shown here) allow both types in one connector.
The basic 2 channel interface allows control over gain, input type, headphones and master volume.
While USB microphones exist, I would not recommend them for any serious work as they tend to be quite noisy and can have latency, introducing an annoying delay into the sound.
2 channel interface
Focusrite Scarlett 2I2 $159
Modern interfaces allow the user to select between ¼ and XLR in the same connection.
There is an button to choose LINE or XLR. In this case it is marked Inst ( for instrument).
There is also a pad button, to reduce the volume by a fixed amount for certain types of instruments.
Some microphones also require “Phantom Power”, which is 48 Volts for the more expensive condenser mics. There is also an adjustment to accommodate various audio levels, usually labeled Gain. The gain is adjusted to keep the level as high as possible, but still below the overloading level.
In order to record more than 2 audio signals at once, a multi channel interface is needed.
The connection between the computer and the audio interface is usually USB or Firewire. Older interfaces utilize a card plugged into the computer with special cables.
Even if your interface is only 2 channels, you can still edit and mix more than just 2 channels, once recorded.
The number of channels desired is the main factor affecting price. You will need to consider your work flow & plans. Just how many instruments do you need to record at one time? With multi tracking, you can add more tracks later, building a performance over time.
Better interfaces offer more than just multiple inputs. There are options for fiber optic & digital inputs, multiple outputs for working in surround sound, and MIDI inputs. There can be more control over input volume, and they may even offer features that used to be only on high end mixers such as sending to external equipment, multiple head phone mixes, and more. What ever type you purchase, install the software supplied with it, or go to the manufacturers site and download the latest version.
PCs often need adjustment with the drivers.
See http://www.asio4all.org/ for an improved ASIO driver.
As an example, my Focusrite Pro 40 interface has 10 analog inputs and 10 outputs. ADAT / fiber inputs from other equipment can bring this up to 20 inputs and outputs. There is also a digital format called SPDIF. Each channel has phantom power, gain, and LED meters. I use the outputs to send my organ sound to a leslie speaker, the synthesizer to a delay unit, and the master volume to an LED volume indicator, and a power amp.
https://www.musiciansfriend.com/pro-audio/focusrite-saffire-pro-40-firewire-interface
Behringer UMC 204 stereo – $130.
2 Midas-designed mic preamps, 24-bit/192kHz resolution.
focusrite 18i8 $449
Four Focusrite mic/instrument preamps and two instrument inputs are right on the front panel. Eight line inputs, an ADAT input, four analog outs, plus S/PDIF and MIDI I/O.
Behringer U-Phoria UMC1820 USB Audio Interface. $300
18 inputs and 10 outputs. Eight Midas-designed mic pres and high-resolution 24-bit/96kHz converters.
Lynx Aurora (n) 24-TB3 24-channel AD/DA Converter with LM-PRE4 and Thunderbolt 3 Interface. $5,980
The 24-TB3 models provides up to 24-channel recording and playback to microSD for stand-alone operation or confidence recording, as well as two audiophile-grade headphone outs, while a TFT LCD display provides immediate status information.