This guide was last revised 24 August 2010
While practices and standards for digital audio are relatively mature, digital video technology continues to develop rapidly, creating challenges for long-term digital formatting and storage. There are presently no ideal open video formats that can be used for both access and preservation. Gaining an understanding of some of the key issues and choices in dealing with digital video, however, can help guide your decisions.
Before starting to learn about recording, editing and saving digital video for long-term use, it is useful to be familiar with digital audio and the processes of using codecs, discussed in the Audio section of this guide. The elements of digital video are similar to those of audio, although more complex, in part because a video file generally contains both a video and an audio stream. Unfortunately, this complexity can lead to a variety of problems with mixed standards, poor implementation of standards, and format obsolescence.
This guide attempts to focus on the aspects that are most useful for understanding how video works, but in doing so leaves out some elements of video creation (such as chroma sub-sampling and encoding profiles), and codecs or formats that are in less common use in New Zealand (such as the AVS codec popular in China).
Digital video is made up of the raw video signal (video stream) and the way it is encoded into a file format (the codec). The file format itself (the container) also contains the encoded audio stream, which has its own codec. Compatibility issues with playback of video files can be due to a mismatch between the video and audio codecs being used, resulting in a video signal without sound or sound without a video signal. Other video encoding issues arising from incorrect settings can result in a highly distorted or unplayable video signal.
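The relationship between the container, the video stream and the audio stream described above can be sketched in code. This is a minimal illustration only: the codec names and the player's supported-codec list below are hypothetical, not a description of any real player or library.

```python
from dataclasses import dataclass

@dataclass
class Stream:
    kind: str    # "video" or "audio"
    codec: str   # the encoding used for this stream, e.g. "h264" or "aac"

@dataclass
class Container:
    extension: str       # the file format, e.g. ".mp4"
    streams: list        # the encoded streams held inside the container

# Hypothetical player capabilities: the codecs this player can decode.
SUPPORTED = {"video": {"h264", "mpeg2"}, "audio": {"aac", "pcm"}}

def playback_report(container):
    """Report which streams a player with the SUPPORTED codecs can decode."""
    return {s.kind: (s.codec in SUPPORTED[s.kind]) for s in container.streams}

# A file whose video codec is supported but whose audio codec is not
# plays as a picture without sound -- one of the mismatches described above.
clip = Container(".mp4", [Stream("video", "h264"), Stream("audio", "vorbis")])
print(playback_report(clip))  # {'video': True, 'audio': False}
```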
The use of video signal standards continues to evolve and change. There are two main sources of standards, the first being the film industry, and the second being the television and consumer recording industries. The computer industry has also had a major impact on the application of these standards for digital video, due to their role in processing video through computer hardware and software.
Video signal standards primarily relate to the number of images (or frames) created per second, the image aspect ratio, calculated from the width in pixels and the number of horizontal lines (the definition) in an image scan, and whether the image is recorded using an interlaced or a progressive scan. A progressive scan is generally higher quality than an interlaced scan, which records two fields by separately scanning alternating horizontal lines of an image and then interlacing them to make a whole frame.
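The interlacing process just described can be sketched as follows, treating each scan line as a list item. The field and line names are illustrative only.

```python
def weave(field_top, field_bottom):
    """Interleave two fields (each a list of scan lines) into one full frame.

    The top field supplies the even-numbered lines and the bottom field the
    odd-numbered lines, reconstructing the whole image -- the 'interlacing'
    step described above.
    """
    frame = []
    for top_line, bottom_line in zip(field_top, field_bottom):
        frame.append(top_line)
        frame.append(bottom_line)
    return frame

# A 4-line frame captured as two 2-line fields:
top = ["line0", "line2"]
bottom = ["line1", "line3"]
print(weave(top, bottom))  # ['line0', 'line1', 'line2', 'line3']
```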
Cinema-oriented digital video is recorded at 24 frames per second, at a frame size of 2048 pixels x 1080 lines (2K) or 4096 pixels x 2160 lines (4K), using a progressive scan. Currently most recording devices at these resolutions are only affordable by professional cinematographers, and require high-end computer hardware and storage to process and edit the resulting video.
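To see why these resolutions demand high-end storage, a rough calculation of the uncompressed data rate helps. The assumption of 3 bytes (24-bit colour) per pixel is a simplification; real cameras use a range of bit depths and colour samplings.

```python
def raw_rate_bytes_per_second(width, lines, fps, bytes_per_pixel=3):
    """Uncompressed data rate, assuming 3 bytes (24-bit colour) per pixel."""
    return width * lines * fps * bytes_per_pixel

# 4K cinema video: 4096 x 2160 pixels, 24 progressive frames per second.
rate_4k = raw_rate_bytes_per_second(4096, 2160, 24)
print(rate_4k / 1e6)          # roughly 637 MB of data every second
print(rate_4k * 3600 / 1e12)  # roughly 2.3 TB for a single hour of footage
```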
Television-oriented digital video is recorded at 25 or 50 (PAL) frames per second, or at 30 or 60 (NTSC) frames per second. Due to a combination of legacy and emerging technologies, multiple definition standards exist. For digital video these can be most simply grouped into standard definition (TV and DVD), and high definition (HDTV).
Standard definition is now largely a legacy standard, but can still be found in cheaper digital camera devices that capture either an interlaced 576 (PAL) or 480 (NTSC) line scan at a picture aspect ratio of 4:3 or 16:9 anamorphic. These are television studio hardware standards set within a broadcasting industry standard called ITU-R BT.601-4. Unlike high definition video and graphics, the pixels used in these standards are not square (meaning their width is not the same as their height). This can cause problems (such as 'squishing' or stretching a picture) when converting an analogue video to digital, converting between PAL and NTSC video, or inserting a standard definition video clip into a high definition video. Further complications may occur if dealing with legacy digital video, especially that created or edited in the 1990s, as standards for a time were often wrongly applied or not followed.
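The effect of non-square pixels can be made concrete with a small calculation: the shape of the displayed picture is the stored frame shape corrected by the pixel shape. The 704 x 576 frame and the 12/11 pixel aspect ratio used here are one common convention for 4:3 PAL material; other conventions exist, which is part of the confusion described above.

```python
from fractions import Fraction

def display_aspect(stored_width, stored_height, pixel_aspect):
    """Display aspect ratio = stored frame shape corrected by pixel shape."""
    return Fraction(stored_width, stored_height) * pixel_aspect

# A 4:3 PAL frame stored as 704 x 576 with the 12/11 pixel aspect ratio:
par_pal_4_3 = Fraction(12, 11)
print(display_aspect(704, 576, par_pal_4_3))  # 4/3, the intended picture shape

# Treating the same pixels as square 'squishes' the picture:
print(display_aspect(704, 576, Fraction(1)))  # 11/9, noticeably narrower
```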
The high definition standards fix a number of problems that existed in standard definition, in particular by using square pixels. The industry standards (SMPTE 274M for 1080-line systems and SMPTE 296M for 720-line systems) set out three groupings of high definition systems: 1080 progressive line scan (1080p), 1080 interlaced line scan (1080i) and 720 progressive line scan (720p). The groupings may use different frame or field rates for their scans (24, 25, 30, 50 or 60 frames/fields scanned per second). Knowing the settings used for the source video is important for editing and outputting high definition video into different formats. While the settings are usually picked up correctly by editing software, if something goes wrong it is useful to know that not all sources may be correctly labelled.
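The shorthand labels above pack two facts together: the line count and the scan type. A small sketch of how such a label breaks apart (purely illustrative, not any editing package's actual parser):

```python
def parse_scan_label(label):
    """Split a label such as '1080p' or '720p' into line count and scan type."""
    lines = int(label[:-1])
    scan = {"p": "progressive", "i": "interlaced"}[label[-1]]
    return lines, scan

print(parse_scan_label("1080i"))  # (1080, 'interlaced')
print(parse_scan_label("720p"))   # (720, 'progressive')
```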
[Image: Comparison of progressive and interlaced scans. Source: Wikimedia Commons]
The use of video codecs is continuing to change as video signal standards change. Almost all supported video codecs use lossy compression to encode video signals, although some aim to be 'visually lossless' in that the compression is optimised to have no real visual impact on the image. One lossless codec, Motion JPEG 2000, while being an open standard with future potential, at present places very high demands on hardware and can have limitations in audio support. In practice this means that in 2010 all but the most well-resourced users need to choose lossy codecs to create, edit and archive their digital video. Each video codec will also require a compatible audio codec for the soundtrack, which can be lossless (such as the Linear PCM used by WAV, AIFF and CD audio) or lossy (such as AAC).
Lossy codecs commonly used for standard definition encoding tend to be targeted at specific devices or recording formats, such as MPEG-2 (DVD players), MPEG-4 (QuickTime for computers, DivX and Xvid encoding for computers and DVD players, and FFmpeg for computers), WMV (Windows Media Video for computers) and DV (used in camera recorders). MPEG-2 and QuickTime are published ISO standards that can be subject to patents, while Xvid and FFmpeg are free and open source implementations. WMV consists of multiple proprietary standards and the open WMV 9 standard. DV is a published industry standard, but has many proprietary modifications.
In high definition there are three popular lossy codecs in use:
the older MPEG-2 codec used for High Definition Video (HDV, the format generally used in HD camera recorders)
H.264 (also known as AVC), a type of MPEG-4 encoding, and
VC-1, the open version of Microsoft's WMV 9 standard for HD.
Due to its efficient compression, H.264 is used in a large number of HD video cameras, and for a growing range of internet-based streaming and downloadable video. While some cameras still use a manufacturer's proprietary encoding or the older MPEG-2 encoding, most recognised brand-name consumer and semi-professional cameras now encode directly to H.264. All three codecs are published standards subject to patents held by their developers, particularly in relation to commercial video distribution. All three are also approved to be part of the Blu-ray disc storage format.
One of the few free and open lossy video codecs for high definition video is VP8, publicly released by Google in 2010 after it acquired the rights to the technology. Because the codec is so new, VP8 support is unlikely yet to be found built into hardware such as video cameras and video cards. For now its main potential is as a distribution format for the internet. It is of limited benefit for archival purposes, as the lack of hardware support means almost all video sources would need to be re-encoded, at the risk of losing picture detail.
Encoding high definition video always involves a trade-off between picture detail and file size. The bitrate of the video data can be reduced to limit file size, but as in all lossy compression processes, the information that is removed can never be recovered. As a general rule, retaining the highest-bitrate version preserves both picture detail and your ability to edit and re-encode the file into new copies multiple times. This has significant storage and back-up implications for anyone interested in archiving high definition video. The movie industry in particular already has to archive hundreds of terabytes of digital video for almost every movie that is edited and processed digitally.
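The trade-off can be made concrete with a rough file-size calculation. The 17 Mbit/s figure below is an assumed example rate for a consumer HD camera, and audio and container overhead are ignored.

```python
def file_size_bytes(bitrate_mbps, duration_seconds):
    """Approximate encoded file size from the video bitrate alone.

    Bitrates are quoted in megabits per second, so divide by 8 to get bytes.
    Audio and container overhead are ignored in this rough estimate.
    """
    return bitrate_mbps * 1_000_000 / 8 * duration_seconds

# One hour of HD video at an assumed rate of 17 Mbit/s:
print(file_size_bytes(17, 3600) / 1e9)  # roughly 7.65 GB per hour
```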
Perhaps the most important thing to know about containers - or formats - for video is that it is the codec, not the container, that determines whether a file will play. This can be confusing, as most codecs allow video to be saved into several different formats, meaning you cannot determine the compatibility of a video file with your software or hardware by looking at the file extension. For instance, the WMV 9 and DivX codecs can both save video into a .AVI format, while .MOV and .MP4 can both be used to store MPEG-4 encoded video. Adobe's proprietary Flash format, .FLV, is also able to hold several different types of video, including high definition H.264.
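The ambiguity between extension and codec can be illustrated with a small lookup. The table below reflects only the examples mentioned above and is far from exhaustive; real containers accept many more codecs.

```python
# Illustrative (not exhaustive) mapping of container extensions to the
# codecs mentioned above that they can hold.
CONTAINER_CODECS = {
    ".avi": {"WMV 9", "DivX"},
    ".mov": {"MPEG-4"},
    ".mp4": {"MPEG-4"},
    ".flv": {"H.264"},  # among several other codec types
}

def possible_codecs(extension):
    """The extension alone cannot tell you which single codec is inside."""
    return CONTAINER_CODECS.get(extension.lower(), set())

print(possible_codecs(".AVI"))  # could be either WMV 9 or DivX
```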
HD digital video cameras may store their video stream in a variety of formats, some of which are proprietary to the manufacturer. One industry standard format that is becoming more widely used by camera manufacturers is AVCHD, developed by Sony and Panasonic.
Just as CDs have their own audio format, DVDs and Blu-ray discs have their own formats for video storage, known as VOB and BDAV respectively. DVDs and Blu-ray discs need to be "authored" through a process of organising a number of audio, video and data files into a standard sequence that can be read by players.
Digital video is currently dominated by industry standards in both hardware and software support. Where standards are openly published, most still have patents attached that may restrict use, or have often been implemented in a proprietary way by manufacturers and software developers to tie them to particular hardware and software. In addition, there is no satisfactory option for archival-quality lossless video, and one of the few open lossy codecs for high definition, VP8, is very new and not yet widely supported.
As a consequence we cannot recommend open video standards for video creation, encoding and editing. However, we can provide the following guidance based on what we consider to be the best compromises and the general industry trends: