Count key data

Count key data (CKD) is a direct-access storage device (DASD) data recording format introduced in 1964, by IBM with its IBM System/360 and still being emulated on IBM mainframes. It is a self-defining format with each data record represented by a Count Area that identifies the record and provides the number of bytes in an optional Key Area and an optional Data Area. This is in contrast to devices using fixed sector size or a separate format track.

Count key data (CKD) also refers to the set of channel commands (collectively Channel Command Words, CCWs) that are generated by an IBM mainframe for execution by a DASD subsystem employing the CKD recording format. The initial set of CKD CCWs, introduced in 1964, was substantially enhanced and improved into the 1990s.

CKD track format
The reason for CKD track format is to allow data field lengths to vary, each recorded block of data on a DASD track, called a record has an associated count field which identifies the record and indicates the size of the key, if used (user-defined up to 255 bytes), and the size of the data area, if used. The count field has the identification of the record in cylinder-head-record format, the length of the key, and the length of the data. The key may be omitted or consist of a string of characters.

"The beginning of a track is signalled when the index marker (index point) is detected. ... The marker is automatically recognized by a special sensing device." Following the index marker is the home address, which indicates the location of this track on the disk, and contains other control information internal to the control unit. A fixed-length gap follows the home address. Next, each track contains a Record 0 (R0), the track descriptor record, which is "designed to enable the entire content of a track to be moved to alternate tracks if a portion of the primary track becomes defective." Following R0 are the data records, separated by gaps.

Because of the gaps and other information, the recorded space is larger than that required for just the count data, key data, or user data. IBM provides a "reference card" for each device, which can be used to compute the number of records per track for various key and data field sizes, and to optimize the capacity of the device. Later, programs were written to do these calculations. Because records are normally not split between tracks, specification of an incorrect record size create problems.

Most often, the key is omitted and the record is located sequentially or by direct cylinder-head-record addressing. If it is present, the key is any data used to find the record, usually using the Search Key Equal or Search Key High or Equal CCW. The key (and hence the record) is locatable via hardware commands. Since the introduction of IBM's System/360 in 1964, nearly all IBM large and intermediate system DASDs have used the count key data record format.

The advantages of count key data record format are:


 * The record size can be exactly matched to the application data block sizes
 * CPU and memory requirements can be reduced by exploiting search-key commands.
 * IBM CKD subsystems initially operated synchronously with the system channel and can process information in the gaps between the various fields, thereby achieving higher performance by avoiding the redundant transfer of information to the host. Both synchronous and asynchronous operations are supported on later subsystems.

Reduced CPU and memory prices and higher device and interface speeds have somewhat nullified the advantages of CKD, and it is retained only because IBM's flagship operating system z/OS does not support sector-oriented interfaces.

Originally CKD records had a one-to-one correspondence to a physical track of a DASD device; however over time the records have become more and more virtualized such that in modern IBM mainframes there is no longer a direct correspondence between a CKD record ID and the physical layout of a track.

Packaging


Initially there was a high degree of correspondence between the logical view of DASD accesses and the actual hardware, as shown in the illustration. Three digit labels were typically affixed to identify the address of channel, control unit and device.

On low end systems the Channel and the Control Unit were frequently physically integrated but remained logically separate. IBM's New Attachment Strategy beginning with the 3830 Model 2 in 1972 physically separated the SCU into two physical entities, a director and a controller while keeping them logically the same. The controller handles the CKD track formatting and is packaged with the first drive or drives in a string of drives and having a model number with the letter "A" as a prefix, an "A-Unit" (or "A-Box") as in 3350 Model A2 containing a controller and two DASDs. DASD without a controller, that is B-Units, have a "B" prefix in their model number.

CKD subsystems and directors were offered by IBM and plug compatible competitors until at least 1996 (2301 to 3390 Model 9); in total 22 unique DASD offered by IBM configured in at least 35 different subsystem configurations. Plug-compatible offered many of the same DASD including 4 CKD subsystems featuring unique DASD.

Programming
Access to specific classes of I/O devices by an IBM mainframe is under the control of Channel Command Words (CCWs), some of which are generic (e.g. No Operation) but many of which are specific to the type of I/O device (e.g. Read Backwards for a tape drive). The group of CCWs defined by IBM for DASD fall into five broad categories:


 * Control – control of the DASD including the path thereto
 * Sense – sense status of the DASD including the path thereto; some sense commands affect the status of the controller and DASD in a fashion more in keeping with a control command, e.g., RESERVE, RELEASE
 * Write – write information to the controller or DASD (which may be buffered or cached in the path)
 * Search – compare information from the CPU with information stored in the DASD; the Channel operates in the Write mode while the storage unit operates in the Read mode.
 * Read – read information from the DASD (which may be buffered or cached in the path)

CKD CCWs are the specific set of CCWs used to access CKD DASD subsystems. This is in contrast to fixed block architecture (FBA) CCWs which are used to access FBA DASD subsystems.

CKD DASD are addressed like other Input/Output devices; for System/360 and System/370 DASD are addressed directly, through channels and the associated control units  (SCU or Storage Control Unit), initially using three hexadecimal digits, one for channel and two for control unit and device, providing addressing for up to 16 channels, for up to 256 DASD access mechanisms/channel and 4,096 DASD addresses total. Modern IBM mainframes use four hexadecimal digits as an arbitrary subchannel number within a channel subsystem subset, whose definition includes the actual channels, control units and device, providing addressing for up to 65,536 DASD per channel subsystem subset. In practice, physical and design constraints of the channel and of the controllers limited the maximum number of attached DASD attachable to a system to a smaller amount than the number that could be addressed.

Initial CKD feature set
The initial feature set provided by IBM with its 1964 introduction of the CKD track format and associated CCWs included:.


 * Defective/Alternative Track – enables an alternate track to replace a defective track transparent to the access method in use.
 * Record overflow – AKA track overflow  –  The Write Special Count, Key and Data CCW can write a record segment that is logically part of a record segment on the next track. While an individual record segment can not exceed the maximum track length of a DASD track, a sequence of record segments is not so limited.


 * Multitrack operations – specific CCWs can continue onto the next sequential head
 * Command chaining – CCWs could be chained together to construct complex channel programs.  The gaps in a CKD track format provided sufficient time between the commands so that all channel and SCU activity necessary to complete a command can be performed in the a gap between appropriate fields.  Such programs can search a large amount of information stored on a DASD, upon successful completion returning only the desired data and thereby freeing CPU resources for other activity.  This mode of operating synchronous to the gap was later enhanced by additional CCWs enabling a nonsynchronous mode of operation.
 * Channel switching – an SCU can be shared between channels –  initially two channel switching was provided and it was expanded to up to eight channels in later SCUs.  The channels can be on the same or different CPUS.

A Scan feature set was also provided but not continued into future CKD subsystems beyond the 2314.

Forty one CCWs implemented the feature set:

Notes:
 * O = optional feature
 * S = standard feature
 * MT = multitrack: when supported CCW will continue to operate on next heads in sequence to end of cylinder
 * ‡ = TIC (Transfer In Channel) and other standard commands not shown.
 * † = code same as MT Off except as listed
 * 1. File Scan Feature (9 CCWs) only available on 2841 for 2302, 2311 and 2321; they were not available on subsequent DASD controllers for DASD later than 2314.
 * 2. Count is number of bytes in search argument, including mask bytes

The CCWs were initially executed by two types of SCU attached to the system's high speed Selector Channels. The 2820 SCU controlled the 2301 Drum while the 2841 SCU controlled combinations of the 2302 Disk Storage, 2311 Disk Drive, 2321 Data Cell and/or 7320 Drum Storage. IBM quickly replaced the 7320 with the faster and larger 2303.

Subsequently, the feature set was implemented on the 2314 family of storage controls and an integrated attachment of the System 370 Model 25.

The following example of a channel program reads a disk record identified by a Key field. The track containing the record and the desired value of the key is known. The SCU will search the track to find the requested record. In this example <> indicate that the channel program contains the storage address of the specified field.

SEEK             SEARCH KEY EQUAL TIC             *-8 Back to search if not equal READ DATA


 * The TIC (transfer in channel) will cause the channel program to branch to the SEARCH command until a record with a matching key (or the end of the track) is encountered. When a record with a matching key is found the SCU will include Status Modifier in the channel status, causing the channel to skip the TIC CCW; thus the channel program will not branch and the channel will execute the READ command.

Block multiplexer channel enhancements
The block multiplexor channel was introduced beginning in 1971 on some high end System/360 systems along with the 2835 Control Unit and associated 2305 DASD, This channel was then standard on IBM System/370 and subsequent mainframes; when contrasted to the prior Selector channel it offered performance improvements for high speed devices such as DASD, including:

Multiple Requesting
Allowed multiple channel programs,

to be simultaneously active in the facility as opposed to only one with a Selector channel. The actual number of subchannels provided depends upon the system model and its configuration. Sometimes described as disconnected command chaining, the control unit could disconnect at various times during a chained set of CCWs, for example, disconnection for a Seek CCW, freeing the channel for another subchannel.

Command Retry
The channel and storage control under certain conditions can inter-operate to cause a CCW to be retried without an I/O interruption. This procedure is initiated by the storage control and used to recover from correctable errors.

Rotational Position Sensing
Rotational position sensing (RPS) was implemented with two new CCWs, SET SECTOR and READ SECTOR enabled the channel to delay command chaining until the disk rotated to a specified angular track position. RPS permits channel disconnection during most of the rotational delay period and thus contributes to increased channel utilization. The control unit implements RPS by dividing each track into equal angular segments.

Example Channel Program
The following example channel program will format a track with an R0 and three CKD records. SEEK             SET FILE MASK SET SECTOR WRITE R0         WRITE CKD        WRITE CKD        WRITE CKD        In this example the Record 0 conforms to IBM programming standards. With a block multiplexer channel the channel is free during the time the DASD is seeking and again while the disk rotates to beginning of the track. A selector channel would be busy for the entire duration of this sample program.

Defect skipping
Defect skipping allows data to be written before and after one of more surface defects allowing all of a track to be used except for that portion that has the defect. This also eliminates the time that was formerly required to seek to an alternate track. Only a limited number of defects could be skipped so alternate tracks remained supported for those tracks with excess defects.

Defect skipping was introduced in 1974 with the 3340 attached via the 3830 Model 2 Storage Control Unit or integrated attachments on small systems. Defect skipping was essentially a factory only feature until 1981 when CCWs for management along with associated utilities were released.

Dynamic paths
First introduced with the 3380 DASD on the 3880 Storage Control Unit in 1981 the feature was included with the later CKD DASD subsystems. The dynamic path selection function controls operation of the two controllers, including simultaneous data transfer over the two paths. When supported by the operating system, each controller can serve as an alternate path in the event the other controller is unavailable.

Three additional commands, Set Path Group ID, Sense Path Group ID, and Suspend Multipath Reconnection, are used to support attachment of the 3380 Models having two controllers at the head of a string.

The Set Path Group ID command, with the dynamic path selection (DPS) function, provides greater flexibility in operations on reserved devices. Once a path group for a device has been established, it may be accessed over any path which is a member of the group to which it is reserved. In addition, on 370-XA systems which set the multipath mode bit in the function control byte (byte 0) to a 1, block multiplex reconnections will occur on the first available path which is a member of the group over which the channel program was initiated (regardless of the reservation state of the device).

If the controller designated in the I/O address is busy or disabled, the dynamic path selection allows an alternate path to the device to be established via another storage director and the other controller in the model AA.

Nonsynchronous operation
Prior to the 1981 introduction of the 3880 director, CKD records were synchronously accessed, all activities required that one CCW be ended and the next initiated in the gaps between the CKD fields. The gap size placed limitations on cable length but did provide for very high performance since complex chains of CCWs could be performed by the subsystem in real time without use of CPU memory or cycles.

Nonsynchronous operation provided by the Extended CKD ("ECKD") set of CCWs removed the gap timing constraint. The five additional ECKD CCWs are Define Extent, Locate Record, Write Update Data, Write Update Key and Data, and Write CKD Next Track.

In nonsynchronous operation, the transfer of data between the channel and the storage control is not synchronized with the transfer of data between the storage control and the device. Channel programs can be executed such that channel and storage control activities required to end execution of one command and advance to the next do not have to occur during the inter-record gap between two adjacent fields. An intermediate buffer in the storage control allows independent operations between the channel and the device. A major advantage of ECKDs is far longer cables; depending upon application it may improve performance.

ECKD CCWs are supported on all subsequent CKD subsystems.

This example nonsynchronous channel program reads records R1 and R2 from track X'0E' in cylinder X'007F'. Both records have a key length of 8 and a data length of X'64' (10010) bytes.

Define Extent       Locate Record       Read Data          

Caching
Caching first introduced in DASD CKD subsystems by Memorex (1978) and StorageTek (1981) was subsequently introduced in late 1981 by IBM on the 3880 Model 13 for models of the 3380 with dynamic pathing.

The cache is dynamically managed by an algorithm; high activity data is accessed from the high-performance cache and low activity data is accessed from less-expensive DASD storage. A large memory in the Director, the cache, is divided into track slots that store data from the 3380 tracks. A smaller area is a directory that contains entries that allow data to be located in the cache.

Caches were also provided on subsequently introduced storage controls.

Other extensions
Over time a number of path control, diagnostic and/or error recovery CCWs were implemented on one or more storage controls. For example:


 * Unconditional Reserve allowed the releasing a device reserved to another channel and reserving the device to the channel issuing the command.
 * Read Multiple Count Key Data could more efficiently read full tracks allowing for more efficient backups.

Beyond System/370
Reduced CPU and memory prices and higher device and interface speeds have somewhat nullified the advantages of CKD, and support continues by IBM to this date because its flagship operating system z/OS continues to use CKD CCWs for many functions.

Originally CKD records had a one-to-one correspondence to a physical track of a DASD device; however over time the records have become more and more virtualized such that in a modern IBM mainframe there is no longer a direct correspondence between the a CKD record ID and a physical layout of a track. An IBM mainframe constructs CKD track images in memory and executes the ECKD and CKD channel programs against the image. To bridge between the native fixed block sized disks and the variable length ECKD/CKD record format, the CKD track images in memory are mapped onto a series of fixed blocks suitable for transfer to and from an FBA disk subsystem.

Of the 83 CKD CCWs implemented for System/360 and System/370 channels 56 are emulated on System/390 and later systems.