Contingency and Recovery Planning : Checklist for Information Systems
INTRODUCTION
This checklist is designed to help organize and put in place an emergency recovery plan for information systems and computing groups in campus departments.
Its development was undertaken by a diverse group of MICROnet participants known as the Disaster Recovery Workgroup. Design participants came from Housing and Dining Services, Auxiliary Enterprises, Campus Police, College
of Engineering, Electrical Engineering and Computer Science, Physical Plant-Campus
Services, Parking and Transportation, Health Services, Letters & Science,
Information Systems and Technology, UCOP and others. The purpose was to
create a planning tool that would serve to guide campus information systems
groups when they embark on the emergency planning process. A series of
priorities, task definitions, instructions, flow charts, agreements and
reference materials result from following the checklist. When organized
into a document these materials become core elements of an information
systems contingency and recovery plan. This information systems emergency
plan should adjoin the overall department emergency plans for responding
to and recovering from a disaster.
Table of Contents
I. GETTING READY
 |
A. Obtain written commitment from top management of support for contingency planning
objectives.
Example of mission statement for an information systems unit's contingency
plan:
"Prepare a contingency and recovery plan for restoring the information
systems that support the crucial functions performed by the Department/Unit
following a catastrophic failure of current systems."
|
 | B. Assemble the contingency planning team to include one or
more permanent members from:
- Computer support staff
- Operational or unit managers
- Facilities management
- Department Safety Committee
|
 | C. Provide
for the planning committee to include participation on an "as needed"
basis from the following campus departments:
- Internal Audit (compliance)
- Purchasing (contracts)
- Environmental Health and Safety (coordination)
- UC Police Department (coordination)
- Office of Emergency Preparedness (coordination)
- Applied Risk Management (liability/insurance)
- Real Estate Services (relocation)
- Information Systems and Technology including:
Telecommunications, Central Computing Services, Data Communications
and Network Services, Administrative Systems, and Student Information Systems.
- Physical Plant--Campus Services
- Others as required
|
 | D. Define
the responsibility of planning committee members. Appoint;
- Group Moderator, to facilitate planning meetings
- Group Scribe, to take and prepare meeting notes and agenda's
- Group Administrator, to aggregate meeting materials
|
| III. GATHERING NECESSARY INFORMATION - RESOURCE ASSESSMENT |
 | A.Survey
the systems and data which are critical to the Department's functions.
Develop flow charts of the results. Verify flow diagrams with appropriate
system administer. The survey should ascertain;
- Source of all data used in the system.
- Nature of information or report.
- Frequency of need for data.
- How the data is obtained, paper, e-mail, remote access download,
tape or disk.
- Who in department receives or retrieves data.
- Who on campus do you speak to about access to the data, will
they be available in an emergency.
- What is the impact if this data is not available.
- Hardware/OS software.
- Network.
- Applications.
|
 | B.Identify
the areas where the Department's responsibility for disaster recovery begins
and that of the central computing facility ends. |
 | C.Determine
if the current backup plan is adequate for the completed risk assessment
and includes the following features:
- Routine periodic backups,
- Clear backup "strategy" (full vs. incremental backups,
frequency, etc.),
- Off-site storage and retrieval procedures,
- Alternate processing site (hot, warm, or cold site).
|
 | D.Complete
a resource inventory in each of the following areas (items that might have
to be replaced):
- Equipment
- Computer hardware
- Network hardware
- Other equipment
- Documentation
- Procedure manuals/handbooks
- Software
- Accounting procedures
- Communication documentation
- Supplies
- Current inventory
- Outstanding orders
- Special items, (e.g., toxic or hazardous materials)
|
 | E. Define
the responsibilities of emergency response team(s). |
 | F.Complete
staff responsibility chart for emergency response:
- Disaster evaluation team (management level)
- Interim operations team
- Recovery team
|
| IV. INTEGRATION WITH DEPARTMENT RESPONSE AND RECOVERY PLANNING |
 | A. Specify
who is authorized to declare a disaster and activate the information systems
emergency Plan. |
 | B. Define
the department's immediate response actions by referring to the Department
Safety Plan for evacuation and notification of staff.
- Accounting for staff and others in the building.
- Meeting location of disaster evaluation team.
- Reaching staff needed for emergency response.
- List of home telephone numbers
- Cellular phone or radio contact
|
 | C. The
Department Recovery Plan should define "manual" processes that
can be used until computer resources are recovered. This need for parallel
paper process is beyond the planning scope of information systems group.
It needs to be defined by a department administrative recovery team. This
plan should:
- Stock the required forms.
- Pre-assign job numbers, PO numbers, work order numbers, service
request numbers, etc.
- Document procedures to merge the manually tracked data with
the information on the system once it is restored.
- Document all manual procedures.
- Prescribe how the impact of changes in procedures will be
clear to customers, suppliers, and vendors.
|
| V. INTERIM OPERATION PLAN - PREARRANGED AGREEMENTS FOR RESOURCE
REPLACEMENT |
 | A. Possibilities
for alternate sites:
- Other company with similar facilities
- Other company in the immediate geographical area
- Computer manufacturer's facilities (or other suggestions from
them)
- Service bureaus in the immediate area
|
 | B. Considerations
for alternate site selection:
- Building type
- Floor capacity - space and load
- Raised flooring
- Electric circuits/ capacity/ special connectors
- Air conditioning and humidity control
- Chilled water
- Fire protection and suppression
- Security - personnel
- Security - physical
- Security - data
- Staff accommodations
- Communications
- Telephones
- Network between departmental systems and access to other
data
- Physical access to systems with critical data which are not
accessible remotely
|
 | C.Back-up
agreements:
- Written guarantee or contract with other companies
- Reciprocal agreements
- Service bureau commitments
- Vendor commitments
|
 | D. Alternate
hardware:
- Computer and components
- CPU model
- Memory
- Operating system
- Options
- Peripherals
- Network equipment and wiring
- Terminals
- Off-line equipment
- Furniture
- Office machines (including phones, fax, etc.)
|
 | E. Supplies:
- Paper
- Forms
- Disks
- Tapes
- Reel
- Cartridge (type)
|
 | F. Off-site
moving plans:
- Transportation of staff
- Transportation of data and supplies
- Staff phone list
- Other ____
|
| VI. TEST, EVALUATE AND UPDATE THE PLAN |
 | A. Specify
periodic testing of the contingency plan to assure processing compatibility:
- Frequency
- Scope
- Test data
- Test evaluation team
|
 | B. Periodically
review and update of emergency response documentation:
- Staff responsibility charts
- Staff telephone numbers
- Vendors
- Software license agreements
- Alternate site agreements
- Inventory of computer hardware and software
- interim operations procedures
|
 | C. Periodically
review and drill emergency response and recovery teams:
- Tabletop exercise to test documentation and communication
in controlled environment.
- Functional exercise to test documentation, communication
and procedures in controlled environment
- Field exercise to test documentation, communication, procedures
and logistics in a simulated"real" environment.
|
| VII. RECOVERY AND RESTORATION |
 | A. Permanent
site preparation:
- Building
- Floor capacity - space and load
- Raised flooring
- Electric circuits/ capacity/ special connectors
- Air conditioning and humidity control
- Chilled water
- Fire protection and suppression
- Security - staff
- Security - physical
- Security - data
- Communications
- Telephones
- Network between departmental systems and access to other
data
- Physical access to systems with critical data which are not
accessible remotely
|
 |
B. Procurement of hardware:
- Acquisition
- Purchase
- Lease
- Donation
- Loan
- Computer and components
- CPU model
- Memory
- Operating system
- Options
- Peripherals
- Network equipment and wiring
- Terminals
- Off-line equipment
- Furniture
- Office machines (including phones, fax, etc.)
|
 | C. Supplies:
- Paper
- Forms
- Disks
- Tapes
- Reel
- Cartridge (type)
|
 | D. Parallel
operations plans. |
 | E.Migration
plan. |
 |
F. Procedures to close down the interim operation. |
|