Skip to main content

Identifying what information about hardware is needed

The goal is to identify what information about FutureGrid hardware should be made available. Assuming that most of the following is important to you and in the interest of minimizing the number of things you need to check, please place a X to indicate which informaiton about FutureGrid hardware will NOT be important to you. 

Deciding how to collect and make the information available will be a separate discussion.

The following list was taken and slightly modified from the Common User Environment Documentation effort in the TeraGrid. http://www.teragridforum.org/mediawiki/index.php?title=CUE_CUED_Document_List

 

Check if NOT important
Subdocument / Information
 
  1 Basic Information
    1.1. Name of Resource
    1.2. Name of Resource Provider
    1.3. Login Node
    1.4. Recommended and/or Discouraged Uses
   
  2 Hardware Configuration
    2.1. Type of Machine (MPP, SMP, etc.)
    2.2. Detailed Description of Hardware: Blades, nodes, processors, cores, cores per processor, and a diagram of the basic unit (blade, node, etc.) if possible.
    2.3. CPU Type
    2.4. CPU Speed
    2.5. Number of Nodes
    2.6. CPUs per Node
    2.7. Memory per CPU
   
  3 Operating System
    3.1. Description of System Software including Version and any Special Dialect
   
  4 Interconnect
    4.1. Description of all Interconnect Configurations
    4.2. Typical Network Bandwidth and Latency for all Interconnects
   
  5 File Systems (for each available, specify all below)
    5.1. Filesystem Name
    5.2. Environment Variable for Path (if available)
    5.3. Availability Scope: Node, Cluster, Site, FutureGrid, etc.
    5.4. Recommended and/or Discouraged Uses
    5.5. Size and Quota (if applicable)
    5.6. Backup and Purge Policies
    5.7. Special Considerations (striping, etc.)
   
  6 Archive System (if any)
    6.1. Description of the Archive System
    6.2. Archive Policies
    6.3. Basic Archive Commands, including Access from Remote Systems
   
  7 Environment
    7.1. Environment Management System ("modules") and Basic Commands (include pointer to Futuregrid modules documentation)
    7.2. List of Modules Added at Login (should conform to CUE standard)
    7.3. List of Available Modules
    7.4. Supported Shells
    7.5. How to Change Shell
   
  8 Job Scheduler
    8.1. Job Scheduler Used, including any Modifications or Extensions
    8.2. Job Scheduling Policy
   
  9 Queues
    9.1. How to List Available Queues
    9.2. How to Obtain Detailed Information about Queues
      9.2.1. Recommended Use
      9.2.2. Maximum and Minimum Number of Nodes
      9.2.3. Maximum and Minimum Number of Cores
      9.2.4. Maximum and Minimum Wall Time
      9.2.5. Priority Boost/Loss Available
      9.2.6. Node Features (High-Mem, Accelerated, Interconnects, etc.)
      9.2.7. FutureGridGrid SU Premium/Reduction
   
  10 Job Management
    10.1. How to Submit a Job
    10.2. Available Job Submission Options
    10.3. Basic Example Job Script (/bin/hostname, etc.)
    10.4. Link to CUED "Sample Jobs" for More Examples
    10.5. How to Monitor a Job's Status
    10.6. How to Delete a Job
   
  11 File Transfer
    11.1. Recommended Methods for Large File Transfer (include examples)
    11.2. Recommended Entry/Endpoints and Environment Variables for Entry/Endpoints
    11.3. Recommended Front-End Hosts for File Transfer (if any)
    11.4. Recommended Options (striping, etc.)
   
  12 FutureGrid Accounting
    12.1. How to Monitor Usage
    12.2. How SUs are Computed
      12.2.1. Premium Charges for High Priority Queues, Large Memory Queues, etc.
      12.2.2. Reduced Charges for “Slow” or “Backfill” Queues
      12.2.3. Premium Charges for Reservations and Reservation Cancellation Policy
    12.3. Maximum Startup Allocation (in SUs)
   
  13 Supported Compilers (all of the following for each)
    13.1. Recommended Options
    13.2. Available and Recommended MPI Libraries
    13.3. Available and Recommended Math Libraries
    13.4. Additional or Official Compiler Documentation Links
   
  14 Sample Jobs (simple working examples)
    14.1. Simple Batch Job (/bin/hostname, etc.)
    14.2. MPI Job (if applicable, both C/C++ and Fortran)
    14.3. OpenMP Job (if applicable, both C/C++ and Fortran)
    14.4. Hybrid Job (if applicable, both C/C++ and Fortran)
    14.5. Specialized Use Job (if applicable, both C/C++ and Fortran)
    14.6. Profiling Job (if applicable, both C/C++ and Fortran)
    14.7. Debugging Job (if applicable, both C/C++ and Fortran)
   
  15 Debugging and Optimization Tools
    15.1. Compiler Options for Debugging
    15.2. How to Start a Job in the Debugger
    15.3. Attaching the Debugger to an Interactive or Batch Process
    15.4. Additional or Official Tool Documentation Links
   
  16 Profiling Tools
    16.1. Compiler Options for Profiling
    16.2. Post-Mortem Profiling Tool Documentation
    16.3. Additional or Official Tool Documentation Links
   
  17 Environment Variables
    17.1. Complete List of Environment Variables Defined
   
  18 Reservations
    18.1. Procedure for Obtaining a Reservation for Single System Job
    18.2. Procedure for Obtaining a Reservation for Cross-Site or Multi-Site Job
   
  19 Application Software
    19.1. Full Available Software List and Documentation
    19.2. List of Scientific Software by Discipline