The goal is to identify what information about FutureGrid hardware should be made available. We assume that most of the items below are important to you, so to minimize the number of items you need to check, please place an X next to any information about FutureGrid hardware that will NOT be important to you.
Deciding how to collect and make the information available will be a separate discussion.
The following list was taken, with slight modifications, from the Common User Environment Documentation (CUED) effort in the TeraGrid: http://www.teragridforum.org/mediawiki/index.php?title=CUE_CUED_Document_List
| Check if NOT important | |||||
| Subdocument / Information | |||||
| 1 | Basic Information | ||||
| 1.1. | Name of Resource | ||||
| 1.2. | Name of Resource Provider | ||||
| 1.3. | Login Node | ||||
| 1.4. | Recommended and/or Discouraged Uses | ||||
| 2 | Hardware Configuration | ||||
| 2.1. | Type of Machine (MPP, SMP, etc.) | ||||
| 2.2. | Detailed Description of Hardware: Blades, nodes, processors, cores, cores per processor, and a diagram of the basic unit (blade, node, etc.) if possible. | ||||
| 2.3. | CPU Type | ||||
| 2.4. | CPU Speed | ||||
| 2.5. | Number of Nodes | ||||
| 2.6. | CPUs per Node | ||||
| 2.7. | Memory per CPU | ||||
| 3 | Operating System | ||||
| 3.1. | Description of System Software including Version and any Special Dialect | ||||
| 4 | Interconnect | ||||
| 4.1. | Description of all Interconnect Configurations | ||||
| 4.2. | Typical Network Bandwidth and Latency for all Interconnects | ||||
| 5 | File Systems (for each available, specify all below) | ||||
| 5.1. | Filesystem Name | ||||
| 5.2. | Environment Variable for Path (if available) | ||||
| 5.3. | Availability Scope: Node, Cluster, Site, FutureGrid, etc. | ||||
| 5.4. | Recommended and/or Discouraged Uses | ||||
| 5.5. | Size and Quota (if applicable) | ||||
| 5.6. | Backup and Purge Policies | ||||
| 5.7. | Special Considerations (striping, etc.) | ||||
| 6 | Archive System (if any) | ||||
| 6.1. | Description of the Archive System | ||||
| 6.2. | Archive Policies | ||||
| 6.3. | Basic Archive Commands, including Access from Remote Systems | ||||
| 7 | Environment | ||||
| 7.1. | Environment Management System ("modules") and Basic Commands (include pointer to FutureGrid modules documentation) | ||||
| 7.2. | List of Modules Added at Login (should conform to CUE standard) | ||||
| 7.3. | List of Available Modules | ||||
| 7.4. | Supported Shells | ||||
| 7.5. | How to Change Shell | ||||
| 8 | Job Scheduler | ||||
| 8.1. | Job Scheduler Used, including any Modifications or Extensions | ||||
| 8.2. | Job Scheduling Policy | ||||
| 9 | Queues | ||||
| 9.1. | How to List Available Queues | ||||
| 9.2. | How to Obtain Detailed Information about Queues | ||||
| 9.2.1. | Recommended Use | ||||
| 9.2.2. | Maximum and Minimum Number of Nodes | ||||
| 9.2.3. | Maximum and Minimum Number of Cores | ||||
| 9.2.4. | Maximum and Minimum Wall Time | ||||
| 9.2.5. | Priority Boost/Loss Available | ||||
| 9.2.6. | Node Features (High-Mem, Accelerated, Interconnects, etc.) | ||||
| 9.2.7. | FutureGrid SU Premium/Reduction | ||||
| 10 | Job Management | ||||
| 10.1. | How to Submit a Job | ||||
| 10.2. | Available Job Submission Options | ||||
| 10.3. | Basic Example Job Script (/bin/hostname, etc.) | ||||
| 10.4. | Link to CUED "Sample Jobs" for More Examples | ||||
| 10.5. | How to Monitor a Job's Status | ||||
| 10.6. | How to Delete a Job | ||||
| 11 | File Transfer | ||||
| 11.1. | Recommended Methods for Large File Transfer (include examples) | ||||
| 11.2. | Recommended Entry/Endpoints and Environment Variables for Entry/Endpoints | ||||
| 11.3. | Recommended Front-End Hosts for File Transfer (if any) | ||||
| 11.4. | Recommended Options (striping, etc.) | ||||
| 12 | FutureGrid Accounting | ||||
| 12.1. | How to Monitor Usage | ||||
| 12.2. | How SUs are Computed | ||||
| 12.2.1. | Premium Charges for High Priority Queues, Large Memory Queues, etc. | ||||
| 12.2.2. | Reduced Charges for “Slow” or “Backfill” Queues | ||||
| 12.2.3. | Premium Charges for Reservations and Reservation Cancellation Policy | ||||
| 12.3. | Maximum Startup Allocation (in SUs) | ||||
| 13 | Supported Compilers (all of the following for each) | ||||
| 13.1. | Recommended Options | ||||
| 13.2. | Available and Recommended MPI Libraries | ||||
| 13.3. | Available and Recommended Math Libraries | ||||
| 13.4. | Additional or Official Compiler Documentation Links | ||||
| 14 | Sample Jobs (simple working examples) | ||||
| 14.1. | Simple Batch Job (/bin/hostname, etc.) | ||||
| 14.2. | MPI Job (if applicable, both C/C++ and Fortran) | ||||
| 14.3. | OpenMP Job (if applicable, both C/C++ and Fortran) | ||||
| 14.4. | Hybrid Job (if applicable, both C/C++ and Fortran) | ||||
| 14.5. | Specialized Use Job (if applicable, both C/C++ and Fortran) | ||||
| 14.6. | Profiling Job (if applicable, both C/C++ and Fortran) | ||||
| 14.7. | Debugging Job (if applicable, both C/C++ and Fortran) | ||||
| 15 | Debugging and Optimization Tools | ||||
| 15.1. | Compiler Options for Debugging | ||||
| 15.2. | How to Start a Job in the Debugger | ||||
| 15.3. | Attaching the Debugger to an Interactive or Batch Process | ||||
| 15.4. | Additional or Official Tool Documentation Links | ||||
| 16 | Profiling Tools | ||||
| 16.1. | Compiler Options for Profiling | ||||
| 16.2. | Post-Mortem Profiling Tool Documentation | ||||
| 16.3. | Additional or Official Tool Documentation Links | ||||
| 17 | Environment Variables | ||||
| 17.1. | Complete List of Environment Variables Defined | ||||
| 18 | Reservations | ||||
| 18.1. | Procedure for Obtaining a Reservation for Single System Job | ||||
| 18.2. | Procedure for Obtaining a Reservation for Cross-Site or Multi-Site Job | ||||
| 19 | Application Software | ||||
| 19.1. | Full Available Software List and Documentation | ||||
| 19.2. | List of Scientific Software by Discipline | ||||
