User Guide - Huawei Technical Support

33
FusionCluster User Guide Issue 01 Date 2013-07-18 HUAWEI TECHNOLOGIES CO., LTD.

Transcript of User Guide - Huawei Technical Support

FusionCluster

User Guide

Issue 01

Date 2013-07-18

HUAWEI TECHNOLOGIES CO., LTD.

Issue 01 (2013-07-18) Huawei Proprietary and Confidential

Copyright © Huawei Technologies Co., Ltd.

i

Copyright © Huawei Technologies Co., Ltd. 2013. All rights reserved.

No part of this document may be reproduced or transmitted in any form or by any means without prior

written consent of Huawei Technologies Co., Ltd.

Trademarks and Permissions

and other Huawei trademarks are trademarks of Huawei Technologies Co., Ltd.

All other trademarks and trade names mentioned in this document are the property of their respective

holders.

Notice

The purchased products, services and features are stipulated by the contract made between Huawei and

the customer. All or part of the products, services and features described in this document may not be

within the purchase scope or the usage scope. Unless otherwise specified in the contract, all statements,

information, and recommendations in this document are provided "AS IS" without warranties, guarantees or

representations of any kind, either express or implied.

The information in this document is subject to change without notice. Every effort has been made in the

preparation of this document to ensure accuracy of the contents, but all statements, information, and

recommendations in this document do not constitute a warranty of any kind, express or implied.

Huawei Technologies Co., Ltd.

Address: Huawei Industrial Base

Bantian, Longgang

Shenzhen 518129

People's Republic of China

Website: http://enterprise.huawei.com

FusionCluster

User Guide About This Document

Issue 01 (2013-07-18) Huawei Proprietary and Confidential

Copyright © Huawei Technologies Co., Ltd.

ii

About This Document

Change History

This issue is the first official release.

FusionCluster

User Guide Contents

Issue 01 (2013-07-18) Huawei Proprietary and Confidential

Copyright © Huawei Technologies Co., Ltd.

iii

Contents

About This Document .................................................................................................................... ii

1 Login and Logout .......................................................................................................................... 1

1.1 Logging in to the FusionCluster ....................................................................................................................... 1

1.2 Logging out of the FusionCluster ..................................................................................................................... 2

2 Job System ...................................................................................................................................... 3

2.1 File Management .............................................................................................................................................. 3

2.1.1 Uploading a File ...................................................................................................................................... 3

2.1.2 Browsing Files ........................................................................................................................................ 4

2.1.3 Downloading a File ................................................................................................................................. 4

2.1.4 Deleting a File ......................................................................................................................................... 4

2.2 Job Submission................................................................................................................................................. 4

2.2.1 Submitting a Job Using a General Method ............................................................................................. 4

2.2.2 Submitting a Job Using a Template ......................................................................................................... 6

2.2.3 Submitting a Job by Running Commands ............................................................................................... 6

2.3 Job List ............................................................................................................................................................. 7

2.3.1 Viewing Job Status .................................................................................................................................. 7

2.3.2 Viewing Job Details ................................................................................................................................ 8

2.3.3 Deleting a Job ......................................................................................................................................... 8

2.3.4 Running a Job Again ............................................................................................................................... 9

2.3.5 Holding a Job .......................................................................................................................................... 9

2.3.6 Resuming a Job ....................................................................................................................................... 9

2.4 Viewing Compute Node Resources .................................................................................................................. 9

2.5 Queue Management........................................................................................................................................ 10

2.5.1 Queue List ............................................................................................................................................. 10

2.5.2 Viewing Queue Information .................................................................................................................. 10

2.6 Resource Reservation ..................................................................................................................................... 11

2.7 Viewing a License .......................................................................................................................................... 11

3 Application Software ................................................................................................................. 13

4 Workflow Management ............................................................................................................. 22

5 CLI .................................................................................................................................................. 24

FusionCluster

User Guide Contents

Issue 01 (2013-07-18) Huawei Proprietary and Confidential

Copyright © Huawei Technologies Co., Ltd.

iv

6 Appendix ...................................................................................................................................... 28

6.1 Command Line Files and Log Description of Open Source Software ........................................................... 28

6.2 File Download Failures in IE8 ....................................................................................................................... 28

FusionCluster

User Guide 1 Login and Logout

Issue 01 (2013-07-18) Huawei Proprietary and Confidential

Copyright © Huawei Technologies Co., Ltd.

1

1 Login and Logout

1.1 Logging in to the FusionCluster Open a browser such as Internet Explorer 8 or later, Mozilla Firefox 20 or later, or Chrome 20

or later, and enter http://master_ip_address:8081/static/index.html (master_ip_address

indicates the IP address of the management node) in the address box. On the login page, set

NetID and Password to log in to the FusionCluster. The default language is the first option

set by the browser, as shown in Figure 1-1.

Figure 1-1 Login page

After login, common users can view and operate the Job System, Cluster Configuration,

and Workflow Management system menus. With the system menus, you can switch between

functional modules. Click a language option in the upper right corner to switch the web user

interface (WebUI) language between Chinese and English, as shown Figure 1-2.

FusionCluster

User Guide 1 Login and Logout

Issue 01 (2013-07-18) Huawei Proprietary and Confidential

Copyright © Huawei Technologies Co., Ltd.

2

Figure 1-2 Switching the WebUI language between Chinese and English

1.2 Logging out of the FusionCluster To log out of the FusionCluster, click Log Out in the upper right corner, as shown in Figure

1-3.

Figure 1-3 Logging out of the FusionCluster

FusionCluster

User Guide 2 Job System

Issue 01 (2013-07-18) Huawei Proprietary and Confidential

Copyright © Huawei Technologies Co., Ltd.

3

2 Job System

2.1 File Management

2.1.1 Uploading a File

Before submitting a job, you can upload software or data on the local PC to a compute node.

To upload a file, perform the following operations:

Step 1 Choose Job System > File Management.

Step 2 Select a node whose data is to be uploaded from the node list.

Step 3 Click on the File Management page to upload the data.

Step 4 In the Upload File dialog box shown in Figure 2-1, click Browse, and select the file to be

uploaded.

Step 5 Enter the destination directory in Upload to directory, and click Upload, as shown in Figure

2-1.

The file is uploaded successfully.

Figure 2-1 Uploading a file

----End

FusionCluster

User Guide 2 Job System

Issue 01 (2013-07-18) Huawei Proprietary and Confidential

Copyright © Huawei Technologies Co., Ltd.

4

2.1.2 Browsing Files

Choose Job System > File Management, and select a node to be browsed from the node list.

On the File Management page, you can view files or testing result files in your home

directory. See Figure 2-2.

Figure 2-2 Browsing Files

2.1.3 Downloading a File

Choose Job System > File Management, select a node from the node list, locate the file or

testing result file to be downloaded on the File Management page, and click . In the

dialog box that is displayed, select a directory for saving the file, and click Save.

2.1.4 Deleting a File

Choose Job System > File Management, select a node from the node list, locate the file to

be deleted on the File Management page, and click . In the displayed dialog box, click

Delete to delete the file.

2.2 Job Submission

2.2.1 Submitting a Job Using a General Method

Use the uploading function of file management to upload the script file to a specific directory

(home directory by default), or run commands to submit a job. See Figure 2-3.

To submit a job, perform the following operations:

Step 1 Choose Job System > Submit Job > Common Submit.

Step 2 Specify Job Name.

Step 3 In Job Type, select Script File Path, and click File path to specify a file path, or select

Command, and run commands for submitting a job.

Step 4 Enter job parameters in Program Parameters.

FusionCluster

User Guide 2 Job System

Issue 01 (2013-07-18) Huawei Proprietary and Confidential

Copyright © Huawei Technologies Co., Ltd.

5

Step 5 In Work Directory, set a working directory.

This parameter is mandatory. Enter a working directory in the blank area, or click File path to

select a working directory.

Step 6 In Advanced Parameters, set advanced parameters.

These parameters are not set by default.

Advanced parameters are described as follows:

Nodes: Select Nodes, and set the number of nodes and slots in Nodes Count and Slot

Per Node, or enter a specific node in Nodes List.

Memory: Set memory size.

Priority: Set the job priority. The value must be an integer.

Environment Variables: Set environment variables.

Standard Output File Path: Set a standard output file path.

Standard Error File Path: Set a standard error file path.

Prologue Script File Path: Set a prologue script file path.

Epilogue Script File Path: Set an epilogue script file path.

Max Walltime: Set the maximum real-time period when a job is running.

Checkpointable: Set whether to enable the checkpoint function.

CheckpointInterval(minutes): Set the checkpoint time.

CheckpointDepth: Set the checkpoint depth.

CheckpointDir: Set the checkpoint directory.

Other Parameters: Set other parameters.

SharedFile System.CopyFilesList: Set the copied file list of the shared file system.

SharedFile System.DeleteFilesList: Set the deleted file list of the shared file system.

NeedNotification: Set NeedNotification.

Job Type: This parameter is left blank because this parameter is not set by default.

Step 7 Click Submit Job.

Figure 2-3 Common Submit

----End

FusionCluster

User Guide 2 Job System

Issue 01 (2013-07-18) Huawei Proprietary and Confidential

Copyright © Huawei Technologies Co., Ltd.

6

2.2.2 Submitting a Job Using a Template

Use the uploading function of file management to upload the script file to a specific directory

(home directory by default). To submit a job, perform the following operations:

Step 1 Choose Job System > Submit Job > PBS Template.

Step 2 On the PBS Template page, select Remote Template FilePath, and click File path to

specify a file path.

You can also select a local script file or run commands. See Figure 2-4.

Step 3 Set task parameters, and click Submit Job.

Figure 2-4 PBS Template

----End

2.2.3 Submitting a Job by Running Commands

Choose Job System > Submit Job > Qsub Command. On the Qsub Command dialog box

that is displayed, run commands, or click Upload files to upload the script file, and click

Submit Job. See Figure 2-5.

FusionCluster

User Guide 2 Job System

Issue 01 (2013-07-18) Huawei Proprietary and Confidential

Copyright © Huawei Technologies Co., Ltd.

7

Figure 2-5 Qsub Command

2.3 Job List

2.3.1 Viewing Job Status

Choose Job System > Job List on the menu bar to open the Job List page. See Figure 2-6.

The Job List page lists all jobs in the job system and allows you to sort jobs by JobId, User

Name, Name, or Job Status.

Figure 2-6 Job List

Select an option from the Job Status drop-down list, and click Search.

Jobs in the specified state are displayed in the job list.

Job status is described as follows:

COMPLETE: The job is completed.

RUNNING: The job is running.

DELETED: The job is deleted.

ERROR: An error occurs during job running.

QUEUE: The job is in a queue. If resources are insufficient, jobs are in queues to wait for resource allocation.

FusionCluster

User Guide 2 Job System

Issue 01 (2013-07-18) Huawei Proprietary and Confidential

Copyright © Huawei Technologies Co., Ltd.

8

WAITING: The job is waiting to run.

BLOCKED: The job cannot be scheduled because resources do not meet requirements.

SUSPEND: The job is suspended.

EXIT: The job is exiting.

HOLD: The job is held.

MOVED: The job is moved.

The system keeps job records. In the job list, you can click Delete to delete a job, or Rerun to run a job

again.

2.3.2 Viewing Job Details

In the job list, locate the target job, and click in the Detail column to open the job details

dialog box. In this dialog box, you can view the values of JobId, JobName, UserName,

Status, Queue, Server, Resources.Used.Cput, ErrorPath, OutputPath, ExecNodeList,

SubmitTime, StartTime, EndTime, HoldTypes, SFS.CopyFilesList, SFS.DeleteFilesList,

Priority, SequenceNumber.InQueue, and Rerunable. See Figure 2-7.

Figure 2-7 Job details dialog box

If a job is in the COMPLETE state, you can click in the Output column to view the job

output result.

If a job is in the ERROR state, you can click in the Error column to view the cause of

the job running error.

2.3.3 Deleting a Job

To delete a job, perform the following operations:

Step 1 Select the job to be deleted from the job list.

Step 2 Click in the Delete column.

Jobs that are complete cannot be deleted.

FusionCluster

User Guide 2 Job System

Issue 01 (2013-07-18) Huawei Proprietary and Confidential

Copyright © Huawei Technologies Co., Ltd.

9

----End

2.3.4 Running a Job Again

To run a job again, perform the following operations:

Step 1 Select the job to be run again from the job list.

Step 2 Click in the Rerun column.

----End

2.3.5 Holding a Job

To hold a job, perform the following operations:

Step 1 Select the job to be held from the job list.

Step 2 Click .

----End

2.3.6 Resuming a Job

To resume a job, perform the following operations:

Step 1 Select the job to be resumed from the job list.

Step 2 Click .

Only jobs that are in the HOLD and QUEUED states can be resumed. After jobs are resumed, the status of

these jobs turns to RUNNING.

----End

2.4 Viewing Compute Node Resources On the WebUI, choose Job System > Compute Nodes.

On the compute node page that is displayed, you can view compute node resources, including

Total Node, Total Core, Free Node, Busy Node, Down Node, Free Core, Busy Core, and

Down Core. See Figure 2-8.

Figure 2-8 Global Node Info

FusionCluster

User Guide 2 Job System

Issue 01 (2013-07-18) Huawei Proprietary and Confidential

Copyright © Huawei Technologies Co., Ltd.

10

In the node list of the compute node page, you can view details about nodes that are added to

the scheduling system. See Figure 2-9.

Figure 2-9 Node List

2.5 Queue Management

2.5.1 Queue List

Choose Job System > Queue Management.

In the queue list, queue information, including Queue Name, Total Job, Queue Job,

Running Job, Nodes, Groups, Users, and Enable can be viewed. See Figure 2-10.

Figure 2-10 Queue List

You can submit a job only to an enabled queue. In the Enable column, enabled indicates that

a queue is enabled, and disabled indicates that a queue is disabled. If a queue is set to default

in the Default column, the queue is used as the default one. If a queue is set to default, a job

runs in this queue if no queue is specified for the job. If a queue is set to non-default, a job

does not run in this queue if no queue is specified for the job.

2.5.2 Viewing Queue Information

Choose Job System > Queue Management. In the queue list, select the queue to be viewed,

and click Detail.

You can view queue details, including QueueName, TotalJob, QueueJob, RunJob,

queue_type, and enabled. For example, view low queue information, as shown in Figure

2-11.

FusionCluster

User Guide 2 Job System

Issue 01 (2013-07-18) Huawei Proprietary and Confidential

Copyright © Huawei Technologies Co., Ltd.

11

Figure 2-11 Queue details

2.6 Resource Reservation You can view resource reservation created by administrators on the reserved resource page.

Choose Job System > Schedule Configure.

On the Reservation Management page that is displayed, you can view resource information,

including ReservationID, Type, ACL, Nodes, Time_to_End, and Time_to_End from

Reservation List. See Figure 2-12.

Figure 2-12 Reservation List

2.7 Viewing a License On the License Management page, you can view license usage, including the available

license number and total license number.

Choose Cluster Configuration > License Management.

The License Usage Info page is displayed. See Figure 2-13.

FusionCluster

User Guide 2 Job System

Issue 01 (2013-07-18) Huawei Proprietary and Confidential

Copyright © Huawei Technologies Co., Ltd.

12

Figure 2-13 License Usage Info

FusionCluster

User Guide 3 Application Software

Issue 01 (2013-07-18) Huawei Proprietary and Confidential

Copyright © Huawei Technologies Co., Ltd.

13

3 Application Software

3.1 HFSS

The high frequency structure simulator (HFSS) is 3D electromagnetic simulation software

developed by Ansoft. HFSS is the first commercial 3D electromagnetic simulation software in

the world and an industry recognized 3D electromagnetic design and analysis standard.

HFSS provides powerful antenna design functions. It can be used to calculate antenna

parameters, such as the antenna gain, antenna directivity, far-field pattern, far-field 3D pattern,

and 3 dB bandwidth, and draw polarization characteristics, including the spherical field

component, circularly polarized field component, Ludwig third definition field component,

and axial ratio. The following can be calculated by using HFSS:

Basic electromagnetic field numerical solution and open boundary, and near-field and

far-field radiation

Characteristic impedance and transmission constants of ports

S parameters and normalized S parameters of port impedance

Structural eigen mode or resonate

Ansoft HFSS and Ansoft Designer constitute the Ansoft high frequency solution, which is the

unique high frequency design solution that is based on physical prototype. This solution

provides precise methods for quickly designing systems, circuits, and components, covering

all phases in high frequency design.

On FusionCluster, choose Job System > Application > HFSS. The HFSS page is displayed.

See Figure 3-1.

FusionCluster

User Guide 3 Application Software

Issue 01 (2013-07-18) Huawei Proprietary and Confidential

Copyright © Huawei Technologies Co., Ltd.

14

Figure 3-1 HFSS page

Parameters with an asterisk (*) are mandatory. Parameters are described as follows:

Project Name: Set a project name.

Analysis Name: Set an analysis item name.

Optimize Name: Set an optimized item name.

Queue: Select a queue.

Parallel Task Number: Set the number of parallel tasks. The value can be set to 1, 3, 5,

or 10.

Core Number: Set the number of cores. The value can be set to 12 or 24.

MEM Per Task (G): Set the memory capacity. The unit is GB.

HFSS Design File: Enter the HFSS job design file. Click Select file to import an HFSS

design file.

Description: Provide supplementary information about a job.

On the HFSS page, set the preceding parameters, click Submit Job, and then click Job List.

You can view the job status from the job list.

3.2 HSPICE

Simulation program with integrated circuit emphasis (HSPICE) is a commercial general

circuit simulation program developed by Meta-Software. HSPICE is used for circuit

performance simulation analysis during integrated circuit design, including steady state

analysis, transient analysis, and frequency domain analysis.

HSPICE is compatible with many electronic design automation (EDA) design tools, such as

Cadence and Workview, and can provide circuit simulation and design results for integrated

circuits. HSPICE allows users to precisely simulate, analyze, and optimize circuits for direct

current (DC) and microwave frequency higher than 100 MHz. In applications, HSPICE can

provide key circuit simulation and design schemes. When HSPICE is used for circuit

simulation, the circuit scale depends on the actual storage capacity of users' computers.

FusionCluster

User Guide 3 Application Software

Issue 01 (2013-07-18) Huawei Proprietary and Confidential

Copyright © Huawei Technologies Co., Ltd.

15

On FusionCluster, choose Job System > Application > Hspice. The Hspice page is displayed.

See Figure 3-2.

Figure 3-2 Hspice page

Parameters with an asterisk (*) are mandatory. Parameters are described as follows:

Project Name: Set a project name.

Queue: Select a queue.

Parallel Task Number: Set the number of parallel tasks. The value can be set to 1, 3, 5,

or 10.

Core Number: Set the number of cores. The value can be set to 12 or 24.

Hspice Design File: Enter the HSPICE job design file. Click Select file to import an

HSPICE design file.

Description: Provide supplementary information about a job.

On the Hspice page, set the preceding parameters, click Submit Job, and then click Job List.

You can view the job status from the job list.

3.3 Maxwell

Maxwell is finite element analysis software developed based on Maxwell equations.

Maxwell 2D can be used to analyze and optimize the alternating current (AC)/DC magnetic

field, electrostatic field, transient electromagnetic field, and temperature field. Maxwell 2D

also provides high precision equivalent circuit models for the Ansoft SIMPLORER module

and other circuit analysis tools. Maxwell 3D can be used to analyze systems in which eddy

current, displacement current, skin effect, and proximity effect cannot be ignored and obtain

the overall characteristics of electromagnetic components, such as electromotor, buses,

transformers, and coils. Maxwell 3D can automatically calculate parameters, such as the

power loss, coil loss, impedance (R and L) at a frequency, force, torque, inductance, and

energy storage. Maxwell 3D can also display the phase magnetic line, B and H distribution,

energy density, and temperature distribution in graphs.

FusionCluster

User Guide 3 Application Software

Issue 01 (2013-07-18) Huawei Proprietary and Confidential

Copyright © Huawei Technologies Co., Ltd.

16

On FusionCluster, choose Job System > Application > Maxwell. The Maxwell page is

displayed. See Figure 3-3.

Figure 3-3 Maxwell page

Parameters with an asterisk (*) are mandatory. Parameters are described as follows:

Project Name: Set a project name.

Analysis Name: Set an analysis item name.

Optimize Name: Set an optimized item name.

Queue: Select a queue.

Parallel Task Number: Set the number of parallel tasks. The value can be set to 1, 3, 5,

or 10.

Core Number: Set the number of cores. The value can be set to 12 or 24.

MEM Per Task (G): Set the memory capacity. The unit is GB.

Maxwell Design File: Enter the Maxwell job design file. Click Select file to import a

Maxwell design file.

Description: Provide supplementary information about a job.

On the Maxwell page, set the preceding parameters, click Submit Job, and then click Job

List. You can view the job status from the job list.

3.4 PowerSI

PowerSI, power simulation software, provides quick and precise full-wave electromagnetic

analysis and easy simulation of IC packages and printed circuit boards (PCBs). PowerSI can

use frequency-related network parameter models to simplify design. PowerSI is used in PCB,

IC package, system-level package (SIP: single-inline package) design to provide better

designs.

FusionCluster

User Guide 3 Application Software

Issue 01 (2013-07-18) Huawei Proprietary and Confidential

Copyright © Huawei Technologies Co., Ltd.

17

On FusionCluster, choose Job System > Application > PowerSI. The PowerSI page is

displayed. See Figure 3-4.

Figure 3-4 PowerSI page

Parameters with an asterisk (*) are mandatory. Parameters are described as follows:

Project Name: Set a project name.

Queue: Select a queue.

Parallel Task Number: Set the number of parallel tasks. The value can be set to 1, 3, 5,

or 10.

Core Number: Set the number of cores. The value can be set to 12 or 24.

PowerSI Design File: Enter the PowerSI job design file. Click Select file to import a

PowerSI design file.

Description: Provide supplementary information about a job.

On the PowerSI page, set the preceding parameters, click Submit Job, and then click Job

List. You can view the job status from the job list.

3.5 VASP

The Vienna Ab-initio simulation package (VASP), a software package developed by the

Hafner team of University of Vienna, is used for electronic structure calculation and quantum

mechanics-molecule dynamics simulation. VASP is the most popular commercial software

used in material simulation and material science study.

On FusionCluster, choose Job System > Application > VASP. The VASP page is displayed.

See Figure 3-5.

FusionCluster

User Guide 3 Application Software

Issue 01 (2013-07-18) Huawei Proprietary and Confidential

Copyright © Huawei Technologies Co., Ltd.

18

Figure 3-5 VASP page

Parameters with an asterisk (*) are mandatory. Parameters are described as follows:

VASP version: Select a VASP version. Only VASP5.2 is available.

Network type: Select a network type. Only ethernet is available.

Job Name: Enter a job name.

Queue: Select a queue. Select a queue from the drop-down list.

Work Directory: Set a working directory. Enter a working directory in the blank area, or

click File path to select a working directory.

Number Of Nodes: Set the number of nodes.

Cores Per Node: Set the number of cores on each node.

Memory Maxsize(mb): Set the memory capacity.

WallTime Maxsize(hh:mm:ss): Set the maximum real time when a job is running.

On the VASP page, set the preceding parameters, click Submit Job, and then click Job List.

You can view the job status from the job list.

NOTE

Upload the VASP input files (including INCAR, KPOINTS, POSCAR, and POTCAR) to the working

directory before submitting a VASP job.

3.6 Gaussian

Gaussian is a powerful comprehensive quantum chemistry software package. The executable

programs of Gaussian can run on mainframe computers, super computers, workstations, and

PCs of various models and have different versions. Gaussian can be used to calculate the

following:

Molecule energy and structure

Transition state energy and structure

FusionCluster

User Guide 3 Application Software

Issue 01 (2013-07-18) Huawei Proprietary and Confidential

Copyright © Huawei Technologies Co., Ltd.

19

Chemical bond and chemical reaction energy

Molecular orbital

Atomic electric charge and electric potential

Vibration frequency

Infrared and Raman spectrum

Nuclear magnetic properties

Polarizability and hyper polarizability

Thermodynamic properties

Reaction path

Calculation can be performed on the ground state and excited state of systems. Gaussian can

predict the energy, structure, and molecular orbital of periodic systems. Gaussian is a

powerful tool used in chemical studies, such as studies on substituent impact, chemical

reaction mechanism, potential energy curved surface, and excitation energy.

On FusionCluster, choose Job System > Application > Gaussian. The Gaussian page is

displayed. See Figure 3-6.

Figure 3-6 Gaussian page

Parameters with an asterisk (*) are mandatory. Parameters are described as follows:

Gaussian version: Select a Gaussian version. G09 is available.

Job Name: Enter a job name.

Queue: Select a queue.

Work Directory: Set a working directory. Enter a working directory in the blank area, or

click File path to select a working directory.

FusionCluster

User Guide 3 Application Software

Issue 01 (2013-07-18) Huawei Proprietary and Confidential

Copyright © Huawei Technologies Co., Ltd.

20

Gaussian input File: Enter a Gaussian input file. Enter the file path in the blank area, or

click File path to select a file path. Upload the Gaussian input file to the specified path

in advance.

Scratch Directory(Default Working Directory): Set the default working directory.

Number Of Nodes: Set the number of nodes. The value can be set to 1 only, because

Gaussian can be executed only in a single node.

Cores Per Node: Set the number of cores on each node.

Memory Maxsize(mb): Set the memory capacity.

WallTime Maxsize(hh:mm:ss): Set the maximum real time when a job is running.

On the Gaussian page, set the preceding parameters, click Submit Job, and then click Job

List. You can view the job status from the job list.

3.7 MPI

The Message Passing Interface (MPI) is used for parallel computing.

On FusionCluster, choose Job System > Application > MPI. The MPI page is displayed.

See Figure 3-7.

Figure 3-7 MPI page

Parameters with an asterisk (*) are mandatory. Parameters are described as follows:

Job Name: Enter a job name.

Queue: Select a queue.

Number Of Nodes: Set the number of nodes.

Cores Per Node: Set the number of cores on each node.

FusionCluster

User Guide 3 Application Software

Issue 01 (2013-07-18) Huawei Proprietary and Confidential

Copyright © Huawei Technologies Co., Ltd.

21

Memory Maxsize(mb): Set the memory capacity.

Max Walltime: Set the maximum real time when a job is running.

mpirun path: Enter the MPI run path. Enter a path in the blank area, or click File path

to select a path.

Executable File Path: Enter the executable file path. Enter a path in the blank area, or

click File path to select a path.

Program Parameters: Set program parameters.

Work Directory: Set a working directory. Enter a working directory in the blank area, or

click File path to select a working directory.

StdoutPath: Set a standard output file path.

StdErrPath: Set a standard error output file path.

On the MPI page, set the preceding parameters, click Submit Job, and then click Job List.

You can view the job status from the job list.

FusionCluster

User Guide 4 Workflow Management

Issue 01 (2013-07-18) Huawei Proprietary and Confidential

Copyright © Huawei Technologies Co., Ltd.

22

4 Workflow Management

4.1 Process Definition Management

Choose Workflow Management > Work Flow on the menu bar and choose Process

Definition in the navigation tree to open the Process Definition Management page. See

Figure 4-1.

On the Process Definition Management page, you can view workflow information,

including ID, Name, ResourceName, DefinitionProId, and Detail about a workflow. Click

in the Detail column to view details about a workflow. To enable a workflow, click in

the Start column.

Figure 4-1 Process Definition Management page

4.2 Process Instance

Choose Workflow Management > Work Flow on the menu bar and choose Process

Instance in the navigation tree to open the Process Instance Management page. See Figure

4-2.

On the Process Instance page, click in the Detail column to view details about an

enabled workflow instance. To stop a workflow instance, click in the Terminate column.

FusionCluster

User Guide 4 Workflow Management

Issue 01 (2013-07-18) Huawei Proprietary and Confidential

Copyright © Huawei Technologies Co., Ltd.

23

Figure 4-2 Process Instance page

FusionCluster

User Guide 5 CLI

Issue 01 (2013-07-18) Huawei Proprietary and Confidential

Copyright © Huawei Technologies Co., Ltd.

24

5 CLI

5.1 Log in to the CLI

Step 1 Open a piece of login software, for example, Xshell or PuTTY.

Step 2 Enter the IP address of the management node and select Secure Shell (SSH), which uses port

22.

Step 3 Enter a user name and password to log in.

Figure 5-1 shows the Xshell CLI after login.

Figure 5-1 Xshell CLI

FusionCluster

User Guide 5 CLI

Issue 01 (2013-07-18) Huawei Proprietary and Confidential

Copyright © Huawei Technologies Co., Ltd.

25

5.2 Submitting Jobs To submit a job, run the following command:

qusb <Job script>

If you want to define a job name when submitting the job, run the following command:

qsub –N <Job name> <Job script>

If you want to specify computing resources (for example, a node with four cores) when

submitting a job, run the following command:

qsub –N <Job name> -l nodes=1:ppn=4

If you want to specify the standard and error output file paths when submitting a job, run

the following command:

qusb –N <Job name> -o <Standard output file path> -e <Error output file path> <Job

script>

To submit a job script containing parameters, run the following command:

qsub <Option> <<END

<Job script><Parameter 1>…

END

5.3 Viewing Jobs

5.3.1 Viewing the Status of a Compute Node

To view the status of a compute node where a job is running, run the following command:

pbsnodes –a

The following is an example:

[root@master12 ~]# pbsnodes -a

node17

state = free

np = 16

properties = application,gaussian

ntype = cluster

status = rectime=1371452746,varattr=Mem= 24676248 24565628 110620 0 230840

23436960,jobs=,state=free,netload=57418385065,gres=,loadave=0.01,ncpus=16,physmem=2

4676248kb,availmem=24831576kb,totmem=25728496kb,idletime=796252,nusers=1,nsession

s=1,sessions=6096,uname=Linux node17 2.6.18-308.el5 #1 SMP Fri Jan 27 17:17:51 EST

2012 x86_64,opsys=linux

gpus = 8

The node status is described as follows:

Free: The node has free space to run more jobs.

Job-exclusive: The node has no free space to run more jobs.

FusionCluster

User Guide 5 CLI

Issue 01 (2013-07-18) Huawei Proprietary and Confidential

Copyright © Huawei Technologies Co., Ltd.

26

Down: The node is inaccessible.

Offline: The node rejects submitted jobs.

5.3.2 Viewing the Running Status of a Job

To view the running status of all jobs, run the following command:

qstat

The job status in the command output is described as follows:

− R: The job is running.

− C: The job is complete.

− Q: The job is in a queue.

− E: The job is exiting.

− H: The job is held.

− W: The job is waiting to run.

To view the distribution of jobs on nodes, run the following command:

qstat –n

To view the running status of a single job, run the following command:

qstat –f <Job ID>

5.4 Querying Jobs

To query jobs by user name, queue name, job name, or job status, run the following

command:

qselect <Parameter> <User name/Queue name/Job name/Job status>

The values of Parameter are described as follows:

− -u: indicates a user name.

− -q: indicates a queue name.

− -n: indicates a job name.

− -s: indicates job status.

To query jobs based on the user name test, run the following command:

qselect –u test

5.5 Deleting a Job

To delete a job, run the following command:

qdel <Job ID>

5.6 Holding a Job

To hold a job, run the following command:

qhold <Job ID>

FusionCluster

User Guide 5 CLI

Issue 01 (2013-07-18) Huawei Proprietary and Confidential

Copyright © Huawei Technologies Co., Ltd.

27

5.7 Releasing a Job

To release a job that has been held so that the job can queue again, run the following

command:

qris <Job ID>

5.8 Running a Job Again

If a job needs to be run multiple times and you have set Rerunable to True for the job when

submitting the job, run the following command to run the job again:

qrerun <Job ID>

5.9 Moving a Job

To move a job to implement load balancing between queues, run the following command:

qmove <Queue name> <Job ID>

Figure 5-2 License Usage Info

FusionCluster

User Guide 6 Appendix

Issue 01 (2013-07-18) Huawei Proprietary and Confidential

Copyright © Huawei Technologies Co., Ltd.

28

6 Appendix

6.1 Command Line Files and Log Description of Open Source Software

1. For description and log for the torque command, see the following document.

http://docs.adaptivecomputing.com/torque/torqueAdminGuide-4.2.6.pdf

2. For description and log for the maui command, see the following document.

http://docs.adaptivecomputing.com/maui/index.php

6.2 File Download Failures in IE8

Due to bugs of the IE8, the file download function is unavailable when a user accesses an SSL

page. This problem can be solved by installing a patch on the PC that runs the IE8.

The following lists a non-official website for downloading the patch.

− To download the patch, visit the following website:

http://www.cnblogs.com/ggjucheng/archive/2011/12/31/2308774.html

− You can copy the following website to your download tool to download the patch

directly:

http://files.cnblogs.com/ggjucheng/ieupdate.rar

After the patch is downloaded, run the file to install the patch.

For details about the official bug description provided by Microsoft, visit the following

website:

http://support.microsoft.com/kb/323308