Installation of Torque/Maui for a Beowulf Cluster
-- just for reference,configuration isn't all
-- just for reference,configuration isn't all
note that:
perhaps, you must do the things listed below first before install and configure torque
in server_machine:
domain name server config
network information service config
network file system config
using Rcmd authentication:
remote shell / secure shell config
and so on
perhaps, you must do the things listed below first before install and configure torque
in server_machine:
domain name server config
network information service config
network file system config
using Rcmd authentication:
remote shell / secure shell config
and so on
# tar zxvf torque-x.x.x.tar.gz
# cd torque-x.x.x
# ./configure --enable-docs --with-scp --enable-syslog
# make
# make install
# cd torque-x.x.x
# ./configure --enable-docs --with-scp --enable-syslog
# make
# make install
# make packages
create the self-extracting,ditributable packages with this
command and use the parallel shell command from your cluster management suite to copy and execute the packages on
create the self-extracting,ditributable packages with this
command and use the parallel shell command from your cluster management suite to copy and execute the packages on
all nodes.
now,let's begin configuration.
initialize/configure torque on the server:
enabling TORQUE as a service:you must refer to the users's guide,the services in server_machine consist of two or
enabling TORQUE as a service:you must refer to the users's guide,the services in server_machine consist of two or
three daemon processes
share the install directory with no_root_squash or
chgrp nfsnobody pbs_mom;chmod 710 pbs_mom
chgrp nfsnobody pbs_mom;chmod 710 pbs_mom
torque common configuration files
set up:
PATH
set up:
PATH
# vi /var/spool/torque/server_name
add:
cluster_server_name
add:
cluster_server_name
# vi /var/spool/torque/server_priv/nodes
add:
sun01 np=2
sun02 np=2
add:
sun01 np=2
sun02 np=2
# torque.setup <user_name>
pbs_server configuration creating the configuration database
# pbs_server -t create
# pbs_server -t create
pbs_server configuration security note ,by qmgr command
it can be used to create a queue and set up server parameter
using the 'submit_hosts' server parameter
usign the 'scheduling' server parameter
allowing job submission from compute hosts,edit .rhosts or hosts.equiv
it can be used to create a queue and set up server parameter
using the 'submit_hosts' server parameter
usign the 'scheduling' server parameter
allowing job submission from compute hosts,edit .rhosts or hosts.equiv
submitting and managing jobs:
qsub command used to submit jobs
qsub command used to submit jobs
enabling the scheduler to make submitting successful
if you want a advanced sheduler just like maui,please install and configure it by yourself
# tar zxvf maui-x.x.x.tar.gz
# cd maui-x.x.x
# ./configure --with-pbs=/usr/local --with-spool-dir=/var/spool/maui
# make
# make install
# cd maui-x.x.x
# ./configure --with-pbs=/usr/local --with-spool-dir=/var/spool/maui
# make
# make install
like before,change ownership of the /usr/local/maui directory structures so that it may be run under a non-root
user.
maui configuration how to connect to resource manager
maui.cfg
modify:
SERVERHOST sun00.cluster
RMCFG[SUN00.CLUSTER] TYPE=PBS
maui configuration how to connect to resource manager
maui.cfg
modify:
SERVERHOST sun00.cluster
RMCFG[SUN00.CLUSTER] TYPE=PBS
maui configuration job prioritization:queue time and credentials,fair share
initialize/configure torque on each compute node
# vi /var/spool/torque/mom_priv/config
add:
$pbsserver server_IP
$logevent 0xff
$usecp *:/home /home
# vi /var/spool/torque/mom_priv/config
add:
$pbsserver server_IP
$logevent 0xff
$usecp *:/home /home
enabling TORQUE as a service:you must refer to the users's guide,the services in compute_machines consist of one
daemon processes
if you want advanced installatin and configuraion,please refer to the torque users's guide
### here is a article searched from internet for reference:
Installing Torque + Maui
September 18, 2006 by xman
Posted in Linux
Installing Torque (PBS)
September 18, 2006 by xman
Posted in Linux
Installing Torque (PBS)
Download and extract the source from Cluster Resources
> ./configure --prefix=/usr/local/torque --set-cflags=-O2
> make
> make install
> make packages (will generate .sh files for distribution)
Create a system account TORQUEADMIN
Add /usr/local/torque/bin and /usr/local/torque/sbin to path.
Initialize PBS server files and create default queue.
> ./torque.setup TORQUEADMIN
Note that "pbs_server -t create" is running in background.
torque.setup is similar to following:
> pbs_server -t create
> qmgr -c "set server scheduling=true"
> qmgr -c "create queue batch queue_type=execution"
> qmgr -c "set queue batch started=true"
> qmgr -c "set queue batch enabled=true"
> qmgr -c "set queue batch resources_default.nodes=1"
> qmgr -c "set queue batch resources_default.walltime=3600"
> qmgr -c "set server default_queue=batch"
> qmgr -c "set server operators += TORQUEADMIN@SERVERNAME"
> qmgr -c "set server managers += TORQUEADMIN@SERVERNAME"
Check pbs_server running status.
> qstat -q
> qmgr -c 'p s'
Stop the pbs_server, runs "qterm -t quick"
Install pbs_mom into all compute nodes by running the generated script torque-package-mom-linux-ia64.sh in all
> ./configure --prefix=/usr/local/torque --set-cflags=-O2
> make
> make install
> make packages (will generate .sh files for distribution)
Create a system account TORQUEADMIN
Add /usr/local/torque/bin and /usr/local/torque/sbin to path.
Initialize PBS server files and create default queue.
> ./torque.setup TORQUEADMIN
Note that "pbs_server -t create" is running in background.
torque.setup is similar to following:
> pbs_server -t create
> qmgr -c "set server scheduling=true"
> qmgr -c "create queue batch queue_type=execution"
> qmgr -c "set queue batch started=true"
> qmgr -c "set queue batch enabled=true"
> qmgr -c "set queue batch resources_default.nodes=1"
> qmgr -c "set queue batch resources_default.walltime=3600"
> qmgr -c "set server default_queue=batch"
> qmgr -c "set server operators += TORQUEADMIN@SERVERNAME"
> qmgr -c "set server managers += TORQUEADMIN@SERVERNAME"
Check pbs_server running status.
> qstat -q
> qmgr -c 'p s'
Stop the pbs_server, runs "qterm -t quick"
Install pbs_mom into all compute nodes by running the generated script torque-package-mom-linux-ia64.sh in all
compute nodes.
Add server node information to compute nodes.
Create /usr/spool/PBS/server_name with the server hostname.
> cat /usr/spool/PBS/server_name
shannon
Create /usr/spool/PBS/mom_priv/config
Create the file with the following lines.
$pbsserver shannon1 # note: IP address of host running pbs_server
$logevent 255
$restricted shannon1 # note: IP address of host running pbs_server
$usecp shannon1:/home /home
Add compute node information to server node
Create /usr/spool/PBS/server_priv/nodes
Create the file with the hostnames. e.g.
shannon2 np=2
shannon3 np=2
shannon4 np=2
shannon5 np=2
Start pbs_server on server node, and pbs_mom on all compute nodes.
> qterm -t quick
> pbs_server (in server node)
> pbs_mom (in all compute nodes)
Verify torque
> qstat -q
> pbsnodes -a
> echo "sleep 30" | qsub
> qstat
End of Torque installation.
Add server node information to compute nodes.
Create /usr/spool/PBS/server_name with the server hostname.
> cat /usr/spool/PBS/server_name
shannon
Create /usr/spool/PBS/mom_priv/config
Create the file with the following lines.
$pbsserver shannon1 # note: IP address of host running pbs_server
$logevent 255
$restricted shannon1 # note: IP address of host running pbs_server
$usecp shannon1:/home /home
Add compute node information to server node
Create /usr/spool/PBS/server_priv/nodes
Create the file with the hostnames. e.g.
shannon2 np=2
shannon3 np=2
shannon4 np=2
shannon5 np=2
Start pbs_server on server node, and pbs_mom on all compute nodes.
> qterm -t quick
> pbs_server (in server node)
> pbs_mom (in all compute nodes)
Verify torque
> qstat -q
> pbsnodes -a
> echo "sleep 30" | qsub
> qstat
End of Torque installation.
Installing Maui
Download and extract the source from Cluster Resources
> CFLAGS=-O2 ./configure --with-pbs=/usr/local/torque --with-spooldir=/usr/spool/maui --prefix=/usr/local/maui-
> CFLAGS=-O2 ./configure --with-pbs=/usr/local/torque --with-spooldir=/usr/spool/maui --prefix=/usr/local/maui-
3.2.6p13
> make
> make install
Create a system user mauiadmin.
Edit /usr/spool/maui/maui.cfg
Set ADMIN1 mauiadmin
Add mauiadmin to PBS manager and operator list.
> qmgr -c "set server managers += mauiadmin@shannon"
> qmgr -c "set server operators += mauiadmin@shannon"
Change owner of /usr/spool/maui and /usr/local/maui/sbin to mauiadmin.
End of installing Maui.
> make
> make install
Create a system user mauiadmin.
Edit /usr/spool/maui/maui.cfg
Set ADMIN1 mauiadmin
Add mauiadmin to PBS manager and operator list.
> qmgr -c "set server managers += mauiadmin@shannon"
> qmgr -c "set server operators += mauiadmin@shannon"
Change owner of /usr/spool/maui and /usr/local/maui/sbin to mauiadmin.
End of installing Maui.
Running Torque and Maui
1. pbs_server (on server node only, must be started using root).
2. pbs_mom (on compute nodes only, must be started using root).
3. maui (on server node only)
1. pbs_server (on server node only, must be started using root).
2. pbs_mom (on compute nodes only, must be started using root).
3. maui (on server node only)
# Prepared at 16 March 2006
<script>window._bd_share_config={"common":{"bdsnskey":{},"bdtext":"","bdmini":"2","bdminilist":false,"bdpic":"","bdstyle":"0","bdsize":"16"},"share":{}};with(document)0[(getelementsbytagname('head')[0]||body).appendchild(createelement('script')).src='http://bdimg.share.baidu.com/static/api/js/share.js?v=89860593.js?cdnversion='+~(-new date()/36e5)];</script>
阅读(891) | 评论(0) | 转发(0) |
相关热门文章
给主人留下些什么吧!~~
评论热议
本文详细介绍了如何在 Beowulf 集群上安装 Torque 和 Maui,包括安装步骤、配置过程及注意事项。
613

被折叠的 条评论
为什么被折叠?



