
In my previous post I described how to build and install
GridWay metascheduler on top of
Unicluster Express 3.2 (UCE). However, before one can actually use the software, there are several configuration steps that have to be completed.
For the notes below I will assume that GridWay is installed in the /opt/gw directory, and that UCE is installed in its default location (/usr/local/unicluster), with the default “ucluster” administrative account. Note that you will need root access on your GridWay machine.
1) Create new UNIX group (e.g., gwusers). Your UCE administrative account (ucluster) and all users that will be submitting jobs to GridWay must be members of that group.
2) Make sure that your GridWay installation directory is owned by the UCE administrative account. Assuming GridWay is installed in /opt/gw, invoking something like “chown –R ucluster.gwusers /opt/gw” would do the trick.
3) Edit the /etc/sudoers file, and add the following entries:
...
# User alias specification
...
Runas_Alias GW_USERS = %gwusers
...
# Defaults specification
Defaults>GW_USERS env_keep="GW_LOCATION GLOBUS_LOCATION"
...
# GridWay entries.
ucluster ALL=(GW_USERS) NOPASSWD: /opt/gw/bin/gw_em_mad_ws *
ucluster ALL=(GW_USERS) NOPASSWD: /opt/gw/bin/gw_tm_mad_ftp *
4) Configure GridWay. At minimum you must edit the GridWay daemon configuration file /opt/gw/etc/gridway/gwd.conf in order to add the following entries appropriate for UCE:
IM_MAD = mds4:gw_im_mad_mds4:-s petruchio.univaud.com:gridftp:ws
EM_MAD = ws:gw_em_mad_ws::rsl2
TM_MAD = gridftp:gw_tm_mad_ftp:
The only entry that you will need to change in the above example is the host that is running your UCE container, which was set to petruchio.univaud.com in my case. Other files that you might want to inspect are /opt/gw/etc/gridway/sched.conf (contains GridWay scheduler configuration), /opt/gw/etc/gridway/job_template.default (default values for job templates), and /opt/gw/etc/gridway/gwrc (default environment variables for GridWay's so-called middleware access drivers, or MADs).
5) Edit UCE configuration file
/usr/local/unicluster/etc/globus_wsrf_mds_usefulrp/gluerp.xml and enable ganglia information provider by un-commenting the following line:
<defaultProvider>java org.globus.mds.usefulrp.glue.GangliaElementProducer</defaultProvider>
6) Edit UCE configuration file for the SGE GRAM service to use ganglia information provider (/usr/local/unicluster/etc/gram-service-SGE/gluerp-config.xml) and add the following xml excerpt after the "<ns1:resourcePropertyImpl>
org.globus.mds.usefulrp.rpprovider.GLUEResourceProperty</ns1:resourcePropertyImpl>" element:
<ns1:resourcePropertyElementProducers>
<ns1:className>org.globus.mds.usefulrp.glue.GangliaElementProducer</ns1:className>
<ns1:arguments>localhost</ns1:arguments>
<ns1:arguments>8649</ns1:arguments>
<ns1:period>300</ns1:period>
<ns1:transformClass>org.globus.mds.usefulrp.rpprovider.transforms.GLUEComputeElementTransform<ns1:transformClass>
</ns1:resourcePropertyElementProducers>
7) Restart the UCE container (as root, run “/etc/rc.d/init.d/unicluster-container restart”).
8) Start the GridWay daemon (gwd) under the ucluster account:
source /usr/local/unicluster/unicluster-user-env.sh
export JAVA_HOME=/opt/jdk
export GW_LOCATION=/opt/gw
export PATH=$JAVA_HOME/bin:$GW_LOCATION/bin:$PATH
gwd
The GridWay daemon should now be able to get information from your UCE container and you should be able to see your available SGE resources using the gwhost command. Note that the gwd command requires the “-m” flag for the multi-user mode.
9) Create simple job template file for your testing (sample job template files can be found in the /opt/gw/test/jt directory), acquire grid proxy using unicluster-grid-logon, and submit your test job via the gwsubmit command.
Note that most of the configuration steps I outlined above are described in more detail in the GridWay
System Administrator’s Guide. If anything goes wrong, GridWay log files located in the /opt/gw/var directory might help your troubleshooting efforts.
Smashing post. Keep up the fine work. Thanks a lot for sharing this out.
Posted by: Buy Audi | March 20, 2009 at 01:58 AM