Sun Grid Engine 5.3 Linux: maintenance/security patch


Status: RELEASED
Patch Id: 113852-06
***********************************************************************
READ THE TERMS OF THE AGREEMENT ("AGREEMENT") IN THE LEGAL_LICENSE.TXT
FILE CAREFULLY BEFORE USING THIS SOFTWARE. BY USING THE SOFTWARE, YOU
AGREE TO THE TERMS OF THIS AGREEMENT. IF YOU DO NOT AGREE TO ALL OF THE
TERMS, PROMPTLY DESTROY THE UNUSED SOFTWARE.
***********************************************************************
Summary: Sun Grid Engine 5.3 Linux: maintenance/security patch
Date:  Jan/20/2006
Installation Requirements:
Additional instructions may be listed below
Solaris Release: 
Sun OS Release: 
Unbundled Product: Sun Grid Engine
Unbundled Release: 5.3
Xref: 
Topic: 
Relevant Architecture: i386
BugId's fixed with this patch:

4658716 4665780 4670664 4670669 4673738 4675410 4676340 4677087 4682966 4683852 4686157 4692957 4696768 4697491 4699665 4700286 4701640 4706929 4708235 4708239 4712023 4713013 4716824 4718880 4719218 4719755 4721129 4721134 4722060 4723543 4727515 4728293 4731288 4731347 4732031 4733043 4733089 4733859 4735258 4735972 4740335 4740350 4740578 4741230 4742082 4742189 4744523 4745387 4745399 4745404 4746705 4747829 4749151 4753668 4753669 4754435 4755931 4756556 4756557 4760981 4769608 4775325 4776016 4776754 4776821 4778757 4778758 4778762 4780316 4787598 4787623 4790540 4790547 4790592 4791238 4791908 4792036 4794242 4795475 4802171 4802831 4805423 4807677 4811230 4813188 4813965 4815774 4815795 4816529 4816541 4818741 4819479 4822799 4833346 4835832 4838549 4838595 4838650 4841414 4842844 4842878 4844838 4845505 4847814 4847819 4851939 4859658 4860391 4866711 4869772 4869784 4876169 4881949 4885719 4885906 4885930 4886017 4886025 4886026 4893432 4930786 4930789 4949917 4952236 4952767 4957760 4969825 5018669 5018695 5018726 5018733 5018884 5019595 5019601 5019624 5019635 5020131 5020134 5020139 5020141 5020143 5020153 5020278 5020371 5021405 5040728 6185208 6252525 6340741 6366691 6370003 6370481 6370485

Changes incorporated in this version:

5040728 6185208 6252525 6340741 6366691 6370003 6370481 6370485

Patches accumulated and obsoleted by this patch:

Patches which conflict with this patch: 

Required Patches:

Obsoleted by:
 
Files Included in this Patch: 
<install_dir>/bin/glinux/qacct
<install_dir>/bin/glinux/qalter
<install_dir>/bin/glinux/qconf
<install_dir>/bin/glinux/qdel
<install_dir>/bin/glinux/qhost
<install_dir>/bin/glinux/qmake
<install_dir>/bin/glinux/qmod
<install_dir>/bin/glinux/qmon
<install_dir>/bin/glinux/qsh
<install_dir>/bin/glinux/qstat
<install_dir>/bin/glinux/qsub
<install_dir>/bin/glinux/qtcsh
<install_dir>/bin/glinux/sge_commd
<install_dir>/bin/glinux/sge_coshepherd
<install_dir>/bin/glinux/sge_execd
<install_dir>/bin/glinux/sge_qmaster
<install_dir>/bin/glinux/sge_schedd
<install_dir>/bin/glinux/sge_shadowd
<install_dir>/bin/glinux/sge_shepherd
<install_dir>/bin/glinux/sgecommdcntl
<install_dir>/lib/glinux/libXltree.so
<install_dir>/utilbin/glinux/adminrun
<install_dir>/utilbin/glinux/checkprog
<install_dir>/utilbin/glinux/checkuser
<install_dir>/utilbin/glinux/filestat
<install_dir>/utilbin/glinux/gethostbyaddr
<install_dir>/utilbin/glinux/gethostbyname
<install_dir>/utilbin/glinux/gethostname
<install_dir>/utilbin/glinux/getservbyname
<install_dir>/utilbin/glinux/infotext
<install_dir>/utilbin/glinux/loadcheck
<install_dir>/utilbin/glinux/now
<install_dir>/utilbin/glinux/openssl
<install_dir>/utilbin/glinux/qrsh_starter
<install_dir>/utilbin/glinux/rlogin
<install_dir>/utilbin/glinux/rsh
<install_dir>/utilbin/glinux/rshd
<install_dir>/utilbin/glinux/testsuidroot
<install_dir>/utilbin/glinux/uidgid
Problem Description: 
6370485 stale resource_unknown_list
6370481 increase PATH size limit of 2048 characters
6370003 long lines in accounting entries break qacct
6366691 utilbin/rsh can be used to gain root access
6340741 scheduler dies when array jobs are submitted with -now y
6252525 qmon: complex attributes not removeable
6185208 qmon and equal job arguments
5040728 Job error state broken
 
(from 113852-05)
5021405 CSP reconnect problem of scheduler and execd
5020371 sge_shepherd creates world writable files
5020278 a colon in a job name breaks qacct
5020153 mail bomb upon abort with tightly integrated par jobs
5020143 qdel XXX.YY- will delete the first array task of job XXX
5020141 qsh and qlogin accepted the options -h and -hold_jid and ignored them later
5020139 a stored job template in qmon sets -hold_jid to a wrong value
5020134 qhost output broken for global consumables
5020131 renaming a user deletes the user
5019635 schedd_job_info=true causes large delays with parallel job scheduling
5019624 qselect/qstat -l selection wrongly considers load and utilization
5019601 "vmem" in qstat -j keeps the max value
5019595 Dateformat YYMMDDhhmm was interpreted wrong (qacct, qsub, qalter,...)
5018884 SSL vulnerabilities stated in Sun Alert 57524
5018733 Empty parameters crashes qstat and qhost
5018726 qalter lacks -dl option!
5018695 loadsensor doing output to stderr can block
5018669 qrsh/qlogin: "Connection refused" due to race condition in shepherd
4969825 not supported array task dependencies are not rejected
4930786 global load values are ignored
4930789 An overwritten string attribute was ignored in the scheduler
4949917 qmon seg faults with a user hold job from qtcsh qtask file
4952236 Broken mail option with SGE 5.3p4 qrsh
4952767 qrsh -notify doesn't work
4957760 Fix needed for CERT CA-2003-26 Multiple Vulnerabilities in SSL/TLS
 
(from 113852-04)
4749151 Adding user to CSP secure system fails on S2.6, 7, 8.
4775325 wrong qstat -j diagnosis message indicates not enough PE slots
4813965 tightly integrated parallel array jobs do not work
4815795 qstat -alarm broken
4819479 qhost -q -l arch=xx crashes if a grid execution host is down
4822799 SGE(EE) cannot be installed on Solaris 10
4833346 qsh/qrsh/qlogin might core with segmentation violation
4835832 NOTIFY_SUSP signal only sent for first suspension of job
4838549 maxujobs scheduler config functionality is broken
4838595 maxujobs does not count jobs with certain state
4838650 Array job tasks may set queue in error state when started
4841414 Unable to delete task array job with negative increment
4842844 some jobs may stay long time in transferring state for hosts with many slots
4842878 qdel -u <user> does not delete all jobs of the user
4844838 sge_shepherd does not exit on SIGTERM
4845505 cannot qalter/qhold/qrls several tasks of same job
4847814 jobs in rescheduling state are not scheduled due to wrong ticket calculation
4847819 util/sge_update.sh fails to upgrade sge to sgeee
4851939 qmon->job control->pending jobs->Why? fails if not enough free slots in pe
4859658 scheduler crashes if user_sort=true and job priorities are changed with qalter
4860391 qmake dumps core when starting recursive make calls
4866711 SGE_O_* variables incorrectly set for tasks of tightly integrated jobs
4869772 Linux limits > 2GB are not set correctly
4869784 qmon "Qmon" resource file contains syntax errors
4876169 qrsh -l =1 -now no cause sge_qmaster to crash
4881949 Parallel jobs exceeding wall clock time are not killed
4885719 during installation error message about unset SGE_ROOT is printed
4885906 NSLOTS and NHOSTS incorrectly set in environment of tightly integrated tasks
4885930 failure of master task of a tightly integrated parallel job does not delete job
4886017 qstat -r -s z  command aborts
4886025 queuenames in qstat and in gui need more characters
4886026 max_u_jobs settings rejects submission though limit not reached
4893432 upgrade to openssl 0.9.7a
 
(from 113852-03)
"tar.gz" file was truncated and did not include all necessary files.
 
(from 113852-02)
4713013 qacct may display incorrect accounting information
4756556 .cshrc error causes [pro|epi]log,pe-[start|stop] failure
4756557 non-resolvable hosts in host_aliases file cause wrong hostname resolving
4760981 Empty sge_request file causes submission error
4769608 qalter shows wrong priority number when using negative priorities with -p option
4776016 execd does res. consuming process tracking even if no job is to be controlled   
4776754 complex values for user defined complexes are rejected with global host
4776821 qtcsh can't be used as normal tcsh
4778757 stepsize 0 in array job specification results in qmaster exception
4778758 memcpy leak in execd
4778762 Array jobs which contain only one task (id=1) will be handled as single job
4780316 race condition if signals are to be delivered in job's startup phase
4787598 schedd_job_info messages shown by qstat -j even if it is set to false
4787623 failover to shadow master leaves sge_schedd on the original qmaster host
4790540 sge_schedd process consumes more memory than needed if schedd_job_info=true
4790547 Job notification signals won't be delivered if user redefines suspend_method ...
4790592 conflicting policies can cause job being started and immediately suspended
4791238 SGE may create duplicate accounting entries for parallel jobs
4791908 job logging file exists but is empty in certain configurations
4792036 job arguments larger than 10k crash qmaster
4794242 wrong usage reported by qstat -j <jobid>   
4795475 qstat -f output broken for pe jobs on same queue
4802171 qacct -l selection broken
4802831 cannot set -C to null string as described in man qsub
4805423 STRING complex attribute handling with RELOP "!=" is broken
4807677 qrsh crash when command line arguments are longer than 4K  
4811230 qconf -Muser and qconf -Auser report no success messages 
4813188 qstat -r shows wrong dependencies
4815774 Uninitialized pointer cause segmentation fault in qsh/qrsh on submit only hosts
4816529 qmon crash when pressing Why for a list of selected jobs
4816541 no newline character at end of sge_aliases file may crash qsub
4818741 startup failure of qrsh job is reported as regular job exit
 
(from 113852-02)
4755931 possible file access problems on 64-bit file system with 32-bit binaries
4754435 OpenSSL 0.9.6c security vulnerability
4753669 qconf gets commd timeout
4753668 prevent deletion of still referenced objects
4747829 accounting record about qrsh termination incomplete
4746705 Typo in the message
4745404 qmake does an incorrect resource request if ARCH is an empty string
4745399 qmake without any information about parallel execution fails
4745387 qsh, qrsh and qlogin silently ignore options -ac, -dc, -sc and -w
4744523 no error message for interactive job start failure due to wrong DISPLAY settings
4742189 schedd_job_info = true causes immense daemons memory growth in large systems
4742082 Calculation failure in Functional policy
4741230 qmod help output is incomplete
4740578 load formula of the scheduler
4740350 problems with destin_id_list syntax
4740335 qmon dies with changes in Edit Tickets on Solaris64.
4735972 scheduler crashes if all subnodes of a node have 0 shares in sharetree
4735258 CLI: Wrong info for usage
4733859 Userset "defaultdepartment" accepts users in CLI
4733089 qmon dies after checking 'transfer' in 'queue control' window
4733043 qmon dumps core when mouse over an interactive job in Job Control window
4732031 OK without hostname in host_configuration kill sge_qmaster
4731347 can configure fshare/otickets in acls of type DEPT
4731288 qmon cluster config dialog does not show gid_range in SGE product mode
4728293 qmon gets shutdown with a word "global" in Cluster Configuration
4727515 maxujobs prevents dispatching even if job limit is not reached
4723543 Too small panes and cells to display some item names
4722060 CLI: invalid option "-jid jid" for qconf in qconf -help
4721134 qmon gets shutdown with the message "Segmentation Fault" in terminal
4721129 A misoperation in host configuration on qmon leads to qmaster daemon's death
4719755 wrong port output in qstat error message when qmaster not reachable
4719218 "Job Submission" GUI: blank text in pop out window
4718880 qsub/qalter -l <shortcut_name>... might select wrong resource.
4716824 qlogin and qrsh accept unsupported options
4658716 protocol doing termin. on failure for tightly integr. par. job could be leaner
 
(from 113852-01)
4712023 global load values can prevent dispatching of jobs
4708239 Allow specification of arguments to [rsh|rlogin|qlogin]_[daemon|command]
4708235 SGE should allow to start qrsh jobs when /etc/nologin exists
4706929 qmon does not display job predecessors in job control
4701640 problems launching jobs from qtcsh with "&"
4700286 complex default value not considered for load/suspend thresholds
4699665 qstat resource based job selection broken
4697491 signal notification can prevent delivery of actual suspend/termination signal
4696768 SGE(EE) allows to submit binary job scripts
4692957 non-privileged users can submit jobs with priority higher than 0
4686157 qhost -j is broken
4683852 qalter on running jobs can confuse consumable mgmt
4682966 qsh(1) ignores -S <shell> in sge_request(5) files
4677087 execd could crash when executing tightly integrated parallel jobs
4676340 Memory leak in sge_schedd
4675410 queue suspend threshold alarm nsuspend>1 does not susp. multiple jobs
4673738 allow or disallow "none" load formula
4670669 error message "can't set additional group id for job" for interactive jobs
4670664 parallel jobs (e.g. qmake) fail
4665780 qmaster error message during startup: global configuration doesn't exist - creat
Revision History: 

113852-01 113852-02 113852-03 113852-04 113852-05

Patch Installation Instructions: 
--------------------------------
Special Install Instructions: 
-----------------------------
 
   Important note if Sun Grid Engine has been installed with OpenSSL support
   -------------------------------------------------------------------------

   If Sun Grid Engine has been installed with OpenSSL support ("CSP mode")
   prior to SGEEE 5.3p3 (which was linked with OpenSSL 0.9.6c), the
   certificates which have been installed with these versions are
   incompatible with certificates installed with SGEEE 5.3p4 or later. All
   such certificates will need to be recreated after installing this patch
   and before restarting Sun Grid Engine. Please refer to the Sun Grid
   Engine Administration and User Manual for how to create new certificates
   with the utility script "sge_ca", which comes with the distribution.

   The reason for the incompatibility is a field name in the certificates
   that changed between OpenSSL versions 0.9.6 and 0.9.7:
   "uniqueIdentifier" has been renamed to "userId".
 
   Note for bug id 5020371 ("sge_shepherd creates world writable files")
   ---------------------------------------------------------------------
 
   If the execution daemon spool directory is located on NFS and user root
   on the execution host does not have read/write permissions there
   (which is often the case for security reasons), the shepherd process
   will continue to create some of the files in its job directory with
   world-writable permissions. If root on the NFS client has write
   permissions, the fix is effective without further changes after patch
   installation.

   Otherwise, the execution daemon spool directory must be placed on a
   local file system to make the fix effective. A local spool directory
   is also recommended for performance reasons.
 
   1. Changing the execution daemon spool directory for all hosts
      simultaneously:

        - no jobs may be running in the cluster
        - shut down qmaster
        - shut down all execution daemons
        - edit the global cluster configuration file

               <sge_root>/<sge_cell>/common/configuration

          and change the path of the configuration value

               execd_spool_dir

        - restart qmaster
        - restart your execution daemons
 
   2. Changing the execution daemon spool directory for each execution host
      individually:
 
        - no jobs may be running on the execution host where the spool
          directory is going to be changed
        - edit the local configuration for this execution host:
 
             % qconf -mconf  <hostname>
 
          and add the local spool directory:
 
             execd_spool_dir  <path_to_exec_spool_directory>
 
        - shut down and restart the execution daemon on this host
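
   For option 1, the edit of the global configuration file could be
   scripted roughly as follows. This is a minimal sketch and not part of
   the patch: the function name set_execd_spool_dir is hypothetical, and
   it assumes that the file holds one whitespace-separated "name value"
   pair per line. A backup copy is kept next to the original file.

```shell
# Hypothetical helper: rewrite the execd_spool_dir entry of a
# configuration file and keep the previous version as <file>.bak.
# Assumption: one "name value" pair per line, separated by whitespace.
# Usage: set_execd_spool_dir <configuration_file> <new_spool_dir>
set_execd_spool_dir() {
    conf_file=$1
    new_dir=$2

    # Keep a copy of the current file before changing anything.
    cp -p "$conf_file" "$conf_file.bak" || return 1

    # Replace only the value of execd_spool_dir; copy all other lines
    # unchanged.
    awk -v dir="$new_dir" '
        $1 == "execd_spool_dir" { printf "%-27s %s\n", $1, dir; next }
        { print }
    ' "$conf_file.bak" > "$conf_file"
}
```

   It would be called only while qmaster and all execution daemons are
   shut down, for example as
   set_execd_spool_dir <sge_root>/<sge_cell>/common/configuration <path_to_exec_spool_directory>.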
 
   In addition to these notes please read the full "Install Instructions"
   section later in this file about the conditions under which the patch
   itself can be installed.
 
tar.gz Patch Installation:
--------------------------
 
   This patch in 'tar.gz' format cannot be installed with 'patchadd' on Solaris
   systems. The patch is installed by unpacking the 'tar.gz' file(s) in this
   directory in <install_dir>. <install_dir> is usually your <sge_root>
   directory. Once installed, this patch is not visible with the
   "showrev -p" command on Solaris.

   Once installed, this patch cannot be backed out. You may make a backup
   copy of the files which will be overwritten when this patch is installed.
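
   Since the patch cannot be backed out, such a backup could be created
   along the following lines. This is only a sketch under assumptions:
   the function name backup_patched_files is hypothetical, and the "-T"
   option (read member names from a file) is provided by GNU tar and
   bsdtar but not by every tar implementation.

```shell
# Hypothetical helper: save the files a patch archive would overwrite.
# Usage: backup_patched_files <install_dir> <patch.tar.gz> <backup.tar>
# <backup.tar> should be an absolute path because the function changes
# into <install_dir> before it writes the backup archive.
backup_patched_files() {
    install_dir=$1
    patch_tgz=$2
    backup_tar=$3

    list_file=$(mktemp) || return 1

    # List the archive members and keep those that already exist as
    # regular files below <install_dir>.
    gzip -dc "$patch_tgz" | tar tf - | while IFS= read -r member; do
        if [ -f "$install_dir/$member" ]; then
            printf '%s\n' "$member"
        fi
    done > "$list_file"

    if [ -s "$list_file" ]; then
        # Archive the existing files with paths relative to <install_dir>.
        (cd "$install_dir" && tar cf "$backup_tar" -T "$list_file")
        status=$?
    else
        echo "nothing to back up" >&2
        status=0
    fi

    rm -f "$list_file"
    return "$status"
}
```

   The resulting archive can be unpacked in <install_dir> in the same way
   as the patch itself if the old files are needed again.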
 
   Please read "Install Instructions" later in this file and carry out
   all steps before you unpack the 'tar.gz' file(s) included in this patch.
 
   This patch in 'tar.gz' format must not be installed if the original
   package has been installed with 'pkgadd' on Solaris. In this case please
   install the available patches for Sun Grid Engine, Enterprise Edition
   from http://sunsolve.sun.com in 'pkgadd' format.
 
   The patch is installed by user root by unpacking the file(s) in the
   directory where the original package has been installed:
 
      # cd <install_dir>
      # gzip -dc <patchid>/<targzfile> | tar xvpf -
 
   After installing the patch you should correct the file permissions if
   your Sun Grid Engine installation is installed as an "admin user" system:
 
      # cd <sge_root>
      # util/setfileperm.sh <adminuser> <admingroup> <sge_root>
 
   where <adminuser> is the username of the "admin_user" of your global
   cluster configuration and <admingroup> is the group which you set during
   your initial installation for the files of your Sun Grid Engine
   distribution.
 
Install Instructions:
---------------------
 
   These installation instructions assume that you are running a homogeneous
   Sun Grid Engine cluster where all hosts share the same directory for the
   binaries. If you are running Sun Grid Engine in a heterogeneous
   environment (mix of 32-bit and 64-bit binaries for Solaris and/or other
   operating systems) it is only necessary to shut down the daemons for the
   architecture for which the patch is applied.  If you installed the
   binaries on a local partition, you only need to stop the Sun Grid Engine
   daemons on the host on which you are installing the patch.

   By default no jobs may be running when the patch is installed.
   There may be pending batch jobs, but no pending interactive jobs (qrsh,
   qmake, qsh, qtcsh).

   It is possible to install the patch with running batch jobs. To avoid a
   failure of the active "sge_shepherd" binary it is necessary to move the
   old shepherd binary (and copy it back prior to the installation of the
   patch).

   Installing the patch is not supported under any circumstances with
   running interactive jobs, 'qmake' jobs, or running parallel jobs which
   use the tight integration support (control_slaves=true is set in the PE
   configuration).
 
 
   Stopping the Sun Grid Engine cluster from starting jobs
   -------------------------------------------------------

   Disable all queues so that no new jobs are started:
 
      # qmod -d '*'
 
   Optional (only needed if there are running jobs which should continue to
   run when the patch is installed):
 
      # cd $SGE_ROOT/bin
      # mv <arch>/sge_shepherd <arch>/sge_shepherd.sge53
 
   It is important that the binary is moved with the "mv" command. It must
   not be copied because this could cause a crash of an active shepherd
   process of a running job when the patch is installed.
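
   The difference between moving and copying can be illustrated with
   inode numbers. The following sketch uses a dummy file in a scratch
   directory instead of the real binary; the helper name inode_of is
   hypothetical. A rename within the same file system keeps the inode, so
   a process executing the file still refers to the same data, while a
   file created afterwards under the old name gets an inode of its own.

```shell
# Print the inode number of a file (first field of "ls -i" output).
inode_of() {
    ls -i "$1" | awk '{ print $1 }'
}

# Scratch directory with a dummy file standing in for sge_shepherd.
demo_dir=$(mktemp -d)
echo "old binary" > "$demo_dir/sge_shepherd"
before=$(inode_of "$demo_dir/sge_shepherd")

# Rename: the inode number stays the same.
mv "$demo_dir/sge_shepherd" "$demo_dir/sge_shepherd.sge53"
after_mv=$(inode_of "$demo_dir/sge_shepherd.sge53")

# A new file created under the old name has a different inode.
echo "new binary" > "$demo_dir/sge_shepherd"
new_file=$(inode_of "$demo_dir/sge_shepherd")

echo "before mv: $before, after mv: $after_mv, new file: $new_file"
```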
 
 
   Shutting down Sun Grid Engine qmaster and scheduler
   ---------------------------------------------------
 
   You need to shut down (and restart) the qmaster and scheduler daemon and
   all execution daemons on all Sun Grid Engine hosts.

   Stop the daemons on all your execution hosts first. Log in to each
   execution host and stop 'sge_execd' and 'sge_commd':

      # /etc/init.d/rcsge stop

   Then log in to your qmaster machine and stop 'sge_qmaster', 'sge_schedd',
   'sge_commd' and, if the machine is also an execution host, 'sge_execd':

      # /etc/init.d/rcsge stop

   Now verify with the 'ps' command that all Sun Grid Engine daemons on all
   hosts are stopped. If you renamed the shepherd binary so that running
   batch jobs continue to run during the patch installation, you must not
   kill the 'sge_shepherd' processes.
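
   The check with 'ps' could look like the following sketch. The function
   name filter_sge_daemons is hypothetical. The daemon names are taken
   from the list of files included in this patch, and the input is
   assumed to consist of "PID COMMAND" lines.

```shell
# Hypothetical helper: read "PID COMMAND" lines on stdin and print only
# those whose command is one of the Sun Grid Engine daemons shipped in
# this patch.
filter_sge_daemons() {
    awk '$2 ~ /^sge_(qmaster|schedd|execd|commd|shadowd|shepherd|coshepherd)$/'
}

# Typical use on each host (no output means no daemon is left):
#   ps -e -o pid= -o comm= | filter_sge_daemons
```

   Remaining 'sge_shepherd' entries are expected if the shepherd binary
   was renamed to keep batch jobs running.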
 
 
   Installing the patch and restarting Sun Grid Engine
   ---------------------------------------------------
 
   Now please install the patch by unpacking the 'tar.gz' files included in
   this patch as outlined above.
 
   After installing the patch you need to restart your cluster. Please login
   to your qmaster machine and enter:
 
      # /etc/init.d/rcsge
 
   Now you should repeat this step on all your execution hosts.
 
   After restarting Sun Grid Engine you may again enable your queues:
 
      # qmod -e '*'
 
   If you renamed the shepherd binary you may safely delete the old binary
   once all jobs that were running prior to the patch installation have
   finished.

README -- Last modified date:  Friday, January 20, 2006

 
 