Hi I was working on installing Slurm and got most things sorted out but upon launching
sudo journalctl -fu slurmdbd
I get the following:
Jan 25 12:49:49 ... systemd[1]: Stopped slurmdbd.service - Slurm DBD accounting daemon. Jan 25 12:49:49 ... systemd[1]: Started slurmdbd.service - Slurm DBD accounting daemon. Jan 25 12:49:49 ... (slurmdbd)[1027887]: slurmdbd.service: Referenced but unset environment variable evaluates to an empty string: SLURMDBD_OPTIONS Jan 25 12:49:49 ... slurmdbd[1027887]: slurmdbd: error: Unable to open pidfile
/run/slurmdbd.pid': Permission denied Jan 25 12:49:49 ... slurmdbd[1027887]: slurmdbd: Not running as root. Can't drop supplementary groups Jan 25 12:49:49 ... slurmdbd[1027887]: slurmdbd: accounting_storage/as_mysql: _check_mysql_concat_is_sane: MySQL server version is: 8.0.40-0ubuntu0.24.04.1 Jan 25 12:49:49 ... slurmdbd[1027887]: slurmdbd: error: mysql_query failed: 1193 Unknown system variable 'wsrep_on' Jan 25 12:49:49 ... slurmdbd[1027887]: select @@wsrep_on; Jan 25 12:49:49 ... slurmdbd[1027887]: slurmdbd: error: mysql_db_get_var_str: null result from query
select @@wsrep_on;` Jan 25 12:49:49 ... slurmdbd[1027887]: slurmdbd: slurmdbd version 23.11.4 started
which I assume being the cause for sinfo not displaying correctly with the following message:
sinfo: error: Couldn't find the specified plugin name for auth/munge looking at all files sinfo: error: cannot find auth plugin for auth/munge sinfo: error: cannot create auth context for auth/munge sinfo: fatal: failed to initialize auth plugin
To be noted that munge seems to be working correctly, see below, and therefore I cannot pinpoint the cause of Slum not showing sinfo
option other than the issue I presented before.
I also attach my slurm.conf and slurmdbd.conf for clarity in case helpful; this should be the recommended installation for a single node machine. Thanks in advance!
slurm.conf
# Put this file on all nodes of your cluster.
# See the slurm.conf man page for more information.
#
ClusterName=<cluster_name>
SlurmctldHost=<host>
MpiDefault=none
ProctrackType=proctrack/cgroup
ReturnToService=2
SlurmctldPidFile=/run/slurmctld.pid
#SlurmctldPort=6817
SlurmdPidFile=/run/slurmd.pid
#SlurmdPort=6818
SlurmdSpoolDir=/var/lib/slurm/slurmd
SlurmUser=slurm
StateSaveLocation=/var/lib/slurm/slurmctld
SwitchType=switch/none
TaskPlugin=task/cgroup
#
# TIMERS
InactiveLimit=0
KillWait=30
MinJobAge=300
SlurmctldTimeout=120
SlurmdTimeout=300
Waittime=0
# SCHEDULING
SchedulerType=sched/backfill
SelectType=select/cons_tres
SelectTypeParameters=CR_CPU_Memory
#
#ACCOUNTING STORAGE PORT
AccountingStorageType=accounting_storage/slurmdbd
AccountingStorageHost=<host>
AccountingStoreFlags=job_comment
JobCompType=jobcomp/none
JobAcctGatherType=jobacct_gather/cgroup
JobAcctGatherFrequency=30
SlurmctldDebug=info
SlurmctldLogFile=/var/log/slurm/slurmctld.log
SlurmdDebug=info
SlurmdLogFile=/var/log/slurm/slurmd.log
#
# COMPUTE NODES
NodeName=<host> CoresPerSocket=8 ThreadsPerCore=2 RealMemory=126397 Sockets=1
PartitionName=local Nodes=ALL Default=YES MaxTime=INFINITE State=UP
slurmdbd.conf
DbdHost=localhost
DebugLevel=info
StorageHost=localhost
StorageLoc=slurm_acct_db
StoragePass=slurmdbpass
StorageType=accounting_storage/mysql
StorageUser=slurm
LogFile=/var/log/slurm/slurmdbd.log
PidFile=/run/slurmdbd.pid
SlurmUser=slurm
P.S. output of munge -n | unmunge
STATUS: Success (0)
ENCODE_HOST: <host> (127.0.1.1)
ENCODE_TIME: 2025-02-11 09:27:00 +0100 (1739262420)
DECODE_TIME: 2025-02-11 09:27:00 +0100 (1739262420)
TTL: 300
CIPHER: aes128 (4)
MAC: sha256 (5)
ZIP: none (0)
UID: <user_name> (1000)
GID: <user_name> (1000)
LENGTH: 0