Questions tagged [hpc]

High Performance Computing encompasses the use of "supercomputers" with large numbers of CPUs, large parallel storage systems, and advanced networks to perform time-consuming calculations. Parallel algorithms and parallel storage are essential to this field, as are the challenges of complex, fast networking fabrics such as InfiniBand.

0 votes
0 answers
91 views

I am using the LSF job manager on an HPC cluster. Occasionally the Ansys jobs become "stuck" during execution. Once the jobs are stuck, the Ansys log and result files stop getting updated. The ...
Anand Patil's user avatar
0 votes
0 answers
67 views

In a cluster where the nodes are interconnected over Intel True Scale InfiniBand, an Open MPI job executed via Slurm fails on send unless, for testing purposes, I run it as root: Traceback (most ...
Youssef Eldakar's user avatar
0 votes
0 answers
75 views

As the question says, I am in the process of migrating to new compute nodes. The new servers are HPE ProLiant DL360 Gen10 and the operating system installed is Ubuntu. These are the specifications ...
Sâu's user avatar
  • 101
0 votes
1 answer
120 views

I have a NUMA system with two sockets, and I'm curious why the core IDs in NUMA0 are 0-15 and 32-47 instead of 0-31. Additional information: Hyper-Threading is disabled in the BIOS; some related boot args: ...
Eric's user avatar
  • 1
1 vote
1 answer
963 views

I have been facing this problem for a few days now, and I would like to preface this by stating that this is the first time I have ever used LDAP. So after debugging a bit around the error I have ...
omarelkady22's user avatar
3 votes
2 answers
2k views

I’ve got accounts on many HPC clusters. The machines have a minimal install, and the admins won’t add much else. I need to install lots of typical software. Normally I’d do this with apt, but of ...
projectshave's user avatar
0 votes
1 answer
192 views

I've noticed that connecting to the internet from the allocated compute node via Slurm-GCP keeps failing. For example, using wget from the login node works successfully: [me@gcp-login0 ~]$ wget https:/...
Mathews24's user avatar
  • 101
0 votes
1 answer
342 views

Hello, I have an Ubuntu HPC cluster and a problem with storage: whenever I try to access the storage from my compute nodes, I can't. I keep getting this error: mount:mounting 192.168.100.211:/cm/...
Dhamer Nader's user avatar
-1 votes
1 answer
900 views

I'm curious about SAS data transfer speed. As far as I understand, the maximum is 12 Gbps for the whole bus (not per drive), but I have a scenario where I would like to have a faster data rate (hopefully ...
zRISC's user avatar
  • 13
0 votes
0 answers
477 views

I recently added two new compute nodes to an HPE cluster, but surprisingly, I am unable to SSH into the new compute nodes from the head node. [Unable to SSH to new Compute Nodes][1] (base) [root@hn001 ~...
Aditya Kaushal's user avatar
1 vote
1 answer
358 views

I'm writing because I'm facing an issue that I cannot solve while trying to configure a cluster with a master node (or frontend node) running as a virtual machine and managing nodes over an InfiniBand network. I use ...
SimoneM's user avatar
  • 121
0 votes
1 answer
57 views

I have a number of physical (desktop) machines running at the office as part of a new network to handle processing & serving Open Source data; some of these machines also house VMs. At the moment, ...
Michael Hillman's user avatar
2 votes
0 answers
492 views

I am trying to connect 3 HP z840 workstations using: Mellanox ConnectX-3 VPI 40 / 56GbE Dual-Port QSFP Adapter MCX354A-FCBT Mellanox SX6005 12-port Non-blocking Unmanaged 56Gb/s Description of ...
theenemy's user avatar
  • 121
2 votes
1 answer
1k views

I'm managing a PBS/Torque HPC cluster, and now I'm setting up another cluster with SLURM. On the PBS cluster, I can set a queue to accept only interactive jobs with qmgr -c "set queue interactive_q ...
wdg's user avatar
  • 153
0 votes
0 answers
1k views

My IT admin has set up a cluster with 3 nodes, which is administered via Windows Server. VMs are hosted via Hyper-V, including an Ubuntu VM to which a substantial portion of the cluster's resources ...
ml_white_belt's user avatar
