MPI+OMP in AOCC

export OMP_NUM_THREADS=2
export OMP_PLACES=cores
export OMP_PROC_BIND=close

mpirun -np 48 \
  --map-by ppr:12:numa:pe=2 \
  --bind-to core \
  ./lmp -sf omp -pk omp 2 -in in.AK1

FlagMeaning

-np 48	Use 48 MPI ranks total
--map-by ppr:12:numa:pe=2	12 ranks per NUMA node × 4 = 48 ranks; each gets 2 cores (for OMP)
--bind-to core	Binds each rank and its threads to specific physical cores
OMP_PLACES=cores	Each thread sticks to its allocated core
OMP_PROC_BIND=close	Threads are kept near their parent MPI rank

Updated in 250421

export OMP_NUM_THREADS=2
export OMP_PROC_BIND=spread
export OMP_PLACES=cores

mpirun -np 32 --map-by numa:PE=2 --bind-to core \
        -x OMP_NUM_THREADS \
        -x OMP_PLACES \
        -x OMP_PROC_BIND \
        lmp -sf omp -in benchmark.in

'LAMMPS' 카테고리의 다른 글

Possible Acclerations (0)	2025.05.16
LAMMPS with AOCC Updated 250502 (0)	2025.05.02
IntelOneAPI command line options for NUMA (0)	2025.04.03
LAMMPS IntelOneAPI build (2025.5.14) (0)	2025.03.04
AOCC, AOCL Cmake (optional) (0)	2025.03.03

NMGE

MPI+OMP in AOCC

'LAMMPS' 카테고리의 다른 글

티스토리툴바

MPI+OMP in AOCC

'LAMMPS' 카테고리의 다른 글

'LAMMPS' Related Articles

티스토리툴바