| 1 |
#LyX 1.2 created this file. For more info see http://www.lyx.org/ |
|---|
| 2 |
\lyxformat 220 |
|---|
| 3 |
\textclass article |
|---|
| 4 |
\language english |
|---|
| 5 |
\inputencoding auto |
|---|
| 6 |
\fontscheme default |
|---|
| 7 |
\graphics default |
|---|
| 8 |
\paperfontsize default |
|---|
| 9 |
\papersize Default |
|---|
| 10 |
\paperpackage a4 |
|---|
| 11 |
\use_geometry 0 |
|---|
| 12 |
\use_amsmath 0 |
|---|
| 13 |
\use_natbib 0 |
|---|
| 14 |
\use_numerical_citations 0 |
|---|
| 15 |
\paperorientation portrait |
|---|
| 16 |
\secnumdepth 3 |
|---|
| 17 |
\tocdepth 3 |
|---|
| 18 |
\paragraph_separation indent |
|---|
| 19 |
\defskip medskip |
|---|
| 20 |
\quotes_language english |
|---|
| 21 |
\quotes_times 2 |
|---|
| 22 |
\papercolumns 1 |
|---|
| 23 |
\papersides 1 |
|---|
| 24 |
\paperpagestyle default |
|---|
| 25 |
|
|---|
| 26 |
\layout Section* |
|---|
| 27 |
|
|---|
| 28 |
Beowulf Performance Suite (BPS) |
|---|
| 29 |
\layout Standard |
|---|
| 30 |
|
|---|
| 31 |
The Beowulf Performance Suite (BPS) was designed to provide a comprehensive |
|---|
| 32 |
and comparative way of measuring cluster performance. |
|---|
| 33 |
Although BPS contains many benchmarking programs, BPS is not designed to |
|---|
| 34 |
benchmark clusters. |
|---|
| 35 |
BPS is designed as an analysis tool to measure differences due to hardware |
|---|
| 36 |
or software changes onthe same cluster. |
|---|
| 37 |
In addition, successfully running all the tests provides some assurances |
|---|
| 38 |
that the cluster is configured properly. |
|---|
| 39 |
The suite provides a graphical user interface for running the programs |
|---|
| 40 |
as well as generating HTML output files. |
|---|
| 41 |
The following tests are available: |
|---|
| 42 |
\layout Itemize |
|---|
| 43 |
|
|---|
| 44 |
bonnie - hard drive performance |
|---|
| 45 |
\layout Itemize |
|---|
| 46 |
|
|---|
| 47 |
stream - memory performance |
|---|
| 48 |
\layout Itemize |
|---|
| 49 |
|
|---|
| 50 |
netperf - general network performance |
|---|
| 51 |
\layout Itemize |
|---|
| 52 |
|
|---|
| 53 |
netpipe - detailed network performance |
|---|
| 54 |
\layout Itemize |
|---|
| 55 |
|
|---|
| 56 |
nas - nas NASA parallel tests |
|---|
| 57 |
\layout Itemize |
|---|
| 58 |
|
|---|
| 59 |
unixbench - general Unix benchmarks |
|---|
| 60 |
\layout Itemize |
|---|
| 61 |
|
|---|
| 62 |
lmbench - micro Linux benchmarks |
|---|
| 63 |
\layout Subsection* |
|---|
| 64 |
|
|---|
| 65 |
Installing BPS |
|---|
| 66 |
\layout Standard |
|---|
| 67 |
|
|---|
| 68 |
If the BPS package, bps-1.0-8.i386.rpm (version may vary), is not installed |
|---|
| 69 |
on your cluster, the package can be found at ftp.plogic.com. |
|---|
| 70 |
\layout Subsection* |
|---|
| 71 |
|
|---|
| 72 |
Running in X windows |
|---|
| 73 |
\layout Standard |
|---|
| 74 |
|
|---|
| 75 |
The BPS provides an X windows interface. |
|---|
| 76 |
The interface can be started by executing the following command: |
|---|
| 77 |
\layout LyX-Code |
|---|
| 78 |
|
|---|
| 79 |
# xbps |
|---|
| 80 |
\layout Standard |
|---|
| 81 |
|
|---|
| 82 |
The desired set of benchmark(s) can then be selected from the list. |
|---|
| 83 |
The selected benchmarks can then be run by pressing the Run selected benches |
|---|
| 84 |
button. |
|---|
| 85 |
This will not work with Netpipe or Netperf as they require user settings. |
|---|
| 86 |
In addition, the NAS suite will only run on one node unless other settings |
|---|
| 87 |
are provided. |
|---|
| 88 |
After your benches have run, an HTML file of the results can be produced |
|---|
| 89 |
by clicking on the Create html file of results button. |
|---|
| 90 |
This will create a file called index.html in the logs folder which you can |
|---|
| 91 |
open with your browser. |
|---|
| 92 |
\layout Subsubsection* |
|---|
| 93 |
|
|---|
| 94 |
Settings |
|---|
| 95 |
\layout Standard |
|---|
| 96 |
|
|---|
| 97 |
Using the settings is the better way to run the benchmarks, and in the case |
|---|
| 98 |
of Netpipe and Netperf is necessary. |
|---|
| 99 |
The settings, however, are not much more complex than running the benches. |
|---|
| 100 |
|
|---|
| 101 |
\layout Itemize |
|---|
| 102 |
|
|---|
| 103 |
Log Directory - Sets the directory your bench logs will be placed into. |
|---|
| 104 |
Also, when creating an html file from the results, you must create the |
|---|
| 105 |
html file in the same directory as your log directory. |
|---|
| 106 |
Otherwise the program will not be able to find the files it needs to create |
|---|
| 107 |
the page. |
|---|
| 108 |
\layout Itemize |
|---|
| 109 |
|
|---|
| 110 |
Netperf and Netpipe - The settings for both of these look the same but are |
|---|
| 111 |
distinct, as Netpipe is different than Netperf. |
|---|
| 112 |
You must provide a valid receiver and sender node in both cases. |
|---|
| 113 |
The receiver node setting is the node that will be running the bench as |
|---|
| 114 |
a receiver, and the sender node is the one that will be running it as a |
|---|
| 115 |
sender. |
|---|
| 116 |
Since these benches run as benchmarks for the network communications between |
|---|
| 117 |
the two nodes, it is necessary to have both nodes specified. |
|---|
| 118 |
The interface is determined by the names of the hosts. |
|---|
| 119 |
\layout Itemize |
|---|
| 120 |
|
|---|
| 121 |
NAS - The default NAS settings are shown when you choose thisoption. |
|---|
| 122 |
The default settings will run the NAS suite on a single node for the S |
|---|
| 123 |
class size (i.e. |
|---|
| 124 |
workstation). |
|---|
| 125 |
However, if you have two or more nodes/processors, you can add them in |
|---|
| 126 |
the machine list. |
|---|
| 127 |
Because compilation is a part of the tests NAS runs, you can specify which |
|---|
| 128 |
compiler you want it to use. |
|---|
| 129 |
You should also select the MPI versionand number of processors you have. |
|---|
| 130 |
|
|---|
| 131 |
\layout Itemize |
|---|
| 132 |
|
|---|
| 133 |
Machine Info - This is standard information about your motherboard,memory, |
|---|
| 134 |
type of network interface, and the distribution of Linux youare using. |
|---|
| 135 |
\layout Itemize |
|---|
| 136 |
|
|---|
| 137 |
Prompting - This turns window prompts on/off. |
|---|
| 138 |
\layout Subsubsection* |
|---|
| 139 |
|
|---|
| 140 |
Finding out more about the benchmarks |
|---|
| 141 |
\layout Standard |
|---|
| 142 |
|
|---|
| 143 |
In the Help menu, you can find a one sentence synopsis of what each benchmark |
|---|
| 144 |
does by selecting Benchmark Info. |
|---|
| 145 |
\layout Subsection* |
|---|
| 146 |
|
|---|
| 147 |
Running in text mode |
|---|
| 148 |
\layout Standard |
|---|
| 149 |
|
|---|
| 150 |
To run bps in text mode enter: |
|---|
| 151 |
\layout LyX-Code |
|---|
| 152 |
|
|---|
| 153 |
# bps |
|---|
| 154 |
\layout Standard |
|---|
| 155 |
|
|---|
| 156 |
The following command line options are available: |
|---|
| 157 |
\layout LyX-Code |
|---|
| 158 |
|
|---|
| 159 |
# bps -h |
|---|
| 160 |
\layout LyX-Code |
|---|
| 161 |
|
|---|
| 162 |
Usage: /usr/bps/bin/bps <OPTIONS> |
|---|
| 163 |
\layout LyX-Code |
|---|
| 164 |
|
|---|
| 165 |
Options: |
|---|
| 166 |
\layout LyX-Code |
|---|
| 167 |
|
|---|
| 168 |
-b bonnie++ |
|---|
| 169 |
\layout LyX-Code |
|---|
| 170 |
|
|---|
| 171 |
-s stream |
|---|
| 172 |
\layout LyX-Code |
|---|
| 173 |
|
|---|
| 174 |
-f <send node>,<receive node> netperf to remote node |
|---|
| 175 |
\layout LyX-Code |
|---|
| 176 |
|
|---|
| 177 |
-p <send node>,<receive node> netpipe to remote node |
|---|
| 178 |
\layout LyX-Code |
|---|
| 179 |
|
|---|
| 180 |
-n <compiler>,<#processors), NAS parallel benchmarks |
|---|
| 181 |
\layout LyX-Code |
|---|
| 182 |
|
|---|
| 183 |
<test size>,<MPI>, compiler={gnu,pgi,intel} |
|---|
| 184 |
\layout LyX-Code |
|---|
| 185 |
|
|---|
| 186 |
<machine1,machine2,...> test size={A,B,C,dummy} |
|---|
| 187 |
\layout LyX-Code |
|---|
| 188 |
|
|---|
| 189 |
MPI={mpich,lam,mpipro} |
|---|
| 190 |
\layout LyX-Code |
|---|
| 191 |
|
|---|
| 192 |
-k keep NAS directory when finished |
|---|
| 193 |
\layout LyX-Code |
|---|
| 194 |
|
|---|
| 195 |
-u unixbench |
|---|
| 196 |
\layout LyX-Code |
|---|
| 197 |
|
|---|
| 198 |
-m lmbench |
|---|
| 199 |
\layout LyX-Code |
|---|
| 200 |
|
|---|
| 201 |
-l <log_dir> benchmark log directory |
|---|
| 202 |
\layout LyX-Code |
|---|
| 203 |
|
|---|
| 204 |
-w preserve existing log directory |
|---|
| 205 |
\layout LyX-Code |
|---|
| 206 |
|
|---|
| 207 |
-i <mboard manufacturer>, machine information |
|---|
| 208 |
\layout LyX-Code |
|---|
| 209 |
|
|---|
| 210 |
<mboard model>,<memory>, |
|---|
| 211 |
\layout LyX-Code |
|---|
| 212 |
|
|---|
| 213 |
<interconnect>,<linux ver> |
|---|
| 214 |
\layout LyX-Code |
|---|
| 215 |
|
|---|
| 216 |
-v show version |
|---|
| 217 |
\layout LyX-Code |
|---|
| 218 |
|
|---|
| 219 |
-h show this help |
|---|
| 220 |
\layout Standard |
|---|
| 221 |
|
|---|
| 222 |
If you are running the NAS tests from the bps script, you may tell it to |
|---|
| 223 |
keep the NAS directory (npb) in your log directory. |
|---|
| 224 |
This can be useful in tracking down compile errors (see the ~npb/make.log). |
|---|
| 225 |
HTML output files can be produced from the results files in <log directory> |
|---|
| 226 |
by executing: |
|---|
| 227 |
\layout LyX-Code |
|---|
| 228 |
|
|---|
| 229 |
# bps-html <log directory> |
|---|
| 230 |
\layout Subsection* |
|---|
| 231 |
|
|---|
| 232 |
Important Notes |
|---|
| 233 |
\layout Itemize |
|---|
| 234 |
|
|---|
| 235 |
All tests are archived in the src directory. |
|---|
| 236 |
|
|---|
| 237 |
\layout Itemize |
|---|
| 238 |
|
|---|
| 239 |
The bps suite is best run as a normal user, not root. |
|---|
| 240 |
Some of the tests (i.e. |
|---|
| 241 |
NAS parallel) will not run as root. |
|---|
| 242 |
\layout Itemize |
|---|
| 243 |
|
|---|
| 244 |
Not all features of the command line interface are possible with the GUI. |
|---|
| 245 |
\layout Itemize |
|---|
| 246 |
|
|---|
| 247 |
When using Netpipe and Netperf Benchmarks, rsh with no password must be |
|---|
| 248 |
permitted between the nodes. |
|---|
| 249 |
\layout Itemize |
|---|
| 250 |
|
|---|
| 251 |
Under normal operation, xbps will always preserve the existing log directory. |
|---|
| 252 |
This feature is to ensure previous results will not be overwritten. |
|---|
| 253 |
You can copy previous log files (from log directories) into the current |
|---|
| 254 |
log directory for bps-html conversion. |
|---|
| 255 |
\layout Itemize |
|---|
| 256 |
|
|---|
| 257 |
Also, the tests have been designed so that the BPS rpm only needs to be |
|---|
| 258 |
installed on the master node. |
|---|
| 259 |
For this to work, the BPS log directory must be mounted on all nodes (e.g. |
|---|
| 260 |
under /home). |
|---|
| 261 |
|
|---|
| 262 |
\layout Itemize |
|---|
| 263 |
|
|---|
| 264 |
When using the NAS Parallel Benchmarks it is advisable to use the the Paralogic |
|---|
| 265 |
MPI's which which have been tested to work with the NAS suite. |
|---|
| 266 |
Please see the NAS documentation for more information. |
|---|
| 267 |
Rather than limit potential BPS users, these are not made a part of the |
|---|
| 268 |
required packages list. |
|---|
| 269 |
The benchmark scripts have been written to rely on the three environment |
|---|
| 270 |
variables (for LAM-MPI, MPI-PRO, and MPICH). |
|---|
| 271 |
If you are having problems with the NAS benchmarks, extract the npb.tar.gz |
|---|
| 272 |
archive in the/usr/bps/src directory and try running the scripts manually. |
|---|
| 273 |
Consult the README.plogic file for more information. |
|---|
| 274 |
Also, if you wish to use the Portland Group or the Intel Compilers make |
|---|
| 275 |
sure they are properly configured. |
|---|
| 276 |
\layout Subsection* |
|---|
| 277 |
|
|---|
| 278 |
In Case of Problems |
|---|
| 279 |
\layout Standard |
|---|
| 280 |
|
|---|
| 281 |
The BPS suite is a collection of many tests. |
|---|
| 282 |
You should have minimal or no problems with the single machine tests. |
|---|
| 283 |
As more machines become involved with the tests, there is room for more |
|---|
| 284 |
configuration errors to arise. |
|---|
| 285 |
If a test does not run, check the test_name.log file in the log directory. |
|---|
| 286 |
In the case of the NAS tests, the resultsare in the form npb.COMPILER.MPI.CLASS.PRO |
|---|
| 287 |
CESSORS. |
|---|
| 288 |
|
|---|
| 289 |
\layout Standard |
|---|
| 290 |
|
|---|
| 291 |
In general, if you have problems with a test it may be best to run it from |
|---|
| 292 |
the command line. |
|---|
| 293 |
In the case of the NAS suite, the -k option will keep the npb directory |
|---|
| 294 |
in the log directory so you can run the tests more directly by using the |
|---|
| 295 |
run_suite script in the npb directory. |
|---|
| 296 |
Also the README.plogic file in the npb directly should provide more information |
|---|
| 297 |
on how the tests are run and how to resolve possible problems. |
|---|
| 298 |
\layout Subsection* |
|---|
| 299 |
|
|---|
| 300 |
NASA Benchmark Suite (details) |
|---|
| 301 |
\layout Standard |
|---|
| 302 |
|
|---|
| 303 |
The NAS suite can be run from the command line. |
|---|
| 304 |
The following options are required. |
|---|
| 305 |
You also need to list the machine names in npb/cluster/machines file (one |
|---|
| 306 |
per line). |
|---|
| 307 |
This file is used by MPI to start your programs. |
|---|
| 308 |
In the npb directory, run |
|---|
| 309 |
\layout LyX-Code |
|---|
| 310 |
|
|---|
| 311 |
# run_suite -h |
|---|
| 312 |
\layout LyX-Code |
|---|
| 313 |
|
|---|
| 314 |
Usage: ./run_suite <OPTIONS> |
|---|
| 315 |
\layout LyX-Code |
|---|
| 316 |
|
|---|
| 317 |
Options: |
|---|
| 318 |
\layout LyX-Code |
|---|
| 319 |
|
|---|
| 320 |
run_suite,v 0.3 2/12/2002 |
|---|
| 321 |
\layout LyX-Code |
|---|
| 322 |
|
|---|
| 323 |
Usage: ./run_suite <OPTIONS> |
|---|
| 324 |
\layout LyX-Code |
|---|
| 325 |
|
|---|
| 326 |
Options: |
|---|
| 327 |
\layout LyX-Code |
|---|
| 328 |
|
|---|
| 329 |
-v verbose output from make stage (default=make.log) |
|---|
| 330 |
\layout LyX-Code |
|---|
| 331 |
|
|---|
| 332 |
-c <compiler> compiler (gnu/pgi/intel) |
|---|
| 333 |
\layout LyX-Code |
|---|
| 334 |
|
|---|
| 335 |
-n <processors> number of processors |
|---|
| 336 |
\layout LyX-Code |
|---|
| 337 |
|
|---|
| 338 |
-t <test> test size (A,B,C,S) |
|---|
| 339 |
\layout LyX-Code |
|---|
| 340 |
|
|---|
| 341 |
-m <mpi> mpi version(lam,mpich,mpipro,dummy) |
|---|
| 342 |
\layout LyX-Code |
|---|
| 343 |
|
|---|
| 344 |
-o only build programs |
|---|
| 345 |
\layout LyX-Code |
|---|
| 346 |
|
|---|
| 347 |
-h show this help |
|---|
| 348 |
\layout LyX-Code |
|---|
| 349 |
|
|---|
| 350 |
To run on a single CPU use: '-c gnu -n 1 -t S -m dummy' |
|---|
| 351 |
\layout Subsubsection* |
|---|
| 352 |
|
|---|
| 353 |
The following is a description of the NAS tests. |
|---|
| 354 |
\layout Description |
|---|
| 355 |
|
|---|
| 356 |
BT is a simulated CFD application that uses an implicit algorithm to solve |
|---|
| 357 |
3dimensional (3D) compressible NavierStokes equations. |
|---|
| 358 |
The finite differences solution to the problem is based on an Alternating |
|---|
| 359 |
Direction Implicit (ADI) approximate factorization that decouples the x, |
|---|
| 360 |
y, and z dimensions. |
|---|
| 361 |
The resulting systems are BlockTridiagonal of 5x5 blocks and are solved |
|---|
| 362 |
sequentially along each dimension. |
|---|
| 363 |
|
|---|
| 364 |
\layout Description |
|---|
| 365 |
|
|---|
| 366 |
SP is a simulated CFD application that has a similar structure to BT. |
|---|
| 367 |
The finite differences solution to the problem is based on a BeamWarming |
|---|
| 368 |
approximate factorization that decouples the x, y, and z dimensions. |
|---|
| 369 |
The resulting system has scalar Pentadiagonal bands of linear equations |
|---|
| 370 |
that are solved sequentially along each dimension. |
|---|
| 371 |
|
|---|
| 372 |
\layout Description |
|---|
| 373 |
|
|---|
| 374 |
LU is a simulated CFD application that uses symmetric successive overrelaxation |
|---|
| 375 |
(SSOR) method to solve a seven block diagonal system resulting from finite |
|---|
| 376 |
difference discretization of the NavierStokes equations in 3D by splitting |
|---|
| 377 |
to into block Lower and Upper triangular systems. |
|---|
| 378 |
|
|---|
| 379 |
\layout Description |
|---|
| 380 |
|
|---|
| 381 |
FT contains the computational kernel of a 3D fast Fourier Transform (FFT)based |
|---|
| 382 |
spectral method. |
|---|
| 383 |
FT performs three one dimensional (1D) FFT's, one for each dimension. |
|---|
| 384 |
|
|---|
| 385 |
\layout Description |
|---|
| 386 |
|
|---|
| 387 |
MG uses a Vcycle MultiGrid method to compute the solution of the 3D scalar |
|---|
| 388 |
Poisson equation. |
|---|
| 389 |
The algorithm works continuously on a set of grids that are made between |
|---|
| 390 |
coarse and fine. |
|---|
| 391 |
It tests both short and long distance data movement. |
|---|
| 392 |
|
|---|
| 393 |
\layout Description |
|---|
| 394 |
|
|---|
| 395 |
CG uses a Conjugate Gradient method to compute an approximation to the smallest |
|---|
| 396 |
eigenvalue of a large, sparse, unstructured matrix. |
|---|
| 397 |
This kernel tests unstructured grid computations and communications by |
|---|
| 398 |
using a matrix with randomly generated locations of entries. |
|---|
| 399 |
|
|---|
| 400 |
\layout Description |
|---|
| 401 |
|
|---|
| 402 |
EP is an Embarrassingly Parallel benchmark. |
|---|
| 403 |
It generates pairs of Gaussian random deviates according to a specific |
|---|
| 404 |
scheme. |
|---|
| 405 |
The goal is to establish the reference point for peak performance of a |
|---|
| 406 |
given platform. |
|---|
| 407 |
|
|---|
| 408 |
\layout Subsection* |
|---|
| 409 |
|
|---|
| 410 |
Additional benchmark information: |
|---|
| 411 |
\layout Itemize |
|---|
| 412 |
|
|---|
| 413 |
General Information http://www.plogic.com/bps |
|---|
| 414 |
\layout Itemize |
|---|
| 415 |
|
|---|
| 416 |
bonnie++ Hard drive performance - http://www.coker.com.au/bonnie++ |
|---|
| 417 |
\layout Itemize |
|---|
| 418 |
|
|---|
| 419 |
stream Memory performance http://www.cs.virginia.edu/stream |
|---|
| 420 |
\layout Itemize |
|---|
| 421 |
|
|---|
| 422 |
netperf General network performance http://www.netperf.org/netperf/NetperfPage.html |
|---|
| 423 |
\layout Itemize |
|---|
| 424 |
|
|---|
| 425 |
netpipe Detailed network performance |
|---|
| 426 |
\layout Itemize |
|---|
| 427 |
|
|---|
| 428 |
http://www.scl.ameslab.gov/Projects/ClusterCookbook/nprun.html |
|---|
| 429 |
\layout Itemize |
|---|
| 430 |
|
|---|
| 431 |
unixbench General Unix benchmarks http://www.linuxdoc.org/HOWTO/Benchmarking-HOWTO. |
|---|
| 432 |
html |
|---|
| 433 |
\layout Itemize |
|---|
| 434 |
|
|---|
| 435 |
LMbench Low level benchmarks |
|---|
| 436 |
\layout Itemize |
|---|
| 437 |
|
|---|
| 438 |
http://www.bitmover.com/lmbench |
|---|
| 439 |
\layout Itemize |
|---|
| 440 |
|
|---|
| 441 |
NAS Parallel tests http://www.nas.nasa.gov/Software/NPB |
|---|
| 442 |
\the_end |
|---|