Compiler and tools tricks

Theoretical background

Floating point arithmetic

IEEE Standards for Floating-Point Arithmetic

[From Thomas Wolf]

There are two different IEEE standards for floating-point arithmetic. They have numbers 754 and 854. Usually, people talk about the 754 standard, which is document ANSI/IEEE Std 754-1985, also an IEC standard: IEC 559:1989 and has also been published as ACM SIGPLAN Notices 22(2), pp. 9-25, Feb. 1987.

The ANSI/IEEE Std 854-1987 standard allows both binary and decimal bases for floating-point values, and it doesn't specify how floating-point numbers are encoded (i.e. the bit layout).
IEEE 754 revision group, draft standard

Language bindings

C99 includes a binding to IEEE 754. So does Fortran 2003 (chapter 14).
Language Independent Arithmetic (ISO/IEC 10967) defines some language bindings.
Most implementations default to the "non stop arithmetic" behaviour where arithmetic exceptions are masked at start up and consequently do not get delivered synchronously to the program. This can be usually overwritten by using a compiler flag (C/C++, Fortran) or by calling a system API in order to modify the exception mask. Unfortunately, these APIs are not part of any standard.

Here are the main ones:

Linux (and glibc based systems)

feenableexcept (1, 2)

Microsoft Windows

_controlfp or _control87 (1)

x86/x86_64 based systems

refer to (1)

Several packages contain some relevant code:
- Yorick:
  yorick, developed by David H. Munro of LLNL, contains a comprehensive inventory of these APIs in play/unix/{fpuset.c,config.sh}. David H. Munro posted on Usenet in July 1999 a clear explanation of the problem under the title "SIGFPE delivery". The code included is now obsolete. However, as I updated somewhat a local copy, it may still provide some useful information.
- f2c:
  One can refer to uninit.c in the source of libf2c.

Miscellaneous

Goldberg, David, What Every Computer Scientist Should Know about Floating-Point arithmetic, ACM Computing Surveys, Vol. 23, #1, March 1991, pp. 5-48
- original site
- html version (from Sun's Numerical Computation Guide along with Doug Priest's addendum)
- PostScript version including Doug Priest's supplement.
- Differences Among IEEE 754 Implementations by Doug Priest (included in the PostScript format document above)
Numerical Computing with IEEE Floating Point Arithmetic, Michael L. Overton, SIAM, 2001: a fine textbook
Accuracy and Stability of Numerical Algorithms, Second edition, Nicholas J. Higham, SIAM, 2002: the reference on the subject
Using accurate arithmetics to improve numerical reproducibility and stability in parallel applications, Yun He and Chris H.Q. Ding, Journal of Supercomputing, Vol.18, Issue 3, March 2001, pp. 259-277. Also Proceedings of International Conference on Supercomputing (ICS'00), May 2000, pp. 225-234 (see as well 1).
Some advices (from Herman D. Knoble)
[From J. Giles]
D. W. Matula, A Formalization of Floating-Point Numeric Base Conversion, IEEE Transactions on Computers, vol. C-19, no. 8, pp. 681-692, August 1970

Basically, the condition is that the number of decimal digits D, and the number of binary bits B should be related as follows:

10^(D-1) > 2^B

If that's the case, then translating the binary to decimal and back to binary again is an identity operation. So, for IEEE single precision, the number of bits is 24 (counting the hidden normalization) so you should have D=9 (or more). For IEEE double, B is 53 and you want D=17 (or more). For Intel's version of double extended, you have B=64, so D>=21.

Similarly, if you want to translate from decimal to binary and back to decimal and get the same answer, the required relation between the precisions is:

2^(B-1) > 10^D

That is, some people like to enter 0.1 and not get 0.09999997 back. For this, the maximum decimal digits you should use for the IEEE and Intel binary representations is:

single: D<=6
double: D<=15
double-extended: D<=18

Aliasing

In Fortran:
- Restrictions on dummy arguments are discussed in Fortran 90/95 Explained, M. Metcalf, J. Reid, Oxford University Press, section 5.7.2
  (Ref.: F66 section 8.4.2, F77 section 15.9.3.6., F90 section 12.5.2.9., F95 section 12.4.1.6)
- Evaluation rules related to "Function" are exposed in section 7.1.7 of the Fortran 95 standard.
  
  The evaluation of a function reference shall neither affect nor be affected by the evaluation of any other entity within the statement. If a function reference causes definition or undefinition of an actual argument of the function, that argument or any associated entities shall not appear elsewhere in the same statement.
A discussion from Object Oriented Numerics List
Restricted Pointers are Coming, Arch D. Robison, C/C++ Users Journal, July 1999
Type-Based Alias Analysis, Optimization that makes C++ faster than C, Mark Mitchell, Dr. Dobb's Journal, October 2000
HPC (High Performance Computing) in C, or why Fortran is often referred to as the HPC language..., Andy Polyakov
Todd Veldhuizen's papers

Bentley's Rules

Jon Bentley, author of Programming Pearls, published Writing Efficient Programs, in which he provides a unified, pragmatic treatment of program efficiency, independent of language and host platform. For ease of presentation, he codified his methods as a set of terse rules in the Appendix C of Writing Efficient Programs.

War stories related to development

Thomas Huckle's Collection of software bugs: some famous bug induced disasters
D. N. Arnold's Some disasters attributable to bad numerical computing
Jacques-Louis Lions, Lennart Lebeck, Jean-Luc Fauquembergue, Gilles Kahn, Wolfgang Kubbat, Stefan Levedag, Leonardo Mazzini, Didier Merle Thomson, Colin O'Halloran, ARIANE 5 Flight 501 Failure: Report by the Inquiry Board, European Space Agency Report, Paris, July 1996
Computers in Spaceflight. The NASA Experience: how NASA builds its software
For a summary of NASA flight computers and software reliability, see: The Reliability Challenge and Software Development (from Computers Take Flight. A History of NASA's Pioneering Digital Fly-By-Wire Project, James E. Tomayko, NASA, 2000)
Kernel traffic: a digest of linux-kernel, the linux kernel mailing list.
Communications of the ACM, Vol. 40, #4, April 1997, A Special Issue with the subtitle "The Debugging Scandal"
Byte, December 1995, How Software Doesn't Work, Byte, April 1998, Crash-Proof Computing
comp.risks (archive)
Safety-Critical Mailing List Forum (University of York)
Tom Van Vleck Software Engineering (of Multics fame)
Les Hatton's papers (1, 2, 3)

		GNU compilers - Linux (and any supported platform by gcc)	Sun compilers - Sun Solaris	HP compilers - HP HP-UX	IBM compilers - IBM AIX, Linux, …	SGI compilers - SGI IRIX	Compaq compilers - Tru64 Unix	Microsoft C++ compiler - Windows
version of targeted tools		GNU compilers (gcc, g++ [3.4.x])	Sun compilers (cc, CC [Workshop 6.0 update 1])	HP compilers (cc [11.x], aCC [3.x])	IBM compilers (xlc [8.0])	SGI compilers (cc, CC)	Compaq compilers (discontinued: cc [5.3], cxx [6.2])	Microsoft Visual C++ (CL.EXE) [7.0]
verify syntax only		`-fsyntax-only`	[cc] `-xe`	N/A	`-qsyntaxonly`	[all compilers] `-Hf` (was `-fe`)	`-Hf`	`/Zs`
floating point trapping (note)	principle	system dependent API calls (references)	compiler support: `-ftrap=xxx`	linker support: `+FP xxx`	compiler support: `-qflttrap=xxx`	link with `-lfpe` + `setenv TRAP_FPE ...`	compiler support (`-fptm<x>`)	API calls (`_control87`) + debugger support
floating point trapping (note)	trap DIV, INV, OV	API calls (Linux/glibc and x86 examples)	`-ftrap=common` or `-fnonstd`	`+FP VZO`	`-qflttrap=inv:ov:zero:en`	see example	default (`-fptm n`)	API calls
integer trapping		overflow: `-ftrapv` divide by zero: default	overflow: N/A divide by zero: default	overflow: N/A divide by zero: default	overflow: N/A divide by zero:`-qcheck=divzero` (implied by `-qcheck=all`)	overflow: `-DEBUG:div_check=3` (seems not to work) divide by zero: default	overflow: N/A divide by zero: default	debugger support
		GNU compilers - Linux (and any supported platform by gcc)	Sun compilers - Sun Solaris	HP compilers - HP HP-UX	IBM compilers - IBM AIX, Linux, …	SGI compilers - SGI IRIX	Compaq compilers - Tru64 Unix	Microsoft C++ compiler - Windows
standard conformance		[gcc, g++] `-ansi -pedantic`	[cc] `-Xa, -Xc`	[cc] `-Aa`, `+Mlevel` [aCC]`-Aa`, `+p`	`-qlanglvl=<xx>`	[cc, CC] `-ansi` [CC] `-LANG:std`	[cc] `-std<n>` [cxx] `-std strict_ansi`	`/Za`
run-time detection of uninitialized variable (note)		[tools] On linux x86, valgrind	[tools] various available	N/A	[compiler support] for stack storage: `-qinitiauto=FF` In practice `-qinitauto=FF -qflttrap=inv:ov:zero:en -qfloat=nans` (??) for heap storage `-qheapdebug` (AIX specific)	[compiler support] `-DEBUG:trap_uninitialized` (was `-trapuv`) static memory: `-Wl,'-f 0xFFFFFFFF'`	[compiler support] `-trapuv` (static memory: `-Wl,'-f 0xFFFFFFFF'` doesn't operate as on SGI) [tools] `atom -tool third`	[compiler support] `/GZ` (MS C++ 6.0), `/RTC1` (MS C++ 7.0) [tools] commercial tools
compile time flow analysis (note)		`-Wuninitialized -O`	[C] `lint -Nlevel=n` (n>=2)	[hp compilers] `+Onoinitcheck`	`-qinfo=uni`	N/A	N/A	`/Z3`, `/Z4`
put literal strings in read-only memory		default	[cc] `-xstrconst` [CC] `-features=conststrings`	[cc] `+ESlit`	`-qro`, `-qroconst`	[all compilers] `-use_readonly_const -G0 -rdata_shared`	[all compilers] `-readonly_strings`	`/GF`
abort on deferencing null pointer		default	default	`-z`	`-qcheck=nullptr` (implied by `-qcheck=all`)	default	N/A	default
		GNU compilers - Linux (and any supported platform by gcc)	Sun compilers - Sun Solaris	HP compilers - HP HP-UX	IBM compilers - IBM AIX, Linux, …	SGI compilers - SGI IRIX	Compaq compilers - Tru64 Unix	Microsoft C++ compiler - Windows
take advantage of aliasing rules (note)		`-fstrict-aliasing` (implied by `-O2` (gcc>=3.x))	[cc] `-xalias_level=std`	[cc] `+Optrs_ansi`, `+Optrs_strongly_typed`, `+Otype_safety=ansi`	`-qalias=ansi -O` (was `-qansialias`)	`-OPT:alias=typed`, (seems not to work: `-LANG:alias_const`)	`-ansi_alias`	N/A
check varargs		N/A (?)	N/A	N/A	N/A	`-DEBUG:varargs_interface_check`, `-DEBUG:varargs_prototypes`	[cc] `-vararg`	N/A
check calls	compile time	[gcc] K&R decl.: `-Wstrict-prototypes`, `-Wold-style-definition`, missing decl.: `-Wmissing-prototypes`	[cc] K&R decl.: `-fd`	[cc] missing decl.: `+w1`	decl. consistency: `-qinfo=dcl`, missing decl.: `-qinfo=pro`	[cc] missing decl.: `-fullwarn`	[cc] missing decl.: `-warnprotos`	??
check calls	link or run time	N/A	N/A	N/A	link time:`-qextchk` (AIX specific)	N/A	N/A	N/A
		GNU compilers - Linux (and any supported platform by gcc)	Sun compilers - Sun Solaris	HP compilers - HP HP-UX	IBM compilers - IBM AIX, Linux, …	SGI compilers - SGI IRIX	Compaq compilers - Tru64 Unix	Microsoft C++ compiler - Windows
link or run time memory debugging (note)		[tools] various (Valgrind, Electric Fence, ...) [library] glibc	[compilers] stack overflow check: `-xcheck=stkovf` (>= 7.0) [tools] various available [library] `man watchmalloc`	[tools] `gdb` (aka wdb)	`-qheapdebug` (AIX specific), `-qcheck=bound` (implied by `-qcheck=all`)	[all compilers] `-DEBUG:subscript_check` [library] `man malloc_ss`	[cc] `-check_bounds` [tools] `atom -tool third`	build in debug mode, buffer security check: `/GS` (MS C++ 7.0), `/RTC1` (MS C++ 7.0) [tools] commercial tools, MS pageheap, built-in facilities defined in `crtdbg.h`
flags for debugger		`-g`, `-ggdb`, `-g3`, `-ggdb3`	`-g`, `-xs`, `-g0`	`-g`, [aCC] `-g0`, `+objdebug`, `+d`	`-g`, `-qfullpath`, `-qlinedebug`	`-g`, `-g3`, [CC] `-gslim`	`-g0`, `-g1`, `-g2`, `-g3`, [cxx] `-gall`	build in debug mode
reentrant code		(glibc based systems, e.g. Linux) `-D_REENTRANT`	`-mt`	`-D_POSIX_C_SOURCE=199506L` (c.f. `man pthread`) [aCC >=3.30] `-mt`	use the `..._r` commands (`xlc_r`, ...)	`-D_POSIX_C_SOURCE=199506L` (c.f. `man 3 intro`)	[all compilers] `-pthread`	`/MD`, `/ML`, `/MT`
		GNU compilers - Linux (and any supported platform by gcc)	Sun compilers - Sun Solaris	HP compilers - HP HP-UX	IBM compilers - IBM AIX, Linux, …	SGI compilers - SGI IRIX	Compaq compilers - Tru64 Unix	Microsoft C++ compiler - Windows

		GNU compiler - Linux (and any supported platform by gcc)	Intel compiler - Linux	Sun compiler - Sun Solaris	HP compiler - HP HP-UX	IBM compiler - IBM AIX, Linux, …	SGI compilers - SGI IRIX	Compaq compiler - Tru64 Unix	Compaq Fortran compiler - Windows	Salford Fortran compilers - Windows	NAGWare Fortran compiler (any supported platform)	Lahey Fortran compiler
version of targeted tools		GNU compiler (gfortran [4.2.x], g77 [discontinued after 3.4.x])	Intel Fortran compiler (ifort [9.1])	Sun compiler (f90, f95 [Workshop 7.0])	HP compiler (f90 [2.4])	IBM compiler (xlf [10.1])	SGI compilers (f90 [7.3])	Compaq compiler (discontinued: f90, f95 [5.5])	Compaq Visual Fortran (discontinued: DF [6.6]) (Windows Fortran compilers)	Salford Fortran compilers FTN77 [4.0], FTN95 [3.0] (Windows Fortran compilers)	NAGWare compiler (f95 [5.0])	Lahey/Fujitsu compiler lf95 [6.0] (Windows Fortran compilers)
verify syntax only		`-fsyntax-only`	`-syntax`, `-y`	N/A	N/A	N/A	`-Hf` (was `-fe`)	-syntax_only	`/syntax_only`	N/A	`-M` `-M -nomod` (no module files produced)	N/A
floating point trapping (note)	principle	[gfortran] `-ffpe-trap=xxx` [g77] system dependent API calls (references)	compiler support (`-fpe<x>`)	compiler support: `-ftrap=xxx`	linker support: `+FP xxx`	compiler support: `-qflttrap=xxx`	link with `-lfpe` + `setenv TRAP_FPE ...`	compiler support (`-fpe<x>`)	`/fpe:<level>`	API calls	compiler support: `-ieee=xxx`	`--trap <args>`
floating point trapping (note)	trap DIV, INV, OV	[gfortran] `-ffpe-trap=invalid,zero,overflow` [g77] API calls (Linux/glibc and x86 examples)	`-fpe 0`	`-ftrap=common` or `-fnonstd`	`+FP VZO`	`-qflttrap=inv:ov:zero:en`	see example	default (`-fpe`)	`/fpe:0` (non default on x86)	default	default (`-ieee=stop`)	`-trap dio`
integer trapping		overflow: `-ftrapv` divide by zero: default	overflow: N/A divide by zero: default	overflow: N/A divide by zero: default	overflow: possible with directive divide by zero: default	overflow: N/A divide by zero: default	overflow: `-DEBUG:div_check=3` (seems not to work) divide by zero: default	overflow: `-check overflow` divide by zero: default	overflow: `/check:overflow` divide by zero: default	overflow: N/A divide by zero: default	overflow: N/A divide by zero: default	overflow: N/A divide by zero: default
		GNU compiler - Linux (and any supported platform by gcc)	Intel compiler - Linux	Sun compiler - Sun Solaris	HP compiler - HP HP-UX	IBM compiler - IBM AIX, Linux, …	SGI compilers - SGI IRIX	Compaq compiler - Tru64 Unix	Compaq Fortran compiler - Windows	Salford Fortran compilers - Windows	NAGWare Fortran compiler (any supported platform)	Lahey Fortran compiler
standard conformance		`-pedantic`	`-stand`	`-ansi`	`+langlvl=xx`	`-qlanglvl=<xx>`	`-ansi`	`-std<xx>`	`/stand`	`/ANSI`, [FTN95] `/ISO`, `/RESTRICT_SYNTAX`	default	`--f95`
run-time detection of uninitialized variable (note)		[tools] On linux x86, valgrind [gfortran >= 4.3] `-finit-real=nan`, `-finit-init=xxx`, `-finit-logical=xxx` (f2c has `-trapuv` since June 2001)	(compile with `-auto`) [compiler support] `-ftrapuv` (ifort 9.0 does not use NaN which makes it less useful), `-check uninit` [tools] valgrind on x86	[tools] various available (compile with `-stackvar`)	N/A	[compiler support] for stack storage: `-qinitiauto=FFF00000` In practice [xlf] `-qnosave -qinitauto=FFF00000 -qflttrap=inv:ov:zero:en`	[compiler support] `-DEBUG:trap_uninitialized` (was `-trapuv`) static memory: `-Wl,'-f 0xFFFFFFFF'`	[compiler support] `-trapuv` (compile with `-automatic`) (static memory: `-Wl,'-f 0xFFFFFFFF'` doesn't operate as on SGI) [tools] `atom -tool third`	[compiler support] N/A (`/automatic` may help somewhat, see as well 1) [tools] commercial tools	`/UNDEF`	[compiler support] `-nan`, `-C=undefined` [tools] some may help	`--check`
compile time flow analysis (note)		`-Wuninitialized -O`	(>=10.x) `-diag-enable sv` (disable object file generation)	`-XlistE`	`+Onoinitcheck`	N/A	`ftnlint` (limited analysis)	`-automatic` (optimisation must be on) (`-warn uninitialized` on by default)	default with `/automatic`	default	limited	??
put literal strings in read-only memory		default	default (`/assume:protect_constants`)	N/A	N/A	N/A	N/A	`-readonly_strings`, `-assume protect_constants` (default)	default (`/assume:protect_constants`)	`/CHECK`, [FTN95] `/FULL_UNDEF`	default or N/A	`--npca`/`--pca`
		GNU compiler - Linux (and any supported platform by gcc)	Intel compiler - Linux	Sun compiler - Sun Solaris	HP compiler - HP HP-UX	IBM compiler - IBM AIX, Linux, …	SGI compilers - SGI IRIX	Compaq compiler - Tru64 Unix	Compaq Fortran compiler - Windows	Salford Fortran compilers - Windows	NAGWare Fortran compiler (any supported platform)	Lahey Fortran compiler
abort on deferencing null pointer		N/A	`-check pointer`	??	??	??	??	??	??	`/FULL_UNDEF`	`-C=pointer`	??
stack oriented/static allocation		[gfortran >= 4.3] `-frecursive`, [g77] default / `-fno-automatic`	`-auto` / `-save` (default is `-auto_scalar`)	`-stackvar` / default	default (`+nosave`) / `+save`	`-qnosave` / `-qsave`	default / `-static`	`-automatic` / default (`-static`)	`/automatic` / default (`/static`)	default / `/SAV`	default / `-save`	default (`--nsav`) / `--sav`
disallow implicit declaration		[gfortran] `-fimplicit-none` [g77] `-Wimplicit`	`-u`, `-implicitnone`	`-u`	`+implicit_none`	`-u` (or `-qundef`)	`-u`	`-u` (or `-warn declarations`)	`/warn:declarations`	`/IMPLICIT_NONE`	`-u`	`--in`
check calls	compile time	[g77] (per file) default	(across files) `-gen-interfaces` and `-warn interfaces`	(per file) default	??	N/A	(per file) default	(per file) `-warn argument_checking`	(per file) `/warn:argument_checking`	N/A	(per file) default	(per file) default
check calls	link or run time	N/A	mismatch in number of arguments (Windows only) `/iface:cvf`	N/A	N/A	link time:`-qextchk` (AIX specific)	N/A	N/A	mismatch in number of arguments detected due to stdcall convention	`/CHECK`, `/FULLCHECK`, [FTN95] `/FULL_UNDEF`	run time: `-C=calls`	`--check`, `--checkglobal`
		GNU compiler - Linux (and any supported platform by gcc)	Intel compiler - Linux	Sun compiler - Sun Solaris	HP compiler - HP HP-UX	IBM compiler - IBM AIX, Linux, …	SGI compilers - SGI IRIX	Compaq compiler - Tru64 Unix	Compaq Fortran compiler - Windows	Salford Fortran compilers - Windows	NAGWare Fortran compiler (any supported platform)	Lahey Fortran compiler
link or run time memory debugging (note 1, note 2)		[gfortran, g77] `-fbounds-check` [tools] various (Valgrind, ...) [library] Linux/glibc	`-check bounds`	`-C`, `-xcheck=stkovf` (f95 >= 7.0) [tools] various available	`+check=all` [tools] `gdb` (aka wdb)	`-C`	`-DEBUG:subscript_check` (set the environment variable `F90_BOUNDS_CHECK_ABORT` to "YES")	`-C` [tools] `atom -tool third`	[DF] `/check:bounds`	`/CHECK`, `/FULLCHECK`, [FTN95] `/FULL_UNDEF`	[f95] `-C` memory tracing: `-mtrace` [tools] various	`--check`
stack trace on crash		gfortran >= 4.3 `-fbacktrace`	`-traceback`		`+fp_exception`	`-qsigtrap=xl__trcedump`			`/traceback`		`-gline`	default (`--trace`)
flags for debugger		`-g`, `-ggdb`, `-g3`, `-ggdb3`	`-g`, `-inline_debug_info`	`-g`, `-xs`, `-g0`	`-g`	`-g`, `-qfullpath`	`-g`, `-g3`	`-g0`, `-g1`, `-g2`, `-g3`, [f95] `-assume gfullpath`, [f95] `-ladebug`	build in debug mode	`/DEBUG`, [FTN95] `/FULL_DEBUG`	`-g`	`-g`
reentrant code		??	`-recursive`, `-threads`	??	??	use the `..._r` commands (`xlf_r`, ...)	??	`-reentrancy threaded`	`/recursive`, `/threads`	`/MULTI_THREADED` (FTN95>=3.0)	`-thread_safe`	`??`
		GNU compiler - Linux (and any supported platform by gcc)	Intel compiler - Linux	Sun compiler - Sun Solaris	HP compiler - HP HP-UX	IBM compiler - IBM AIX, Linux, …	SGI compilers - SGI IRIX	Compaq compiler - Tru64 Unix	Compaq Fortran compiler - Windows	Salford Fortran compilers - Windows	NAGWare Fortran compiler (any supported platform)	Lahey Fortran compiler

Systems	Example for x87	Example for SSE
Windows APIs	Use `_controlfp` or `_control87`. `#include <float.h> unsigned int cw; /* could use _controlfp */ cw = _control87(0,0) & MCW_EM; cw &= ~(_EM_INVALID\|_EM_ZERODIVIDE\|_EM_OVERFLOW); _control87(cw,MCW_EM);`	With Visual Studio 2005, `_controlfp` and `_control87` affect the control words for both the x87 and the SSE FPU.
Linux/glibc 2.2 and later	Use `feenableexcept`. `#define _GNU_SOURCE 1 #include <fenv.h> feenableexcept(FE_INVALID\|FE_DIVBYZERO\|FE_OVERFLOW);`	Use `feenableexcept`, which sets both the x87 and the SSE control words from glibc 2.3.3 onwards for x86_32.
Linux/glibc 2.1 and older	`#include <fpu_control.h> fpu_control_t cw; _FPU_GETCW(cw); cw &= ~(_FPU_MASK_IM \| _FPU_MASK_ZM \| _FPU_MASK_OM); _FPU_SETCW(cw);`	These APIs do not affect the SSE FPU.
FreeBSD	FreeBSD post March 2005 implements `feenableexcept`. For older versions, follow the example below. `#include <floatingpoint.h> fp_except_t cw; cw = fpgetmask(); fpsetmask(cw & ~(FP_X_INV \| FP_X_DZ \| FP_X_OFL));`	Unknown
Compilers supporting `xmmintrin.h`	These APIs do not affect the x87 FPU.	`#include <xmmintrin.h> _MM_SET_EXCEPTION_MASK(_MM_GET_EXCEPTION_MASK() & ~(_MM_MASK_INVALID\| _MM_MASK_DIV_ZERO\| _MM_MASK_OVERFLOW) );`
gcc compatible assembler (e.g. Cygwin)	`unsigned int cw; __asm__ __volatile__ ("fnstcw %0" : "=m" (cw)); cw &= ~(0x01 \| 0x04 \| 0x08); __asm__ __volatile__ ("fldcw %0" : : "m" (cw));`	`unsigned int cw; __asm__ __volatile__ ("stmxcsr %0" : "=m" (cw)); cw &= ~((0x01\|0x04\|0x08) << 7); __asm__ __volatile__ ("ldmxcsr %0" : : "m" (cw));`

Compiler and tools tricks

Table of Contents

Theoretical background

Floating point arithmetic

IEEE Standards for Floating-Point Arithmetic

Language bindings

Miscellaneous

Aliasing

Bentley's Rules

War stories related to development

Numerical topics

Exchange Fortran unformatted data between heterogeneous machines

Approximate "diff"

Increasing the stack size

Code coverage tools

Code profiling

make

Bourne shell

Static analyzers

lint

Source browser

Source beautifier

Metrics

Debuggers

"Command line" debuggers

Graphical debuggers

Availability and setup of DDD

Debuggers for parallel applications

Trace debugger

Various debugger tricks

Comparison tables

C/C++

Fortran

Notes

Compaq (Digital)

Debug flags

Tracing flags

Runtime checking

Linux/glibc

Documentation

Runtime checking

Floating point trapping

GNU compilers

Documentation

Debug flags

Tracing flags

libstd++ debug mode

Floating point implementation

HP

Debug flags

Tracing flags

Runtime checking

IBM

xlc and xlf debug flags

xlc debug flags

xlf debug flags

NAGWare

Availability and documentation

Debug flags

"Mandatory" options

Other useful flags

Runtime checking

Portland Group compilers

Availability and documentation

Debug flags

SGI

Debug flags

Floating point trapping

Tracing flags

Sun

Debug flags

Tracing flags

Runtime checking

Windows and Linux Fortran compilers

Debug flags

x86 specifics

Floating point trapping

Floating point precision mode

Compilers on super-computers

NEC SX