-
Notifications
You must be signed in to change notification settings - Fork 41
/
Copy pathblas3.html
246 lines (212 loc) · 6.67 KB
/
blas3.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
<html>
<head>
<title>
BLAS3 - Basic Linear Algebra Subprograms - Level 3
</title>
</head>
<body bgcolor="#EEEEEE" link="#CC0000" alink="#FF3300" vlink="#000055">
<h1 align = "center">
BLAS3 <br>
Basic Linear Algebra Subprograms - Level 3
</h1>
<hr>
<p>
<b>BLAS3</b>
is a C++ library which
implements the Level 3 BLAS, or Basic Linear Algebra Subprograms.
</p>
<p>
The BLAS are a small core library of linear algebra utilities,
which can be highly optimized for various architectures. Software
that relies on the BLAS is thus highly portable, and will typically
run very efficiently.
</p>
<p>
The Level 3 BLAS are primarily for use in matrix-matrix operations,
such as matrix multiplication.
In certain cases, they may also be used to operate on the rows
or columns of a two-dimensional array.
</p>
<p>
The Level 3 BLAS were originally written in FORTRAN77. This version
of the library only contains a subset of the BLAS3 library (at
the moment, mainly just DGEMM and SGEMM). Arrays are stored according to the
FORTRAN convention. Thus, an M by N mathematical array might be stored
in a double precision FORTRAN array called "A" that is declared as
<pre>
double precision a(lda,sda)
</pre>
where LDA is the "leading" dimension and SDA is the "second" dimension.
It is natural for LDA to equal M, but it is only necessary that
M <= LDA. The matrix is stored as a vector of concatenated column
vectors, and the dimension statement is used to determine that columns
should be LDA units long. Thus, a typical entry A(I,J) of the
mathematical matrix is actually stored in the location FORTRAN
vector location I+(J-1)*LDA. Of course, in C, this location would be
I-1+(J-1)*LDA.
</p>
<h3 align = "center">
Licensing:
</h3>
<p>
The computer code and data files made available on this
web page are distributed under
<a href = "../../txt/gnu_lgpl.txt">the GNU LGPL license.</a>
</p>
<h3 align = "center">
Languages:
</h3>
<p>
<b>BLAS3</b> is available in
<a href = "../../c_src/blas/blas.html">a C version</a> and
<a href = "../../cpp_src/blas/blas.html">a C++ version</a> and
<a href = "../../f77_src/blas/blas.html">a FORTRAN77 version</a> and
<a href = "../../f_src/blas/blas.html">a FORTRAN90 version</a> and
<a href = "../../m_src/blas/blas.html">a MATLAB version.</a>
</p>
<h3 align = "center">
Related Data and Programs:
</h3>
<p>
<a href = "../../cpp_src/blas0/blas0.html">
BLAS0</a>,
a C++ library which
contains auxilliary functions for the Basic Linear Algebra Subprograms
(BLAS).
</p>
<p>
<a href = "../../cpp_src/blas1_c/blas1_c.html">
BLAS1_C</a>,
a C++ library which
contains basic linear algebra routines for vector-vector operations,
using single precision complex arithmetic.
</p>
<p>
<a href = "../../cpp_src/blas1_d/blas1_d.html">
BLAS1_D</a>,
a C++ library which
contains basic linear algebra routines for vector-vector operations,
using double precision real arithmetic.
</p>
<p>
<a href = "../../cpp_src/blas1_s/blas1_s.html">
BLAS1_S</a>,
a C++ library which
contains basic linear algebra routines for vector-vector operations,
using single precision real arithmetic.
</p>
<p>
<a href = "../../cpp_src/blas1_z/blas1_z.html">
BLAS1_Z</a>,
a C++ library which
contains basic linear algebra routines for vector-vector operations,
using double precision complex arithmetic.
</p>
<p>
<a href = "../../cpp_src/blas2/blas2.html">
BLAS2</a>,
a C++ library which
contains basic linear algebra subprograms (BLAS)
for matrix-vector operations;
</p>
<h3 align = "center">
Reference:
</h3>
<p>
<ol>
<li>
Thomas Coleman, Charles vanLoan,<br>
Handbook for Matrix Computations,<br>
SIAM, 1988,<br>
ISBN13: 978-0-898712-27-8,<br>
LC: QA188.C65.
</li>
<li>
Jack Dongarra, Jim Bunch, Cleve Moler, Pete Stewart,<br>
LINPACK User's Guide,<br>
SIAM, 1979,<br>
ISBN13: 978-0-898711-72-1,<br>
LC: QA214.L56.
</li>
<li>
Charles Lawson, Richard Hanson, David Kincaid, Fred Krogh,<br>
Algorithm 539:
Basic Linear Algebra Subprograms for Fortran Usage,<br>
ACM Transactions on Mathematical Software,<br>
Volume 5, Number 3, September 1979, pages 308-323.
</li>
</ol>
</p>
<h3 align = "center">
Source Code:
</h3>
<p>
<ul>
<li>
<a href = "blas3.cpp">blas3.cpp</a>, the source code.
</li>
<li>
<a href = "blas3.hpp">blas3.hpp</a>, the include file.
</li>
<li>
<a href = "blas3.sh">blas3.sh</a>,
BASH commands to compile the source code.
</li>
</ul>
</p>
<h3 align = "center">
Examples and Tests:
</h3>
<p>
<ul>
<li>
<a href = "blas3_prb.cpp">blas3_prb.cpp</a>
a sample calling program.
</li>
<li>
<a href = "blas3_prb.sh">blas3_prb.sh</a>,
BASH commands to compile and run the sample program.
</li>
<li>
<a href = "blas3_prb_output.txt">blas3_prb_output.txt</a>,
the output file.
</li>
</ul>
</p>
<h3 align = "center">
List of Routines:
</h3>
<p>
<ul>
<li>
<b>DGEMM</b> computes C = alpha * A * B and related operations.
</li>
<li>
<b>DTRMM</b> performs B:=A*B or B:=B*A, A triangular, B rectangular.
</li>
<li>
<b>DTRSM</b> solves A*X=alpha*B or X*A=alpha*B, for triangular A, rectangular B.
</li>
<li>
<b>SGEMM</b> computes C = alpha * A * B and related operations.
</li>
<li>
<b>STRMM</b> performs B:=A*B or B:=B*A, A triangular, B rectangular.
</li>
<li>
<b>STRSM</b> solves A*X=alpha*B or X*A=alpha*B, for triangular A, rectangular B.
</li>
</ul>
</p>
<p>
You can go up one level to <a href = "../cpp_src.html">
the C++ source codes</a>.
</p>
<hr>
<i>
Last revised on 05 April 2014.
</i>
<!-- John Burkardt -->
</body>
<!-- Initial HTML skeleton created by HTMLINDEX. -->
</html>