Re: Drop in sgemm
Greetings! The last stuff I released to Clint can still be found at
http://master.debian.org/~camm/nblas.tar.gz
The files unpack cleanly into the new beta ATLAS developer's version at
Clint's site. A few minor edits to my routines are needed for the
latest gcc, though; I described these earlier on the list. Briefly,
one needs to replace 'static __inline__ void' with simply 'void' in
dpa.h, ga.h, and maa.h, and to replace the ATL_sger_nx32.c file in
include with the almost identical one below. I hope to make a new
tarball soon, once the double precision stuff is finished.
Take care,
=============================================================================
ATL_sger_nx32.c
=============================================================================
#include <stdio.h>
#include <stdlib.h>
#define Mjoin(a,b) mjoin(a,b)
#define mjoin(a,b) a ## b
#define EXT5 5g
#define EXT4 4g
#define EXT3 3g
#define EXT2 2g
#define EXT1 1g
#define NDP 5
#define EXT EXT5
#include "ga.h"
#undef NDP
#define NDP 4
#undef EXT
#define EXT EXT4
#include "ga.h"
#undef NDP
#define NDP 3
#undef EXT
#define EXT EXT3
#include "ga.h"
#undef NDP
#define NDP 2
#undef EXT
#define EXT EXT2
#include "ga.h"
#undef NDP
#define NDP 1
#undef EXT
#define EXT EXT1
#include "ga.h"
#undef NDP
#define NDP NDPM
#undef EXT
#define EXT Mjoin(Mjoin(NDP,g),m)
#include "ga.h"
void
ATL_sger1_a1_x1_yX(int m,int n,float alpha,const float *c,int cinc,
                   const float *b,int binc,float *a,int lda) {
  int i,mm,nn;
  const float *ae;
  ae=a+n*lda;     /* one past the last column of a */
  nn=STRIDE*lda;  /* STRIDE columns of a, in elements */
#if NDPM == 1
  for (;a<ae;a+=lda,b+=binc)
    Mjoin(g,EXT)(b,STRIDE,a,nn,c,m);
#else
  /* Full blocks: each kernel call handles NDPM columns spaced nn apart */
  while (a+NDPM*nn<=ae) {
    for (i=0;i<STRIDE;i++,a+=lda,b+=binc)
      Mjoin(g,EXT)(b,STRIDE*binc,a,nn,c,m);
    a+=(NDPM-1)*nn;
    b+=(NDPM-1)*STRIDE*binc;
  }
  /* Remainder: pick the kernel matching the number of columns left */
  for (i=0;a<ae && i<STRIDE;i++,a+=lda,b+=binc) {
    /* mm = ceil(remaining columns / STRIDE) */
    mm=(ae-a)/nn;
    if (((ae-a)/lda)%STRIDE)
      mm++;
    if (mm == 1)
      Mjoin(g,EXT1)(b,STRIDE,a,nn,c,m);
    else if (mm == 2)
      Mjoin(g,EXT2)(b,STRIDE,a,nn,c,m);
    else if (mm == 3)
      Mjoin(g,EXT3)(b,STRIDE,a,nn,c,m);
    else if (mm == 4)
      Mjoin(g,EXT4)(b,STRIDE,a,nn,c,m);
    else if (mm == 5)
      Mjoin(g,EXT5)(b,STRIDE,a,nn,c,m);
  }
#endif
}
=============================================================================
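For anyone puzzled by the Mjoin/mjoin pair in the listing above, here is
a minimal standalone sketch of why the paste goes through two macros.
The two function bodies and their return values are made up purely for
illustration; only the two macro definitions are taken from the file.
With the indirection, EXT is expanded to its value before ## is applied;
pasting directly uses the literal token EXT.

```c
#define Mjoin(a,b) mjoin(a,b)   /* expands a and b first, then pastes */
#define mjoin(a,b) a ## b       /* pastes its arguments as written    */

#define EXT 3g

/* Two-level paste: EXT becomes 3g first, so this defines g3g(). */
int Mjoin(g,EXT)(void) { return 3; }

/* Direct paste: EXT is not expanded, so this defines gEXT(). */
int mjoin(g,EXT)(void) { return -1; }
```

This is what lets the same header be included several times with EXT
redefined in between, producing a differently named routine each time.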
Doug ABERDEEN <[email protected]> writes:
> Hi guys,
>
> Some time ago there was a discussion of drop in gemv/ger. Camm was
> working on SSE GEMV/GER, and atlas_goto.tgz was an example of a drop
> in gemm. A new developer release with this stuff was on the way?
>
> I haven't heard anything for a while. I've got time now to
> incorporate my SSE SGEMM into ATLAS. Of course I'd like to do this
> on a bugfixed dist with a couple of examples to work from. Is there any
> chance of getting a copy of a release with the SSE GEMV/GER stuff in
> it and the fixes that Clint and Camm described on this list?
>
> Otherwise I'll work with the goto version.
>
> --
> -Doug -- http://beaker.anu.edu.au, Ph:(02) 6279-8608, Fax:(02) 6279-8651
>
>
--
Camm Maguire [email protected]
==========================================================================
"The earth is but one country, and mankind its citizens." -- Baha'u'llah