[Developers] A possible GPU project
dave fournier
davef at otter-rsch.com
Sun Apr 8 07:32:24 PDT 2012
On 12-04-07 07:15 PM, Matthew Supernaw wrote:
Unfortunately looking at the code in newfmin.cpp more carefully the
main o(n^2) loop does not look parallelizable.
int iu=n;
int iv=2*n;
int ib=3*n;
for (int j=2;j<=n;j++)
{
double * pd=&(h.elem(j,1));
double * qd=&(w.elem(iu+j));
double * rd=&(w.elem(iv+1));
double * sd=&(w.elem(ib+1));
for (int i=1;i<j;i++)
{
*qd-=*pd * *rd++;
*pd++ +=*sd++ * *qd;
}
}
> Dave,
> Great idea! Would you use opencl or cuda? I believe double precision is a add on for opencl, not sure about cuda.
> Matthew
>
>
>
> On Apr 6, 2012, at 3:00 PM, developers-request at admb-project.org wrote:
>
>> A possible GPU project
> _______________________________________________
> Developers mailing list
> Developers at admb-project.org
> http://lists.admb-project.org/mailman/listinfo/developers
>
More information about the Developers
mailing list