[Developers] A possible GPU project

dave fournier davef at otter-rsch.com
Sun Apr 8 07:32:24 PDT 2012


On 12-04-07 07:15 PM, Matthew Supernaw wrote:


Unfortunately, looking at the code in newfmin.cpp more carefully, the
main O(n^2) loop does not look parallelizable.


    int iu=n;
    int iv=2*n;
    int ib=3*n;
    for (int j=2;j<=n;j++)
    {
       double * pd=&(h.elem(j,1));
       double * qd=&(w.elem(iu+j));
       double * rd=&(w.elem(iv+1));
       double * sd=&(w.elem(ib+1));
       for (int i=1;i<j;i++)
       {
          *qd-=*pd * *rd++;
          *pd++ +=*sd++ * *qd;
       }
    }
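
To make the dependency in the inner loop easier to see, here is a minimal
sketch of that loop for one fixed j, using plain std::vector instead of the
ADMB types. The function name update_row and its parameters are hypothetical
and are not taken from newfmin.cpp. Here q stands for w(iu+j), hrow for the
leading j-1 entries of row j of h, and r and s for the slices of w starting at
iv+1 and ib+1. Each step reads the value of q written by the same step, and
the next step starts from that q, so the inner iterations form a chain. The
sketch only covers the inner loop. It says nothing either way about
distributing the outer loop over j.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical standalone version of the inner loop for one fixed row j.
//   hrow : leading j-1 entries of row j of h (updated in place)
//   q    : starting value of w(iu+j)
//   r, s : slices of w starting at iv+1 and ib+1 (read only);
//          both are assumed to be at least as long as hrow
// Returns the final value of q, i.e. what would be stored back in w(iu+j).
double update_row(std::vector<double>& hrow, double q,
                  const std::vector<double>& r,
                  const std::vector<double>& s)
{
    for (std::size_t i = 0; i < hrow.size(); ++i)
    {
        q -= hrow[i] * r[i];   // *qd -= *pd * *rd++;   uses the old hrow[i]
        hrow[i] += s[i] * q;   // *pd++ += *sd++ * *qd; uses the q just updated
    }
    return q;
}
```

Calling update_row with hrow = {1, 2}, q = 10 and r = s = {1, 1} walks through
two chained steps: the second step starts from the q left by the first.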








> Dave,
> Great idea! Would you use OpenCL or CUDA? I believe double precision is an add-on for OpenCL; I'm not sure about CUDA.
> Matthew
>
>
>
> On Apr 6, 2012, at 3:00 PM, developers-request at admb-project.org wrote:
>
>> A possible GPU project


