Doc fixes

mergeparmatch
Salvatore Filippone 4 years ago
parent 47acde313f
commit 8bf1e30d66

Binary file not shown.

@ -31,7 +31,7 @@ class="cmr-12">GPU example</span></h4>
<!--l. 422--><p class="noindent" ><span
class="cmr-12">The code discussed here shows how to set up a program exploiting the combined GPU</span>
<span
class="cmr-12">capabilities of PSBLAS and AMG4PSBLAS. The code example is availabile in the</span>
class="cmr-12">capabilities of PSBLAS and AMG4PSBLAS. The code example is available in the</span>
<span
class="cmr-12">source distribution directory </span><span class="obeylines-h"><span class="verb"><span
class="cmtt-12">amg4psblas/tests/gpu</span></span></span><span
@ -83,11 +83,13 @@ class="content">setup of a GPU-enabled test program part one.</span></div><!--te
<!--l. 481--><p class="indent" > <span
class="cmr-12">We then have to initialize the GPU environment, and pass the appropriate MOLD</span>
<span
class="cmr-12">variables to the build methods</span>
class="cmr-12">variables to the build methods (see also the PSBLAS and PSBLAS-EXT users&#8217;</span>
<span
class="cmr-12">guides).</span>
<!--l. 483--><p class="indent" > <a
<!--l. 484--><p class="indent" > <a
id="x16-15002r6"></a><hr class="float"><div class="float"
>
@ -95,7 +97,7 @@ class="cmr-12">variables to the build methods</span>
<div class="center"
>
<!--l. 499--><p class="noindent" >
<!--l. 500--><p class="noindent" >
<div class="minipage"><pre class="verbatim" id="verbatim-11">
&#x00A0;&#x00A0;call&#x00A0;psb_init(ctxt)
&#x00A0;&#x00A0;call&#x00A0;psb_info(ctxt,iam,np)
@ -109,7 +111,7 @@ class="cmr-12">variables to the build methods</span>
&#x00A0;&#x00A0;call&#x00A0;prec%smoothers_build(a,desc_a,info,&#x00A0;amold=agmold,&#x00A0;vmold=vgmold,&#x00A0;imold=igmold)
&#x00A0;
</pre>
<!--l. 514--><p class="nopar" ></div></div>
<!--l. 515--><p class="nopar" ></div></div>
<br /> <div class="caption"
><span class="id">Listing 6: </span><span
class="content">setup of a GPU-enabled test program part two.</span></div><!--tex4ht:label?: x16-15002r6 -->
@ -117,16 +119,18 @@ class="content">setup of a GPU-enabled test program part two.</span></div><!--te
</div><hr class="endfloat" />
<!--l. 521--><p class="indent" > <span
class="cmr-12">Finally, we convert the input matrix, the descriptor and the vectors, then</span>
<!--l. 522--><p class="indent" > <span
class="cmr-12">Finally, we convert the input matrix, the descriptor and the vectors to use a</span>
<span
class="cmr-12">GPU-enabled internal storage format. We then preallocate the preconditioner</span>
<span
class="cmr-12">preallocate the preconditioner workspace before entering the Krylov method. At the</span>
class="cmr-12">workspace before entering the Krylov method. At the end of the code, we close the</span>
<span
class="cmr-12">end of the code, we close the GPU environment</span>
class="cmr-12">GPU environment</span>
<!--l. 524--><p class="indent" > <a
<!--l. 526--><p class="indent" > <a
id="x16-15003r7"></a><hr class="float"><div class="float"
>
@ -134,7 +138,7 @@ class="cmr-12">end of the code, we close the GPU environment</span>
<div class="center"
>
<!--l. 553--><p class="noindent" >
<!--l. 555--><p class="noindent" >
<div class="minipage"><pre class="verbatim" id="verbatim-12">
&#x00A0;&#x00A0;call&#x00A0;desc_a%cnv(mold=igmold)
&#x00A0;&#x00A0;call&#x00A0;a%cscnv(info,mold=agmold)
@ -158,7 +162,7 @@ class="cmr-12">end of the code, we close the GPU environment</span>
&#x00A0;&#x00A0;stop
&#x00A0;
</pre>
<!--l. 580--><p class="nopar" ></div></div>
<!--l. 582--><p class="nopar" ></div></div>
<br /> <div class="caption"
><span class="id">Listing 7: </span><span
class="content">setup of a GPU-enabled test program part three.</span></div><!--tex4ht:label?: x16-15003r7 -->
@ -166,7 +170,7 @@ class="content">setup of a GPU-enabled test program part three.</span></div><!--
</div><hr class="endfloat" />
<!--l. 588--><p class="indent" > <span
<!--l. 590--><p class="indent" > <span
class="cmr-12">It is very important to employ solvers that are suited to the GPU, i.e. solvers that</span>
<span
class="cmr-12">do NOT employ triangular system solve kernels. Solvers that satisfy this constraint</span>
@ -184,7 +188,7 @@ class="cmtt-12">INVT</span></span></span>
</li>
<li class="itemize"><span class="obeylines-h"><span class="verb"><span
class="cmtt-12">AINV</span></span></span></li></ul>
<!--l. 597--><p class="noindent" ><span
<!--l. 599--><p class="noindent" ><span
class="cmr-12">and their </span><span class="obeylines-h"><span class="verb"><span
class="cmtt-12">L1</span></span></span> <span
class="cmr-12">variants.</span>

@ -324,8 +324,8 @@ class="td11"><!--l. 124--><p class="noindent" > </td><td style="white-spac
class="td11"><!--l. 124--><p class="noindent" ><span
class="cmr-12">An auxiliary input argument that can be passed to the underlying</span>
<span
class="cmr-12">objects.</span> </td>
</tr></table></div>
class="cmr-12">objects.</span> </td>
</tr></table></div>
<!--l. 129--><p class="noindent" >
<!--l. 134--><p class="indent" > <span
class="cmr-12">A variety of preconditioners can be obtained by setting the appropriate</span>

@ -421,7 +421,7 @@ call P%build(A,desc_A,info)
The code discussed here shows how to set up a
program exploiting the combined GPU capabilities of PSBLAS and
AMG4PSBLAS. The code example is availabile in the source distribution
AMG4PSBLAS. The code example is available in the source distribution
directory \verb|amg4psblas/tests/gpu|.
First of all, we need to include the appropriate modules and
@ -479,7 +479,8 @@ program amg_d_pde3d
\end{listing}
We then have to initialize the GPU environment, and pass the
appropriate MOLD variables to the build methods
appropriate MOLD variables to the build methods (see also the PSBLAS
and PSBLAS-EXT users' guides).
\begin{listing}[h!]
\ifpdf
\begin{minted}[breaklines=true,bgcolor=bg,fontsize=\small]{fortran}
@ -518,8 +519,9 @@ appropriate MOLD variables to the build methods
\fi
\caption{setup of a GPU-enabled test program part two.\label{fig:gpu-ex2}}
\end{listing}
Finally, we convert the input matrix, the descriptor and the vectors,
then preallocate the preconditioner workspace before entering the
Finally, we convert the input matrix, the descriptor and the vectors
to use a GPU-enabled internal storage format.
We then preallocate the preconditioner workspace before entering the
Krylov method. At the end of the code, we close the GPU environment
\begin{listing}[h!]
\ifpdf

Loading…
Cancel
Save