References

K. E. Batcher (1968): Sorting Networks and their Applications. In: Proceedings of the April 30–May 2, 1968, Spring Joint Computer Conference, AFIPS '68 (Spring). ACM, New York, NY, USA, pp. 307–314, doi:10.1145/1468075.1468121.
Gianfranco Bilardi & Alexandru Nicolau (1989): Adaptive Bitonic Sorting: An Optimal Parallel Algorithm for Shared-memory Machines. SIAM J. Comput. 18(2), pp. 216–228, doi:10.1137/0218014.
Manuel M.T. Chakravarty, Gabriele Keller, Sean Lee, Trevor L. McDonell & Vinod Grover (2011): Accelerating Haskell Array Codes with Multicore GPUs. In: Proceedings of the Sixth Workshop on Declarative Aspects of Multicore Programming, DAMP '11. ACM, New York, NY, USA, pp. 3–14, doi:10.1145/1926354.1926358.
Mischa Dieterle (2016): Structured Parallelism by Composition. Philipps-Universität Marburg, doi:10.17192/z2016.0107.
Mischa Dieterle, Thomas Horstmeyer & Rita Loogen (2010): Skeleton Composition Using Remote Data. In: Manuel Carro & Ricardo Peña: Practical Aspects of Declarative Languages, Lecture Notes in Computer Science 5937. Springer Berlin Heidelberg, pp. 73–87, doi:10.1007/978-3-642-11503-5_8.
The MPI Forum (2012): MPI: A Message-Passing Interface Standard. Technical Report, Knoxville, TN, USA. Available at https://www.mpi-forum.org/docs/mpi-3.0/mpi30-report.pdf.
Ian Foster (1995): Designing and Building Parallel Programs: Concepts and Tools for Parallel Software Engineering. Addison-Wesley Longman Publishing Co., Inc., Boston, MA, USA. Available at http://www.mcs.anl.gov/~itf/dbpp/text/book.html.
Naga Govindaraju, Jim Gray, Ritesh Kumar & Dinesh Manocha (2006): GPUTeraSort: High Performance Graphics Co-processor Sorting for Large Database Management. In: Proceedings of the 2006 ACM SIGMOD International Conference on Management of Data, SIGMOD '06. ACM, New York, NY, USA, pp. 325–336, doi:10.1145/1142473.1142511.
Naga K. Govindaraju, Nikunj Raghuvanshi & Dinesh Manocha (2005): Fast and Approximate Stream Mining of Quantiles and Frequencies Using Graphics Processors. In: Proceedings of the 2005 ACM SIGMOD International Conference on Management of Data, SIGMOD '05. ACM, New York, NY, USA, pp. 611–622, doi:10.1145/1066157.1066227.
Peter Kipfer & Rüdiger Westermann (2005): Improved GPU Sorting. In: Matt Pharr: GPU Gems 2: Programming Techniques for High-Performance Graphics and General-Purpose Computation. Addison-Wesley, pp. 733–746. Available at https://developer.nvidia.com/gpugems/GPUGems2/gpugems2_chapter46.html.
Donald E. Knuth (1998): The Art of Computer Programming: Sorting and Searching, 2nd edition, The Art of Computer Programming 3. Addison Wesley Longman Publishing Co., Inc., Redwood City, CA, USA.
Xiaobo Li, Paul Lu, Jonathan Schaeffer, John Shillington, Pok Sze Wong & Hanmao Shi (1993): On the Versatility of Parallel Sorting by Regular Sampling. Parallel Computing 19(10), pp. 1079–1103, doi:10.1016/0167-8191(93)90019-H.
Rita Loogen (2012): Eden — Parallel Functional Programming with Haskell. In: Viktória Zsók, Zoltán Horváth & Rinus Plasmeijer: Proceedings of the 4th Summer School Conference on Central European Functional Programming School, CEFP '11, Lecture Notes in Computer Science 7241. Springer-Verlag, Berlin, Heidelberg, pp. 142–206, doi:10.1007/978-3-642-32096-5_4.
Rita Loogen, Yolanda Ortega-Mallén & Ricardo Peña-Marí (2005): Parallel Functional Programming in Eden. J. Funct. Program. 15(3), pp. 431–475, doi:10.1017/S0956796805005526.
Welf Löwe (1995): Optimization of PRAM-Programs with Input-Dependent Memory Access. In: Proceedings of the First International Euro-Par Conference on Parallel Processing, Euro-Par '95, LNCS 966. Springer-Verlag, London, UK, pp. 243–254, doi:10.1007/BFb0020469.
Simon Marlow (2010): Haskell 2010 Language Report. Available at http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.179.2870.
Rene Mueller, Jens Teubner & Gustavo Alonso (2012): Sorting networks on FPGAs. The VLDB Journal 21(1), pp. 1–23, doi:10.1007/s00778-011-0232-z.
J. D. Owens, M. Houston, D. Luebke, S. Green, J. E. Stone & J. C. Phillips (2008): GPU Computing. Proceedings of the IEEE 96(5), pp. 879–899, doi:10.1109/JPROC.2008.917757.
Timothy J. Purcell, Craig Donner, Mike Cammarano, Henrik Wann Jensen & Pat Hanrahan (2005): Photon Mapping on Programmable Graphics Hardware. In: ACM SIGGRAPH 2005 Courses, SIGGRAPH '05. ACM, New York, NY, USA, pp. 258–267, doi:10.1145/1198555.1198797.
Erik Sintorn & Ulf Assarsson (2008): Fast Parallel GPU-sorting Using a Hybrid Algorithm. J. Parallel Distrib. Comput. 68(10), pp. 1381–1388, doi:10.1016/j.jpdc.2008.05.012.
H. S. Stone (1971): Parallel Processing with the Perfect Shuffle. IEEE Trans. Comput. 20(2), pp. 153–161, doi:10.1109/T-C.1971.223205.
Joel Svensson, Mary Sheeran & Koen Claessen (2008): Obsidian: A Domain Specific Embedded Language for General-Purpose Parallel Programming of Graphics Processors. In: Proceedings of the 20th International Conference on Implementation and Application of Functional Languages, IFL '08, Lecture Notes in Computer Science. Springer-Verlag, Berlin, Heidelberg, pp. 156–173, doi:10.1007/978-3-642-24452-0_9.
