K. E. Batcher (1968):
Sorting Networks and their Applications.
In: Proceedings of the April 30–May 2, 1968, Spring Joint Computer Conference,
AFIPS '68 (Spring).
ACM,
New York, NY, USA,
pp. 307–314,
doi:10.1145/1468075.1468121.
Gianfranco Bilardi & Alexandru Nicolau (1989):
Adaptive Bitonic Sorting: An Optimal Parallel Algorithm for Shared-memory Machines.
SIAM J. Comput. 18(2),
pp. 216–228,
doi:10.1137/0218014.
Manuel M.T. Chakravarty, Gabriele Keller, Sean Lee, Trevor L. McDonell & Vinod Grover (2011):
Accelerating Haskell Array Codes with Multicore GPUs.
In: Proceedings of the Sixth Workshop on Declarative Aspects of Multicore Programming,
DAMP '11.
ACM,
New York, NY, USA,
pp. 3–14,
doi:10.1145/1926354.1926358.
Mischa Dieterle (2016):
Structured Parallelism by Composition.
Philipps-Universität Marburg,
doi:10.17192/z2016.0107.
Mischa Dieterle, Thomas Horstmeyer & Rita Loogen (2010):
Skeleton Composition Using Remote Data.
In: Manuel Carro & Ricardo Peña: Practical Aspects of Declarative Languages,
Lecture Notes in Computer Science 5937.
Springer Berlin Heidelberg,
pp. 73–87,
doi:10.1007/978-3-642-11503-5_8.
Ian Foster (1995):
Designing and Building Parallel Programs: Concepts and Tools for Parallel Software Engineering.
Addison-Wesley Longman Publishing Co., Inc.,
Boston, MA, USA.
Available at http://www.mcs.anl.gov/~itf/dbpp/text/book.html.
Naga Govindaraju, Jim Gray, Ritesh Kumar & Dinesh Manocha (2006):
GPUTeraSort: High Performance Graphics Co-processor Sorting for Large Database Management.
In: Proceedings of the 2006 ACM SIGMOD International Conference on Management of Data,
SIGMOD '06.
ACM,
New York, NY, USA,
pp. 325–336,
doi:10.1145/1142473.1142511.
Naga K. Govindaraju, Nikunj Raghuvanshi & Dinesh Manocha (2005):
Fast and Approximate Stream Mining of Quantiles and Frequencies Using Graphics Processors.
In: Proceedings of the 2005 ACM SIGMOD International Conference on Management of Data,
SIGMOD '05.
ACM,
New York, NY, USA,
pp. 611–622,
doi:10.1145/1066157.1066227.
Peter Kipfer & Rüdiger Westermann (2005):
Improved GPU Sorting.
In: Matt Pharr: GPUGems 2: Programming Techniques for High-Performance Graphics and General-Purpose Computation.
Addison-Wesley,
pp. 733–746.
Available at https://developer.nvidia.com/gpugems/GPUGems2/gpugems2_chapter46.html.
Donald E. Knuth (1998):
The Art of Computer Programming: Sorting and Searching,
2 edition,
The Art of Computer Programming 3.
Addison Wesley Longman Publishing Co., Inc.,
Redwood City, CA, USA.
Xiaobo Li, Paul Lu, Jonathan Schaeffer, John Shillington, Pok Sze Wong & Hanmao Shi (1993):
On the Versatility of Parallel Sorting by Regular Sampling.
Parallel Computing 19(10),
pp. 1079–1103,
doi:10.1016/0167-8191(93)90019-H.
Rita Loogen (2012):
Eden — Parallel Functional Programming with Haskell.
In: Viktória Zsók, Zoltán Horváth & Rinus Plasmeijer: Proceedings of the 4th Summer School Conference on Central European Functional Programming School, CEFP 11,
Lecture Notes in Computer Science 7241.
Springer-Verlag,
Berlin, Heidelberg,
pp. 142–206,
doi:10.1007/978-3-642-32096-5_4.
Rita Loogen, Yolanda Ortega-Mallén & Ricardo Peña-Marí (2005):
Parallel Functional Programming in Eden.
J. Funct. Program. 15(3),
pp. 431–475,
doi:10.1017/S0956796805005526.
Welf Löwe (1995):
Optimization of PRAM-Programs with Input-Dependent Memory Access.
In: Proceedings of the First International Euro-Par Conference on Parallel Processing, Euro-Par '95,
LNCS 966.
Springer-Verlag,
London, UK, UK,
pp. 243–254,
doi:10.1007/BFb0020469.
Rene Mueller, Jens Teubner & Gustavo Alonso (2012):
Sorting networks on FPGAs.
The VLDB Journal 21(1),
pp. 1–23,
doi:10.1007/s00778-011-0232-z.
J. D. Owens, M. Houston, D. Luebke, S. Green, J. E. Stone & J. C. Phillips (2008):
GPU Computing.
In: Proceedings of the IEEE 96 5,
pp. 879–899,
doi:10.1109/JPROC.2008.917757.
Timothy J. Purcell, Craig Donner, Mike Cammarano, Henrik Wann Jensen & Pat Hanrahan (2005):
Photon Mapping on Programmable Graphics Hardware.
In: ACM SIGGRAPH 2005 Courses,
SIGGRAPH '05.
ACM,
New York, NY, USA,
pp. 258–267,
doi:10.1145/1198555.1198797.
Erik Sintorn & Ulf Assarsson (2008):
Fast Parallel GPU-sorting Using a Hybrid Algorithm.
J. Parallel Distrib. Comput. 68(10),
pp. 1381–1388,
doi:10.1016/j.jpdc.2008.05.012.
H. S. Stone (1971):
Parallel Processing with the Perfect Shuffle.
IEEE Trans. Comput. 20(2),
pp. 153–161,
doi:10.1109/T-C.1971.223205.
Joel Svensson, Mary Sheeran & Koen Claessen (2008):
Obsidian: A Domain Specific Embedded Language for General-Purpose Parallel Programming of Graphics Processors.
In: Proceedings of the 20th International Conference on Implementation and Application of Functional Languages, IFL '08,
Lecture Notes in Computer Science.
Springer-Verlag,
Berlin, Heidelberg,
pp. 156–173,
doi:10.1007/978-3-642-24452-0_9.