Publications and Outreach


CEED Documents

Publications

2023

  1. Yichen Guo, Eric de Sturler, and Tim Warburton, Stopping Criteria for the Conjugate Gradient Algorithm in High-Order Finite Element Methods, in review, 2023. Also available as arXiv:2305.10965.

2022

  1. W. Pazner, Tz. Kolev, and C. Dohrmann, Low-order preconditioning for the high-order finite element de Rham complex, in review, 2022. Also available as arXiv:2203.02465.
  2. A. Vargas, T. Stitt, K. Weiss, V. Tomov, J. Camier, Tz. Kolev, and R. Rieben, Matrix-free approaches for GPU acceleration of a high-order finite element hydrodynamics application using MFEM, Umpire, and RAJA, in review, 2022. Also available as arXiv:2112.07075.
  3. N. Chalmers, A. Mishra, D. McDougall, and T. Warburton, HipBone: A performance-portable GPU-accelerated C++ version of the NekBone benchmark, in review, 2022.
  4. J. Chan, H. Ranocha, A. Rueda-Ramirez, G. Gassner, and T. Warburton, On the entropy projection and the robustness of high order entropy stable discontinuous Galerkin schemes for under-resolved flows, in review, 2022.
  5. Tz. Kolev and W. Pazner, Conservative and accurate solution transfer between high-order and low-order refined finite element spaces, SIAM Journal on Scientific Computing, 44(1), A1-A27, 2022. Also available as arXiv:2103.05283.

2021

  1. Morteza H. Siboni and Mark S. Shephard, Adaptive Workflow for Simulation of RF Heaters, Computer Physics Communications, in review, 2021.
  2. A. Abdelfattah, V. Barra, N. Beams, R. Bleile, J. Brown, J. Camier, R. Carson, N. Chalmers, V. Dobrev, Y. Dudouit, P. Fischer, A. Karakus, S. Kerkemeier, Tz. Kolev, Y. Lan, E. Merzari, M. Min, M. Phillips, T. Rathnayake, R. Rieben, T. Stitt, A. Tomboulides, S. Tomov, V. Tomov, A. Vargas, T. Warburton, K. Weiss, GPU Algorithms for Efficient Exascale Discretizations, Special issue of Parallel Computing, 108, 102841, 2021. Also available as arXiv:2109.05072.
  3. Tz. Kolev, P. Fischer, J. Brown, V. Dobrev, J. Dongarra, M. Min, M. Shephard, S. Tomov, T. Warburton, A. Abdelfattah, V. Barra, N. Beams, J.-S. Camier, N. Chalmers, Y. Dudouit, W. Pazner, C. Smith, K. Swirydowicz, J. Thompson and V. Tomov, Efficient Exascale Discretizations: High Order Finite Element Methods, Special issue of The International Journal of High Performance Computing Applications, 35(6), 527-552, 2021. Also available as arXiv:2109.04996.
  4. J. Brown, A. Abdelfattah, V. Barra, N. Beams, J. Camier, V. Dobrev, Y. Dudouit, L. Ghaffari, Tz. Kolev, D. Medina, W. Pazner, T. Ratnayaka, J. Thompson and S. Tomov, libCEED: Fast algebra for high-order element-based discretizations, The Journal of Open Source Software, 2021.

2020

  1. A. Abdelfattah, S. Tomov, J. Dongarra, Matrix multiplication on batches of small matrices in half and half-complex precisions, Journal of Parallel and Distributed Computing, vol. 145, pp. 188-201, 2020.
  2. K. Mittal, S. Dutta, and P. Fischer, Multirate Timestepping for the Incompressible Navier-Stokes Equations in Overlapping Grids, in review, 2020.
  3. R. Anderson, J. Andrej, A. Barker, J. Bramwell, J.-S. Camier, J. Cerveny, V. Dobrev, Y. Dudouit, A. Fisher, Tz. Kolev, W. Pazner, M. Stowell, V. Tomov, I. Akkerman, J. Dahm, D. Medina, and S. Zampini, MFEM: A Modular Finite Element Library, Computers & Mathematics with Applications, in press, 2020. Also available as arXiv:1911.09220.
  4. C. Brown, A. Abdelfattah, S. Tomov, J. Dongarra, Design, Optimization, and Benchmarking of Dense Linear Algebra Algorithms on AMD GPUs, Innovative Computing Laboratory Technical Report, no. ICL-UT-20-12 (to appear in HPEC'20), August, 2020.
  5. C. Brown, A. Abdelfattah, S. Tomov, J. Dongarra, hipMAGMA v2.0.0, July 2, 2020.
  6. E. Merzari, P. Fischer, M. Min, S. Kerkemeier, A. Obabko, D. Shaver, H. Yuan, Y. Yu, J. Martinez, L. Brockmeyer, Toward exascale: overview of large eddy simulations and direct numerical simulations of nuclear reactor flows with the spectral element method in Nek5000, The 18th International Topical Meeting on Nuclear Reactor Thermal Hydraulics (NURETH-18), 2020.
  7. V. Barra, J. Brown, J. Thompson, Y. Dudouit, High-performance operator evaluations with ease of use: libCEED's Python interface, Proc. 19th Python in Science Conf., 85-90, 2020.
  8. P. Fischer, M. Min, T. Rathnayake, S. Dutta, Tz. Kolev, V. Dobrev, J.S. Camier, M. Kronbichler, T. Warburton, K. Swirydowicz, and J. Brown, Scalability of High-Performance PDE Solvers, Int. J. High Perf. Comput. Appl., 34(5), pp. 562-586, 2020.
  9. P. Bello-Maldonado, Tz. Kolev, R. Rieben, and V. Tomov, A Matrix-Free Hyperviscosity Formulation for High-Order ALE Hydrodynamics, Computers & Fluids, (205) 104577, 2020.
  10. H. Hajduk, D. Kuzmin, Tz. Kolev, and R. Abgrall, Matrix-free subcell residual distribution for Bernstein finite elements: Low-order schemes and FCT, Comp. Meth. Appl. Mech. Eng., (200) 104451, 2020.
  11. F. Alexander, A. Almgren, J. Bell, A. Bhattacharjee, J. Chen, P. Colella, D. Daniel, J. DeSlippe, E. Draeger, A. Dubey, T. Dunning, T. Evans, I. Foster, M. Francois, T. Germann, M. Gordon, S. Habib, M. Halappanavar, S. Hamilton, W. Hart, Z. Huang, A. Hungerford, D. Kasen, P. R.C. Kent, T. Kolev, D. B. Kothe, A. Kronfeld, Y. Luo, P. Mackenzie, D. McCallen, B. Messer, S. Mniszewski, C. Oehmen, A. Perazzo, D. Perez, D. Richards, W. J. Rider, R. Rieben, K. Roche, A. Siegel, M. Sprague, C. Steefel, R. Stevens, M. Syamlal, M. Taylor, J. Turner, J. Vay, A. F. Voter, T. Windus, K. Yelick, Exascale Applications: Skin in the Game, Philosophical Transactions of the Royal Society A: Mathematical, Physical, and Engineering Sciences, (378) 20190056, 2020.

2019

  1. A. Abdelfattah, S. Tomov, and J. Dongarra, Fast Batched Matrix Multiplication for Small Sizes using Half-Precision Arithmetic on GPUs, IPDPS'19, Rio de Janeiro, Brazil, 2019.
  2. V. A. Dobrev, T. V. Kolev, C. S. Lee, V. Z. Tomov, and P. S. Vassilevski, Algebraic Hybridization and Static Condensation with Application to Scalable H(div) Preconditioning, SIAM Journal on Scientific Computing, 41(3):B425-B447, 2019.
  3. J. Cerveny, V. Dobrev, and Tz. Kolev, Non-Conforming Mesh Refinement For High-Order Finite Elements, SIAM Journal on Scientific Computing, 41(4):C367-C392, 2019.
  4. V. Dobrev, P. Knupp, Tz. Kolev, and V. Tomov, Towards Simulation-Driven Optimization of High-Order Meshes by the Target-Matrix Optimization Paradigm, Roca X., Loseille A. (eds) 27th International Meshing Roundtable. IMR 2018. Lecture Notes in Computational Science and Engineering, vol 127, pp. 285-302, Springer, 2019.
  5. I. Masliah, A. Abdelfattah, A. Haidar, S. Tomov, M. Baboulin, J. Falcou, and J. Dongarra, Algorithms and optimization techniques for high-performance matrix-matrix multiplications of very small matrices, Parallel Computing, vol. 81, pp. 1-21, 2019.
  6. A. Karakus, N. Chalmers, K. Swirydowicz, T. Warburton, A GPU accelerated discontinuous Galerkin incompressible flow solver, Journal of Computational Physics, 390(1):380-404, 2019.
  7. V. Dobrev, P. Knupp, Tz. Kolev, K. Mittal, V. Tomov, The Target-Matrix Optimization Paradigm for High-Order Meshes, SIAM J. Sci. Comp., 41(1), pp. B50-B68, 2019.
  8. A. Karakus, N. Chalmers, J.S. Hesthaven and T. Warburton, Discontinuous Galerkin Discretizations of the Boltzmann Equations in 2D: semi-analytic time stepping and absorbing boundary layers, Journal of Computational Physics, 390:175-202, 2019.
  9. J. Martinez, Y. Lan, E. Merzari, M. Min, On the use of LES-based turbulent thermal-stress models for rod bundle simulations, Int. J. Heat and Mass Transfer, 142:118399, 2019.
  10. S. Patel, P. Fischer, M. Min, and A. Tomboulides, A Characteristic-based, spectral element method for moving-domain problems, J. Sci. Comp., 79:564-592, 2019.
  11. J. Gong, M. Min, P. Fischer, P. Schlatter, and E. Laure, OpenACC acceleration for the Pn-Pn-2 algorithm in Nek5000, J. Parallel and Distributed Computing, vol. 132, pp. 69-78, 2019.
  12. L. Lu, M. Nazarov, P. Fischer, Nonlinear artificial viscosity for spectral element methods, Comptes Rendus Mathematique, vol 357, pp. 646-654, 2019.
  13. K. Mittal, S. Dutta, and P. Fischer, Nonconforming Schwarz-spectral element methods for incompressible flow, Computers and Fluids, 191(15):104237, 2019.
  14. K. Mittal and P. Fischer, Mesh smoothing for the spectral element method, J. Sci. Comput., 78(2), pp. 1152-1173, 2019.
  15. G.K. Giannakopoulos, C.E. Frouzakis, P. Fischer, A. Tomboulides and K. Boulouchos, LES of the gas-exchange process inside an internal combustion engine using a high-order method, Flow, Turbulence and Combustion, pp. 1-20, 2019.
  16. K. Swirydowicz, N. Chalmers, A. Karakus, T. Warburton, Acceleration of tensor-product operations for high-order finite element methods, Int. J. High Perf. Comput. Appl., 33(4) 735-757, 2019.

2018

  1. N. Chalmers and T. Warburton, Low-Order Preconditioning of High-Order Triangular Finite Elements, SIAM J. Sci. Comput., 40(6), A4040-A4059, 2018.
  2. P. Bello-Maldonado and P. Fischer, Scalable Low-Order Finite Element Preconditioners for High-Order Spectral Element Poisson Solvers, SIAM J. Sci. Comp., 41(5):S2-S18, 2018.
  3. A. Haidar, S. Tomov, and J. Dongarra, Optimizing GPU Kernels for Irregular Batch Workloads: A Case Study for Cholesky Factorization, HPEC'18, 2018.
  4. A. Barker, V. Dobrev, J. Gopalakrishnan and T. Kolev, A scalable preconditioner for a primal discontinuous Petrov-Galerkin method, SIAM J. Sci. Comp., 40(1), pp. B32-B58, 2018.
  5. R. Anderson, V. Dobrev, Tz. Kolev, R. Rieben and V. Tomov, High-Order Multi-Material ALE Hydrodynamics, SIAM J. Sci. Comp., 40(1):B32-B58, 2018.
  6. V. Dobrev, Tz. Kolev, D. Kuzmin, R. Rieben and V. Tomov, Sequential limiting in continuous and discontinuous Galerkin methods for the Euler equations, Journal of Computational Physics, 356:372-390, 2018.

2017

  1. J. Solberg, E. Merzari, H. Yuan, A. Obabko, P. Fischer, S. Lee, J. Lai, M. Delgado, S. J. Lee, and Y. Hassan, High-Fidelity Simulation of Flow Induced Vibrations in Helical Steam Generators for Small Modular Reactors, Best Paper Award at The 17th International Topical Meeting on Nuclear Reactor Thermal Hydraulics (NURETH-17), September, 2017.
  2. I. Masliah, A. Abdelfattah, A. Haidar, S. Tomov, M. Baboulin, J. Falcou, and J. Dongarra, Algorithms and optimization techniques for high-performance matrix-matrix multiplications of very small matrices, Parallel Computing (PARCO), 2017.
  3. K. Raffenetti et al., P. Fischer, M. Min, and P. Balaji, Why is MPI so slow? Analyzing the fundamental limits in implementing MPI-3.1, SC17, 2017.
  4. P. Fischer, M. Schmitt, and A. Tomboulides, Recent developments in spectral element simulations of moving-domain problems, vol. 79, Fields Institute Communications, 213-244, 2017.
  5. S. Lomperski, A. Obabko, P. Fischer, E. Merzari, and W.D. Pointer, Jet stability and wall impingement flow field in a thermal striping experiment, Int. J. Heat Mass Transfer, 115A:1125-1136, 2017.
  6. V. Makarashvili, E. Merzari, A. Obabko, A. Siegel, and P. Fischer, A performance analysis of ensemble averaging for high fidelity turbulence simulations at the strong scaling limit, Comp. Phys. Comm., 219:236-245, 2017.
  7. A. Abdelfattah, A. Haidar, S. Tomov, J. Dongarra, Factorization and Inversion of a Million Matrices using GPUs: Challenges and Countermeasures, Proceedings of the 2017 International Conference on Computational Science, ICCS'17, Zürich, Switzerland, June 12-14, Procedia Computer Science, 2017.
  8. A. Abdelfattah, M. Baboulin, V. Dobrev, J. Dongarra, A. Haidar, I. Karlin, Tz. Kolev, I. Masliah, and S. Tomov, Small Tensor Operations on Advanced Architectures for High-order Applications, Technical report UT-EECS-17-749, EECS Department, University of Tennessee, 2017.
  9. R. Anderson, V. Dobrev, Tz. Kolev, D. Kuzmin, M. Quezada de Luna, R. Rieben and V. Tomov, High-order local maximum principle preserving (MPP) discontinuous Galerkin finite element method for the transport equation, Journal of Computational Physics, 334:102-124, 2017.
  10. Y. Bazilevs, K. Kamran, G. Moutsanidis, D. J. Benson, and E. Oñate, A new formulation for air-blast fluid-structure interaction using an immersed approach. Part I: basic methodology and FEM-based simulations, Computational Mechanics, 1-18, 2017.
  11. A. Abdelfattah, A. Haidar, S. Tomov, and J. Dongarra, Novel HPC Techniques to Batch Execution of Many Variable Size BLAS Computations on GPUs, International Conference on Supercomputing (ICS'17), ACM, Chicago, Illinois, pp. 1-10, June 14-16, 2017.
  12. D. Ibanez and M.S. Shephard, Modifiable Array Data Structures for Mesh Topology, SIAM Journal on Scientific Computing, 39(2):C144-C161, 2017.
  13. B.N. Granzow, M.S. Shephard, and A.A. Oberai, Output-based error estimation and mesh adaptation for variational multiscale methods, Computer Methods in Applied Mechanics and Engineering, Vol. 322, pp. 441-459, 2017.

2016

  1. E. Merzari, A. Obabko, P. Fischer, N. Halford, J. Walker, A. Siegel, and Y. Q. Yu, Large-scale large eddy simulation of nuclear reactor flows: Issues and perspectives, Nuclear Engineering and Design, page 13, Oct., 2016.
  2. E. Merzari, P. Fischer, H. Yuan, K. Van Tichelen, S. Keijers, J. De Ridder, J. Degroote, J. Vierendeels, H. Doolaard, V. R. Gopala, and F. Roelofs, Benchmark exercise for fluid flow simulations in a liquid metal fast reactor fuel assembly, Nuclear Engineering and Design, 298(3):218-228, 2016.
  3. V. Dobrev, Tz. Kolev, R. Rieben and V. Tomov, Multi-material closure model for high-order finite element Lagrangian hydrodynamics, Int. J. Numer. Meth. Fluids, 82(10), pp. 689-706, 2016.
  4. A. Abdelfattah, M. Baboulin, V. Dobrev, J. Dongarra, C. Earl, J. Falcou, A. Haidar, I. Karlin, Tz. Kolev, I. Masliah, S. Tomov, High-performance Tensor Contractions for GPUs, Procedia Computer Science, Volume 80, Pages 108-118, ISSN 1877-0509, 2016.
  5. B.E. Merrill, Y. Peet, P. Fischer, and J. Lottes, A spectrally accurate method for overlapping grid solution of incompressible Navier-Stokes equations, J. Comp. Phys., 307:60-93, 2016.
  6. M. Otten, J. Gong, A. Mametjanov, A. Vose, J. Levesque, P. Fischer, and M. Min, An MPI/OpenACC implementation of a high order electromagnetics solver with GPUDirect communication, The International Journal of High Performance Computing Applications, 30(3):320-334, 2016.
  7. J. Gong, S. Markidis, E. Laure, M. Otten, P. Fischer, M. Min, Nekbone Performance on GPUs with OpenACC and CUDA Fortran Implementations, Special issue on Sustainability on Ultrascale Computing Systems and Applications: Journal of Supercomputing, 2016.
  8. D. A. Ibanez et al., PUMI: Parallel unstructured mesh infrastructure, ACM Transactions on Mathematical Software (TOMS), 42(3), 2016.
  9. C. W. Smith et al., In-memory Integration of Existing Software Components for Parallel Adaptive Unstructured Mesh Workflows, Proceedings of the XSEDE16 Conference on Diversity, Big Data, and Science at Scale, ACM, 2016.
  10. D. Ibanez, I. Dunn, and M.S. Shephard, Hybrid MPI-thread parallelization of adaptive mesh operations, Parallel Computing, 52:133-143, 2016.

2015 and earlier

  1. P. Fischer, K. Heisey, and M. Min, Scaling limits for PDE-based simulation, In 22nd AIAA Computational Fluid Dynamics Conference, AIAA Aviation, AIAA 2015-3049, 2015.
  2. A. Kraus, S. Aithal, A. Obabko, E. Merzari, A. Tomboulides, and P. Fischer, Erosion of a large-scale gaseous stratified layer by a turbulent jet - Simulations with URANS and LES approaches, volume 2, pp. 1448-1461. American Nuclear Society, 2015.
  3. E. Merzari, P. Fischer, and J. Walker, Large-scale simulation of rod bundles: Coherent structure recognition and stability analysis, volume 1. American Society of Mechanical Engineers, 2015.
  4. M. Otten, R. A. Shah, N. F. Scherer, M. Min, M. Pelton, and S. K. Gray, Entanglement of two, three and four plasmonically coupled quantum dots, Physical Review B, 92:125432, 2015.
  5. D. A. May, J. Brown, and L. Le Pourhiet, pTatin3D: High-performance methods for long-term lithospheric dynamics, In Proceedings of SC14: International Conference for High Performance Computing, Networking, Storage and Analysis. ACM, 2014.
  6. R. Anderson, V. Dobrev, Tz. Kolev and R. Rieben, Monotonicity in high-order curvilinear finite element ALE remap, Int. J. Numer. Meth. Fluids, 77(5), pp. 249-273, 2014.
  7. K. Kamran et al., A compressible Lagrangian framework for modeling the fluid-structure interaction in the underwater implosion of an aluminum cylinder, Mathematical Models and Methods in Applied Sciences, 23(02), 2013.
  8. Tz. Kolev and P. Vassilevski, Parallel auxiliary space AMG solver for H(div) problems, SIAM J. Sci. Comp., 34, pp. A3079-A3098, 2012.
  9. V. Dobrev, Tz. Kolev and R. Rieben, High-order curvilinear finite element methods for Lagrangian hydrodynamics, SIAM J. Sci. Comp., 34, pp. B606-B641, 2012.
  10. J. Brown, Efficient nonlinear solvers for nodal high-order finite elements in 3D, Journal of Scientific Computing, 45:48-63, 2010. doi:10.1007/s10915-010-9396-8
  11. Tz. Kolev and P. Vassilevski, Parallel auxiliary space AMG for H(curl) problems, J. Comput. Math., 27, pp. 604-623, 2009.

Presentations

2017

  1. Tz. Kolev and M. Shephard, Conforming & Nonconforming Adaptivity for Unstructured Meshes, Argonne Training Program on Extreme-Scale Computing, Aug 7, 2017.
  2. Tz. Kolev and M. Shephard, Unstructured Mesh Technologies, Argonne Training Program on Extreme-Scale Computing, Aug 7, 2017.
  3. B. Smith, Nonlinear and Krylov Solvers, Argonne Training Program on Extreme-Scale Computing, Aug 7, 2017.
  4. J. Dongarra, Adaptive Linear Solvers and Eigensolvers, Argonne Training Program on Extreme-Scale Computing, Aug 7, 2017.
  5. T. Warburton, An Intro to GPU Architecture and Programming Models, Argonne Training Program on Extreme-Scale Computing, Aug 3, 2017.
  6. S. Parker, Architectures of the Argonne Cray XC40 KNL System "Theta", Argonne Training Program on Extreme-Scale Computing, Jul 31, 2017.
  7. S. Tomov and A. Haidar, MAGMA Tensors and Batched Computing for Accelerating Applications on GPUs, GPU Technology Conference (GTC'17), Session S7728, May 8-11, 2017.
  8. A. Abdelfattah, M. Baboulin, V. Dobrev, J. Dongarra, C. Earl, J. Falcou, A. Haidar, I. Karlin, Tz. Kolev, I. Masliah, S. Tomov, Accelerating Tensor Contractions in High-Order FEM with MAGMA Batched, SIAM Conference on Computational Science and Engineering (SIAM CSE'17), Atlanta, GA, Feb 26-Mar 3, 2017.
  9. P. Fischer, Efficiency of High-Order Methods on the 2nd Generation Intel Xeon Phi Processor, SIAM Conference on Computational Science and Engineering (SIAM CSE'17), Atlanta, GA, Feb 26-Mar 3, 2017.
  10. M. Min, Spectral Element Simulation for Nanowire Solar Cells on HPC Platforms, SIAM Conference on Computational Science and Engineering (SIAM CSE'17), Atlanta, GA, Feb 26-Mar 3, 2017.
  11. C. Smith, G. Diamond and M.S. Shephard, Fast Dynamic Load Balancing Tools for Extreme Scale System, SIAM Conference on Computational Science and Engineering (SIAM CSE'17), Atlanta, GA, Feb 26-Mar 3, 2017.
  12. S. Tendulkar, O. Klaas, M.W. Beall, and M.S. Shephard, Parallel Geometry and Meshing Adaptation with Application to Problems with Evolving Domains, SIAM Conf. on Computational Science and Engineering, Atlanta, GA, Mar 3, 2017.
  13. P. Fischer, CFD, PDEs, and HPC: A 30-Year Perspective, Argonne Training Program on Extreme-Scale Computing, Aug 2, 2016.

Highlights


Other Resources