publications

Talks

2025

  1. LLVM
    llvm.png
    CuTeDSL, CUTLASS Python DSL Infrastructure
    Guray Ozen
    2025
  2. EuroLLVM
    llvm.png
    Bringing NVIDIA Blackwell support to LLVM and MLIR
    Guray Ozen
    2025

2024

  1. EuroLLVM
    llvm.png
    Zero to Hero: Programming Nvidia Hopper Tensor Core with MLIR’s NVGPU Dialect
    Guray Ozen
    2024
  2. CGO
    cgo.jpg
    Targeting NVIDIA Hopper using MLIR
    Guray Ozen
    2024

2021

  1. NASA
    nasa2.png
    Thinking OpenMP with NVIDIA HPC Compilers
    Guray Ozen
    2021
  2. GTC
    gtc.png
    Accelerating Applications for NERSC’s Perlmutter Supercomputer Using OpenMP and NVIDIA’s HPC SDK
    Christopher Daley and Guray Ozen
    2021

2018

  1. SC
    sc18.png
    Compiler and Runtime Based Parallelization and Optimization for GPUs
    Guray Ozen, Eduard Ayguade, and Jesus Labarta
    2018

2017

  1. GTC
    gtc.png
    Multi-target Task-Based Programming Model Exploiting OpenACC GPU Kernel
    Guray Ozen
    2017

Papers

2024

  1. ICML
    icml.png
    NVDSL: Simplifying Tensor Cores with Python-Driven MLIR Metaprogramming
    Guray Ozen
    In ESFOMO Workshop at ICML, 2024

2022

  1. CC
    cc22.jpg
    Performant Portable OpenMP
    Guray Ozen and Michael Wolfe
    In Proceedings of the 31st ACM SIGPLAN International Conference on Compiler Construction, 2022

2021

  1. SC
    sc21.png
    Non-Recurring Engineering (NRE) Best Practices: A Case Study with the NERSC/NVIDIA OpenMP Contract
    Christopher Daley, Annemarie Southwell, Rahulkumar Gayatri, and 4 more authors
    In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2021

2018

  1. LLVM-HPC
    sc18.png
    OpenMP GPU Offload in Flang and LLVM
    Guray Ozen, Simone Atzeni, Michael Wolfe, and 2 more authors
    In LLVM-HPC Workshop at SC, 2018
  2. PhD
    Compiler and Runtime Based Parallelization and Optimization for GPUs
    Guray Ozen
    Universitat Politècnica de Catalunya (UPC), 2018
    Excellent Cum Laude

2016

  1. PACT
    Collective Dynamic Parallelism for Directive Based GPU Programming Languages and Compilers
    Guray Ozen, Eduard Ayguade, and Jesus Labarta
    In International Conference on Parallel Architectures and Compilation Techniques (PACT), 2016
  2. LLVM-HPC
    Offloading Support for OpenMP in Clang and LLVM
    Samuel Antao, Alexey Bataev, Arpith Jacob, and 10 more authors
    In LLVM-HPC Workshop at SC, 2016
  3. IWOMP
    Multiple Target Task Sharing Support for the OpenMP Accelerator Model
    Guray Ozen, Sergi Mateo, Eduard Ayguade, and 2 more authors
    In International Workshop on OpenMP (IWOMP), 2016

2015

  1. WACCPD
    sc15.png
    Exploring Dynamic Parallelism in OpenMP
    Guray Ozen, Eduard Ayguade, and Jesus Labarta
    In Workshop on Accelerator Programming using Directives (WACCPD) at SC, 2015

2014

  1. IWOMP
    On the Roles of the Programmer, the Compiler and the Runtime System When Programming Accelerators in OpenMP
    Guray Ozen, Eduard Ayguade, and Jesus Labarta
    In International Workshop on OpenMP (IWOMP), 2014