PRofessional ProfILE

Anastasia Stulova

I have 10+ years of professional experience in compiler development and in leading innovations with primary focus on parallel compilation and acceleration using dedicated hardware e.g. GPUs.

I am responsible for design, development and maintenance of features from the acceleration standards (mainly for GPUs) in the large scale LLVM compiler project where I coordinate work from multiple vendors and also help to steer programming models evolution in alignment with open source tooling in the OpenCL standard maintained by the Khronos Group. I pioneered a new programming language that brings many C++ features to accelerators (such as OpenCL or Vulkan) while keeping backward compatibility with previous standards. In the past I prototyped research tools and defined methodologies for mapping applications onto parallel HW architectures. Such prototypes were later used to bootstrap production tools for compilation and HW/SW co-design of heterogeneous parallel HW platforms. I have presented numerous technical talks, and tutorials as well as published papers, articles and blogs about innovations and development I have been involved in.

While my primary interest revolves around developing compiler stack for emerging parallel hardware architectures and features I am also excited about designing language features and defining programming models to facilitate effective use of new hardware architectures. I am very passionate about innovative applications of compiler technologies including new application domains or HW features, novel code analysis, profiling, transformation and IDE techniques.

Main areas of expertise: compiler design, Clang/LLVM compiler tooling, GPU, dedicated HW accelerators, parallel programming models, C/C++-based language dialects and extensions, multi-core, machine learning.

RECENT EMployment history

2014 - present

Compiler Engineer, Arm, Cambridge, UK (www.arm.com)

  • Designed and co-developed the Mali GPU compiler stack for OpenCL 2.0, OpenCL 3.0 and some features of OpenCL 1.2 e.g. creating user libraries and linking separate compilation units.

  • Co-designed, co-developed and coordinated work on OpenCL features in Clang frontend.

  • Co-designed and co-developed SPIR-V binary generation in Clang/LLVM.

  • Helped steering evolution of the OpenCL standard (in particular OpenCL C) in alignment with open source tooling based on the LLVM infrastructure.

  • Originally designed C++ for OpenCL kernel programming language and co-developed its support in Clang and documentation (now also integrated into Mali and clspv compilers).

  • Co-designed and co-developed various OpenCL standard library features in Mali toolchain and libclcxx GitHub project.

  • Extended address space semantics from embedded C (ISO/IEC TR 18037:2008) to C++ facilitating the support of memory segmented architectures in Clang.

  • Improved Mali support in the TVM compiler to allow compiling wider range of TFlite models.

2009 - 2013

Research Associate, The Institute for Communication Technologies and Embedded Systems (ICE), RWTH Aachen University, Germany (www.rwth-aachen.de)

  • Prototyped tools (as a part of the MAPS research project and later Silexica toolchain) for compiling applications onto heterogeneous embedded multi-core architecture including:

    • C for Process Networks (CPN) dialect (derived from C language) support in Clang.

    • Source-to-source transformation engine based on Clang tooling infrastructure.

    • Eclipse IDE plugins for code analysis.

    • High-level simulation framework based on SystemC for performance estimation.

  • Co-authored CPN language for data streaming applications.

  • Defined formal model of computation based techniques for compiler performance analysis of parallel application execution on heterogeneous parallel HW accelerators.

  • Assisted teaching of international master studies in compilation and electronics design automation topics.

Community roles

2014 - present

Active contributor to the LLVM project (https://www.llvm.org/)

2016 - present

Code owner of OpenCL support in Clang frontend (https://clang.llvm.org/docs/OpenCLSupport.html)

2018 - present

Chair of OpenCL Tooling Technical Subgroup, Khronos Group (https://www.khronos.org/)

2009 - 2012

Co-chair of Design and Simulation Cluster, European Network on High-performance Embedded Architecture and Compilation - HiPEAC (https://www.hipeac.net/)

Education

2006 - 2008

Master of Science in Embedded Systems Design, Advanced Learning and Research Institute, Faculty of Informatics, University of Lugano (in collaboration with ETH Zurich and Politecnico di Milano), Switzerland (www.alari.ch)

2001 - 2006

Diploma in Software Design and Computer Automation, Computer Science Faculty, Ivanovo State Power University, Russia (www.ispu.ru)

Core skills

Programming models and languages: C++, C, OpenCL, Vulkan, Cuda, Java, python, bash, OpenMP, MPI, pthreads, SystemC

Compiler technologies: Clang/LLVM/MLIR, TVM

Development tools: vim, git, svn, cmake, gcc, gdb, pdb, totalview, valgrind, clang tools

Operating systems: Linux, Mac OS, Android, Windows

Awards and Honors

2020

Khronie Award by Khronos Group (https://www.khronos.org/about/khronie-awards)

2006

University Diploma with Honor, Ivanovo State Power University, Russia (www.ispu.ru)

2001

Golden medal for excellent studies, Ivanovo State School N49, Russia

Teaching

2018

Guest Lecture on Challenges in GPU compilers, Course on Modern Compiler Design, Department of Computer Science, Cambridge University, UK

2012

Teaching Assistant, Electronics Design Automation, Faculty of Electrical Engineering, RWTH Aachen University, Germany

2011

Teaching Assistant, Embedded Processor Design, Faculty of Electrical Engineering, RWTH Aachen University, Germany

2009 - 2013

Teaching Assistant, Digital Signal Processor Design Methodologies and Tools, Faculty of Electrical Engineering, RWTH Aachen University, Germany

TALKS, Publications, BLOGS

2022

Tech Talk: The present and future of SPIR-V in LLVM from the perspective of OpenCL, LLVM Workshop at ISC High Performance, Germany (https://hps.vi4io.org/events/2022/llvm)

Poster: Reaching even richer C++ in OpenCL kernels with use of libclcxx, IWOCL, Online (https://youtu.be/XKoMY4VZ3bk, https://doi.org/10.1145/3529538.3529540)

Poster: C++ for OpenCL 2021, IWOCL, Online (https://youtu.be/m2LQDLJe318, https://doi.org/10.1145/3529538.3529981)

Blog: What's new in clang-14 for OpenCL developers?, Online (https://www.khronos.org/blog/whats-new-in-clang-release-14-for-opencl-developers)

2021

Panel Discussions: SPIR-V support in LLVM and Clang, the 2021 LLVM Developer Meeting, Online (https://youtu.be/KTPPVYMfAQE)

Tech Talk: libclcxx: C++ libraries for OpenCL, the 2021 LLVM Developer Meeting, Online (https://youtu.be/DBknc1pRB9E)

Blog: Significant new OpenCL open source tools and resources to be discussed at annual LLVM Developers Meeting, Online (https://www.khronos.org/blog/new-opencl-open-source-tools-and-resources-november-2021)

Poster: Experimenting with C++ Libraries in OpenCL kernel code, IWOCL, Online (https://doi.org/10.1145/3456669.3456675, https://youtu.be/w7XtSSLvzQo)

2020

Tech Talk: the C++ for OpenCL Programming Language, IWOCL, Online (https://doi.org/10.1145/3388333.3388647, https://youtu.be/SR5E3hKVh14)

Blog: Mali SDK supporting compilation of Kernels written in C++ for OpenCL, (https://community.arm.com/arm-community-blogs/b/tools-software-ides-blog/posts/mali-sdk-supporting-compilation-of-kernels)

2019

Tutorial: An overview of Clang, the 2019 US LLVM Developer Meeting, USA (https://youtu.be/5kkMpJpIGYU)

Tech Talk: From C++ for OpenCL to C++ for accelerator devices, the 2019 US LLVM Developer Meeting, USA (https://youtu.be/9a5gQ4wJoxs)

Blog: New and enhanced OpenCL open source tools & resources, Online (https://www.khronos.org/blog/new-and-enhanced-opencl-open-source-tools-resources)

2016

Tech Talk: A journey of OpenCL 2.0 development in Clang, the 2016 European LLVM Developer Meeting, Spain (https://youtu.be/3yzL2loPtgM)

2012

Paper: Throughput Driven Transformations of Synchronous Data Flows for Mapping to Heterogeneous MPSoCs, SAMOS XII, Greece (https://doi.org/10.1109/SAMOS.2012.6404168)

Paper: FIFO Exploration in Mapping Streaming Applications onto the TI OMAP3530 Platform: Case Study and Optimizations, MC SOC, Japan (https://doi.ieeecomputersociety.org/10.1109/MCSoC.2012.15)

2011

Article: Component-Based Waveform Development: The Nucleus Tool Flow for Efficient and Portable Software Defined Radio, Journal on Analog Integrated Circuits and Signal Processing, Issue 2-3 (https://dl.acm.org/doi/10.1007/s10470-011-9670-1)

2010

Tech Talk: C for Process Networks, Map2MPSoC Workshop, Germany (http://www.artist-embedded.org/docs/Events/2010/Map2MPSoC/slides/schuermans-map2mpsoc2010-tue-5.pdf)

Paper: Trace-based KPN composability analysis for mapping simultaneous applications to MPSoC platforms, DATE, Germany (https://doi.ieeecomputersociety.org/10.1109/DATE.2010.5456950)

2009

Tech Talk: A co-simulation framework for MPSoC run-time behavior analysis in early system design, MCC, Sweden (https://www.ice.rwth-aachen.de/fileadmin/Publications/Attachments/stulova09.pdf)

Paper: A high-level virtual platform for early MPSoC software development, CODES-ISSS, USA (https://dl.acm.org/doi/10.1145/1629435.1629438)

Tech Talk: MAPS and a high-level virtual platform (MVP), Map2MPSoC Workshop, Germany (http://www.artist-embedded.org/docs/Events/2009/map2mpsoc/slides/stulova-map2mpsoc2009-tue-5.pdf)

2008

Paper: A protocol for pervasive distributed computing reliability, WiMob, France (https://ieeexplore.ieee.org/document/4654301)

Languages

English (Fluent), Russian (Native), German (Intermediate), Italian (Basic)

CONTACT