Publications
2025 |
| Unleashing Parallelism with Elastic-Barriers. Amit Tiwari and V K Nandivada, in the ACM Transactions on Architecture and Code Optimization (TACO), 2025 (to appear).
Preprint here.
|
| IncIDFA: An Efficient and Generic Algorithm for Incremental Iterative Dataflow Analysis. Aman Nougrahiya and V. Krishna Nandivada, (to appear) in the Proceedings of the ACM on Programming Languages (PACMPL - OOPSLA), 2025.
Preprint here.
|
2024 |
|
|
|
|
|
|
|
|
|
|
2023 |
|
|
2021 |
|
|
2020 |
| DisGCo : A Compiler for Distributed Graph Analytics. R S Anchu and V K Nandivada, in the ACM Transactions on Architecture and Code Optimization (TACO), 2020.
Preprint here.
|
| Chunking Loops with non-Uniform Workloads. I K Prabhu and V K Nandivada, in the Proceedings of the International Conference on Supercomputing (ICS), 2020.
Preprint here.
|
| A Study of Graph Analytics for Massive Datasets on Large-Scale Distributed GPUs. V Jatala, R Dathathri, G Gill, L Hoang, V K Nandivada and K Pingali, in the Proceedings of the IEEE International Parallel & Distributed Processing Symposium (IPDPS), 2020.
Preprint here.
|
| Mix Your Contexts Well: Opportunities Unleashed by Recent Advances in Scaling Context-Sensitivity. M Thakur and V K Nandivada, in the Proceedings of the International Conference on Compiler Construction (CC), 2020. Artifact Evaluated (Functional, Validated, Available).
Preprint here.
|
| On the fly MHP Analysis. S Saha and V K Nandivada, in the Proceedings of the Symposium on Principles and Practice of Parallel Programming (PPoPP), 2020. Artifact Evaluated (Available).
Preprint here;
a copy with a couple of errata fixed here.
|
2019 |
| Batch Alias Analysis. J Vedurada and V K Nandivada, in the Proceedings of the International Conference on Automated Software Engineering (ASE), 2019. Preprint here.
|
|
|
| Optimizing Remote Communication in X10. A Thangamani and V K Nandivada, in the ACM Transactions on Architecture and Code Optimization (TACO), Oct 2019. Preprint here.
|
| Gluon-Async: A Bulk-Asynchronous System for Distributed and Heterogeneous Graph Analytics. R Dathathri, G Gill, L Hoang, H Dang, V Jatala, V K Nandivada, M Snir and K Pingali, in the Proceedings of the Parallel Architectures and Compilation Techniques (PACT), 2019. Preprint here.
|
| Efficient Lock-Step Synchronization in Task-Parallel Languages. A Utture and V K Nandivada, in Software: Practice and Experience (SPE), Jul 2019. Preprint here.
|
| Graph Coloring using GPUs. M A Sistla and V K Nandivada, in the Proceedings of the International European Conference on Parallel and Distributed Computing (Euro-Par), 2019. Preprint here.
|
| Precise, Efficient and Secure Just-In-Time Analysis of Java Programs. M Thakur and V K Nandivada, International Workshop on Advances in Open Runtime Technologies and Applications (AORTA), London, UK, July 2019.
|
| PYE: A Framework for Precise-Yet-Efficient Just-In-Time Analyses for Java Programs. M Thakur and V K Nandivada, ACM Transactions on Programming Languages and Systems (TOPLAS), ACM, July 2019. Preprint here.
|
| Efficiency and Expressiveness in UW-OpenMP. R Aloor and V K Nandivada, in the Proceedings of the International Conference on Compiler Construction (CC), 2019. Preprint here;
a copy with a couple of errata fixed here.
|
| Compare less, Defer more: Scaling value-contexts based whole-program heap analyses. M Thakur and V K Nandivada, in the Proceedings of the International Conference on Compiler Construction (CC), 2019. Preprint here.
|
2018 |
| Identifying Refactoring Opportunities for Replacing Type Code with Subclass and State. J Vedurada and V K Nandivada, in the Proceedings of the ACM on Programming Languages (PACMPL - OOPSLA), 2018.
Artifact Evaluated (Functional, Reusable)! Preprint here.
|
| Optimizing Remote Data Transfers in X10. A Thangamani and V K Nandivada, in the Proceedings of the Parallel Architectures and Compilation Techniques (PACT), 2018. Artifact Evaluated (Available, Customizable-reusable, Results replicated)! Preprint here.
|
|
|
| TTLG - An Efficient tensor transposition library for GPUs. J Vedurada, A Suresh, A S Rajam, J Kim, C Hong, S Krishnamoorthy, V K Nandivada, A Panyala, R Srivastava and P Sadayappan, in the Proceedings of the IEEE International Parallel & Distributed Processing Symposium (IPDPS), 2018. Preprint here.
|
2017 |
| Energy Efficient Compilation of Irregular Task-Parallel Loops. R Shrivastava and V K Nandivada, in the ACM Transactions on Architecture and Code Optimization (TACO). Preprint here.
|
| Refactoring Opportunities for Replacing Type Code with State and Subclass. J Vedurada and V K Nandivada, in the Poster Proceedings of the International Conference on Software Engineering, 2017. Preprint here.
|
| Optimizing Recursive Task Parallel Programs. S Gupta, R Shrivastava, and V K Nandivada, in the Proceedings of the International Conference on Supercomputing (ICS), 2017. Preprint here;
a copy with a couple of errata fixed here.
|
2016 |
| Improved MHP Analyses. A Shankar, S S Chakraborty, and V K Nandivada, in the Proceedings of the International Conference on Compiler Construction (CC), 2016. Preprint here.
|
| Lexical State Analyzer for JavaCC grammars. K Gupta and V K Nandivada, Software: Practice and Experience (SPE), 2016. Download LSA here. Preprint here.
|
2015 |
| IMSuite: A Benchmark Suite for Simulating Distributed Algorithms. S Gupta and V K Nandivada, Journal of Parallel and Distributed Computing (JPDC), Elsevier, 2015. Preprint here.
|
| Loop tiling in the presence of exceptions. A Bhandari and V K Nandivada, in the Proceedings of the European Conference on Object-Oriented Programming (ECOOP), 2015. Preprint here.
|
| Unique Worker model for OpenMP. R Aloor and V K Nandivada, in the Proceedings of the International Conference on Supercomputing (ICS), 2015. Preprint here.
|
2014 |
|
|
|
2013 |
|
| IMSuite: A Benchmark Suite for Simulating Distributed Algorithms. Suyash Gupta and V K Nandivada, CoRR arXiv cs.FL/1310.2814, 2013.
|
| Lexical State Analyzer. Kartik Gupta and V K Nandivada, CoRR arXiv cs.FL/1308.3156, 2013.
|
|
|
|
|
2012 |
|
|
|
2011 |
|
|
|
|
|
2010 |
|
|
| Parallelizing Loops in Parallel Programs. Soham S. Chakraborty and V Krishna Nandivada, IBM Research Technical Report RI10007, 2010.
|
|
|
2009 |
|
|
| Efficient, portable implementation of asynchronous multi-place programs. G Bikshandi, J G Castanos, S B Kodali, V K Nandivada, I Peshansky, V A Saraswat, S Sur, P Varma, T Wen, Principles and Practice of Parallel Computing, ACM, 2009. Preprint here.
|
|
|
2008 |
|
|
2007 |
|
|
| Advances in Register Allocation. V K Nandivada, The Compiler Design Handbook: Optimizations and Machine Code Generation, 2nd Edition, CRC Press, 2007.
|
2006 |
|
|
|
|
2005 |
|
|
|
|
2003 |
|
|