J. Su and K. A. Yelick, "Automatic communication performance debugging in PGAS languages," in Languages and Compilers for Parallel Computing: Proc. 20th Intl. Workshop (LCPC 2007). Revised Selected Papers, V. Adve, M. J. Garzaran, and P. Petersen, Eds., Lecture Notes in Computer Science, Vol. 5234, Berlin, Germany: Springer-Verlag, 2008, pp. 232-245.
A. Kamil and K. A. Yelick, "Hierarchical pointer analysis for distributed programs," in Static Analysis: Proc. 14th Intl. Symp. (SAS 2007), H. Riis Nielson and G. File, Eds., Lecture Notes in Computer Science, Vol. 4634, Berlin, Germany: Springer-Verlag, 2007, pp. 281-297.
E. J. Im, I. Bustany, C. Ashcraft, J. Demmel, and K. A. Yelick, "Performance tuning of matrix triple products based on matrix structure," in Applied Parallel Computing: State of the Art in Scientific Computing. Proc. 7th Intl. Workshop (PARA 2004): Revised Selected Papers, J. Dongarra, K. Madsen, and J. Wasniewski, Eds., Lecture Notes in Computer Science, Vol. 3732, Berlin, Germany: Springer-Verlag, 2006, pp. 740-746.
R. Vuduc, A. Gyulassy, J. Demmel, and K. A. Yelick, "Memory hierarchy optimizations and performance bounds for Sparse {A sup T Ax}," in Computational Science: Proc. Intl. Conf. on Computational Science (ICCS 2003), P. M. A. Sloot, D. Abramson, A. V. Bogdanov, J. J. Dongarra, A. Y. Zomaya, and Y. E. Gorbachev, Eds., Lecture Notes in Computer Science, Vol. 2659, Berlin, Germany: Springer-Verlag, 2003, pp. 705-714.
K. Asanović, R. Bodik, J. Demmel, T. Keaveny, K. Keutzer, N. Morgan, D. A. Patterson, K. Sen, J. Wawrzynek, D. Wessel, and K. A. Yelick, "A View of the Parallel Computing Landscape," Communications of the ACM, vol. 52, no. 10, pp. 56-67, Oct. 2009.
J. Demmel, J. Dongarra, V. Eijkhout, E. Fuentes, A. Petitet, R. Vuduc, R. C. Whaley, and K. A. Yelick, "Self-adapting linear algebra algorithms and software," Proc. IEEE, vol. 93, no. 2, pp. 293-312, Feb. 2005.
D. Oppenheimer, A. Brown, J. Beck, D. Hettena, J. Kuroda, N. Treuhaft, D. A. Patterson, and K. A. Yelick, "ROC-1: hardware support for recovery-oriented computing," IEEE Transactions on Computers, vol. 51, no. 2, pp. 100-107, Feb. 2002.
K. A. Yelick, L. Semenzato, G. Pike, C. Miyamoto, B. Liblit, A. Krishnamurthy, P. N. Hilfinger, S. L. Graham, D. Gay, P. Colella, and A. Aiken, "Titanium: A high-performance Java dialect," Concurrency: Practice and Experience, vol. 10, no. 11-13, pp. 825-836, Sep. 1998.
S. Chakrabarti, J. Demmel, and K. A. Yelick, "Models and scheduling algorithms for mixed data and task parallel programs," J. Parallel and Distributed Computing: Special Issue on Dynamic Load Balancing, vol. 47, no. 1, pp. 168-184, Nov. 1997.
C. Kozyrakis, S. Perissakis, D. A. Patterson, T. Anderson, K. Asanović, N. Cardwell, R. Fromm, J. Golbus, B. Gribstad, K. Keeton, R. Thomas, N. Treuhaft, and K. A. Yelick, "Scalable Processors in the Billion Transistor: IRAM," Proceedings IEEE Computer, vol. 30, no. 9, Sep. 1997.
D. A. Patterson, T. Anderson, N. Cardwell, R. Fromm, K. Keeton, C. Kozyrakis, R. Thomas, and K. A. Yelick, "A Case for Intelligent DRAM: IRAM," IEEE Micro, vol. 17, no. 2, pp. 34-44, March 1997.
Articles in conference proceedings
A. Tripathy, K. A. Yelick, and A. Buluç, "Reducing Communication in Graph Neural Network Training," in Proceedings of the 2020 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis, SC'20, 2020.
E. Georganas, R. Egan, S. Hofmeyr, E. Goltsman, B. Arndt, A. Tritt, A. Buluç, L. Oliker, and K. A. Yelick, "Extreme scale de novo metagenome assembly," in Proceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis (SC'18), 2018, pp. 10.
E. Georganas, A. Buluç, J. Chapman, L. Oliker, D. Rokhsar, and K. A. Yelick, "Parallel de bruijn graph construction and traversal for de novo genome assembly," in Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, 2014, pp. 437--448.
E. Georganas, J. Gonz{citeKey, \'a}lez-Dom{\'\i, E. Solomonik, Y. Zheng, J. Touri{\~n}o, and K. A. Yelick, "Communication Avoiding and Overlapping for Numerical Linear Algebra," in SC '12 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis (Supercomputing 2012), 2012.
B. C. Catanzaro, S. A. Kamil, Y. Lee, K. Asanović, J. Demmel, K. Keutzer, J. Shalf, K. A. Yelick, and A. Fox, "SEJITS: Getting productivity and performance with selective embedded JIT specialization," in Proceedings First Workshop on Programming Models for Emerging Architectures, 2009.
J. Demmel, M. Hoemmen, M. Hohiyuddin, and K. A. Yelick, "Avoiding communication in sparse matrix computations," in Proc. 22nd IEEE Intl. Parallel & Distributed Processing Symp. (IPDPS 2008), Piscataway, NJ: IEEE Press, 2008, pp. 12 pg.
K. A. Yelick, D. Bonachea, W. Y. Chen, P. Colella, K. Datta, J. Duell, S. L. Graham, P. Hargrove, P. N. Hilfinger, P. Husbands, C. Iancu, A. Kamil, R. Nishtala, J. Su, M. Welcome, and T. Wen, "Productivity and performance using partitioned global address space languages (Invited Paper)," in Proc. 2007 Intl. Workshop on Parallel Symbolic Computation (PASCO '07), M. M. Maza and S. M. Watt, Eds., New York, NY: The Association for Computing Machinery, Inc., 2007, pp. 24-32.
S. Agarwal, R. Barik, D. Bonachea, V. Sarkar, R. K. Shyamasundar, and K. A. Yelick, "Deadlock-free scheduling of X10 computations with bounded resources," in Proc. 19th Annual ACM Symp. on Parallelism in Algorithms and Architectures (SPAA 2007), New York,NY: The Association for Computing Machinery, Inc., 2007, pp. 229-240.
W. Chen, D. Bonachea, J. Duell, P. Husbands, C. Iancu, and K. A. Yelick, "A performance analysis of the Berkeley UPC compiler," in Proc. 17th Annual Intl. Conf. on Supercomputing, New York, NY: ACM Press, 2003, pp. 63-73.
B. R. Gaeke, P. Husbands, X. S. Li, L. Oliker, K. A. Yelick, and R. Biswas, "Memory-intensive benchmarks: IRAM vs. cache-based machines," in Proc. 16th Intl. Parallel and Distributed Processing Symp., Piscataway, NJ: IEEE Press, 2002, pp. 30-36.
D. Judd, K. A. Yelick, C. Kozyraki, D. Martin, and D. A. Patterson, "Exploiting On-Chip Memory Bandwidth in the VIRAM Compiler," in Revised Papers from the Second International Workshop on Intelligent Memory Systems, IMS, London, UK: Springer-Verlag, 2000, pp. 122 - 134.
R. H. Arpaci-Dusseau, E. Anderson, N. Treuhaft, D. E. Culler, J. M. Hellerstein, D. A. Patterson, and K. A. Yelick, "Cluster I/O with River: Making the fast case common," in Proc. 6th Workshop on I/O in Parallel and Distributed Systems (IOPADS 1999), New York, NY: ACM Press, 1999, pp. 10-22.
D. A. Patterson, K. Asanović, A. Brown, R. Fromm, J. Golbus, B. Gribstad, K. Keeton, C. Kozyrakis, D. Martin, S. Perissakis, R. Thomas, N. Treuhaft, and K. A. Yelick, "Intelligent RAM (IRAM): The Industrial Setting, Applications, and Architectures," in Proceedings of ICCD ‘97 International Conference on Computer Design: VLSI in Computers and Processors, ICCD, IEEE, 1997, pp. 2 - 7.
R. Fromm, S. Perissakis, N. Cardwell, C. Kozyrakis, B. McGaughy, D. A. Patterson, T. Anderson, and K. A. Yelick, "The Energy Efficiency of IRAM Architectures," in Proceedings of the 24th annual international symposium on Computer architecture, ACM SIGARCH Computer Architecture News, Vol. 25, New York, NY: ACM, 1997, pp. 327-337.
D. A. Patterson, T. Anderson, N. Cardwell, R. Fromm, K. Keeton, C. Kozyrakis, R. Thomas, and K. A. Yelick, "Intelligent RAM (IRAM): Chips that Remember and Compute," in Digest of Technical Papers, ISSCC, New York, NY: IEEE, 1997, pp. 224 - 225.
J. Demmel, M. F. Hoemmen, M. Mohiyuddin, and K. A. Yelick, "Avoiding Communication in Computing Krylov Subspaces," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2007-123, Oct. 2007.
K. Asanović, R. Bodik, B. C. Catanzaro, J. J. Gebis, P. Husbands, K. Keutzer, D. A. Patterson, W. L. Plishker, J. Shalf, S. W. Williams, and K. A. Yelick, "The Landscape of Parallel Computing Research: A View from Berkeley," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2006-183, Dec. 2006.
P. N. Hilfinger, D. O. Bonachea, K. Datta, D. Gay, S. L. Graham, B. R. Liblit, G. Pike, J. Z. Su, and K. A. Yelick, "Titanium Language Reference Manual, version 2.19," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2005-15, Nov. 2005.
P. N. Hilfinger, D. Bonachea, D. Gay, S. Graham, B. Liblit, G. Pike, and K. Yelick, "Titanium Language Reference Manual," EECS Department, University of California, Berkeley, Tech. Rep. UCB/CSD-01-1163, Nov. 2001.
B. Liblit, A. Aiken, and K. Yelick, "Data Sharing Analysis for Titanium," EECS Department, University of California, Berkeley, Tech. Rep. UCB/CSD-01-1165, Nov. 2001.
S. Chakrabarti, E. Deprit, E. Im, J. Jones, A. Krishnamurthy, C. Wen, and K. Yelick, "Multipol: A Distributed Data Structure Library," EECS Department, University of California, Berkeley, Tech. Rep. UCB/CSD-95-879, July 1995.