Computing Platforms

Content

The seminar will cover core concepts and ideas in the general area of computer systems, ranging from software and hardware architectures to system design for operating systems, data processing systems, and distributed systems. The focus will be on fundamental ideas that apply across systems and application areas but with an emphasis on those ideas that apply to cloud platforms and hardware accelerators.

Format

The seminar will consist on student presentations based on a list of papers that will be provided at the beginning of the course. Presentations will be arranged in slots of 20 minutes talk plus 10 minutes questions. Grades will be assigned based on quality of the presentation, coverage of the topic including material not in the original papers, participation during the seminar, and ability to understand, present, and criticize the underlying technology.

Seminar Hours

Mondays, 4-6pm, at CHN D 44. The first seminar will be on February 19th.

Lecturers

  • Prof. Gustavo Alonso
  • Dr. Michael Giardino

Teaching Assistant

  • Michael Wawrzoniak ()

Schedule

Papers

You may need to click on the links from within the ETH network (via VPN) to get the full-text papers.

1. Michael Armbrust, Armando Fox, Rean Griffith, Anthony D. Joseph, Randy Katz, Andy Konwinski, Gunho Lee, David Patterson, Ariel Rabkin, Ion Stoica, Matei Zaharia (2010) A View of Cloud Computing. In: CACM. [external page Link]

2. Barroso, L., Marty, M., Patterson, D., & Ranganathan, P. (2017). Attack of the killer microseconds. In: CACM. [external page Link]

3. Primorac, M., Bugnion, E., & Argyraki, K. (2017). How to measure the killer microsecond. In: CCR. [external page Link]

4. Delimitrou, C., & Kozyrakis, C. (2018). Amdahl’s law for tail latency: Queueing theoretic models can guide design trade-​offs in systems targeting tail latency, not just average performance. In: CACM. [external page Link]

5. Huaicheng Li , Daniel S. Berger, Stanko Novakovic , Lisa Hsu , Dan Ernst , Pantea Zardoshti, Monish Shah , Samir Rajadnya , Scott Lee , Ishwar Agarwal , Mark D. Hill , Marcus Fontoura , Ricardo Bianchini. (2023) Pond: CXL-Based Memory Pooling Systems for Cloud Platforms. In: ASPLOS. [external page Link

6. Hasan Al Maruf, Hao Wang, Abhishek Dhanotia, Johannes Weiner, Niket Agarwal, Pallab Bhattacharya, Chris Petersen, Mosharaf Chowdhury, Shobhit Kanaujia, Prakash Chauhan. (2023) TPP: Transparent Page Placement for CXL-Enabled Tiered-Memory. In: ASPLOS. [external page Link]

7. Z Guo, Y Shan, X Luo, Y Huang, Y Zhang. Clio: A Hardware-Software Co-Designed Disaggregated Memory System. ASPLOS, 2022 [external page Link]

8. Q Zhang, P Bernstein, D Berger, B Chandramouli. Redy: Remote Dynamic Memory Cache. VLDB, 2022. [external page Link]

9. Shafer, J., Rixner, S., & Cox, A. L. (2010). The Hadoop distributed file system: Balancing portability and performance. In: ISPASS. [external page Link1][external page Link2]

10. Burrows, M. (2006). The Chubby lock service for loosely-​coupled distributed systems. In: OSDI. [external page Link]

11. Hunt, P., Konar, M., Junqueira, F. P., & Reed, B. (2010). ZooKeeper: Wait-​free coordination for internet-​scale systems. In: USENIX ATC. [external page Link]

12. DeCandia, G., Hastorun, D., Jampani, M., Kakulapati, et al. (2007). Dynamo: Amazon’s Highly Available Key-​value Store. In: SIGOPS. [external page Link]

13. Corbett, J. C., Dean, J., Epstein, M., et al. (2012). Spanner: Google’s Globally-​Distributed Database. In: OSDI. [external page Link]

14. Bacon, D. F., Bales, N., Bruno, N., et al. (2017). Spanner: Becoming a SQL system. In: SIGMOD. [external page Link]

15. Armbrust, M., Ghodsi, A., Zaharia, M., et al. (2015). Spark SQL: Relational Data Processing in Spark. In: SIGMOD. [external page Link]

16. Chen, G. J., Wiener, J. L., Iyer, S., Jaiswa, et al. (2016). Realtime Data Processing at Facebook. In: SIGMOD. [external page Link]

17. Verbitski et al (2017) Amazon Aurora: Design Considerations for High Throughput Cloud-Native Relational Databases. In: SIGMOD '17. [external page Link]

18. Wang et al. (2023) PolarDB-IMCI: A Cloud-Native HTAP Database System at Alibaba. In: SIGMOD'23. [external page Link]

19. Lasse Thostrup, Gloria Doci, Nils Boeschen, Manisha Luthra, Carsten Binnig. Distributed GPU Joins on Fast RDMA-capable Networks. SIGMOD 2023 [external page Link]

20. Fang, Yuanwei and Zou, Chen and Chien, Andrew A. Accelerating raw data analysis with the ACCORDA software and hardware architecture. VLDB 2019 [external page Link]

21. Harshit Saokar, Soteris Demetriou, Nick Magerko, Max Kontorovich, Josh Kirstein, and Margot Leibold, Dimitrios Skarlatos, Hitesh Khandelwal and Chunqiang Tang. (2023) ServiceRouter: Hyperscale and Minimal Cost Service Mesh at Meta. In OSDI'23. [external page Link]

22. Johann Schleier-Smith, Vikram Sreekanti, Anurag Khandelwal, Joao Carreira, Neeraja J. Yadwadkar, Raluca Ada Popa, Joseph E. Gonzalez, Ion Stoica, David A. Patterson. What Serverless Computing Is and Should Become: The Next Phase of Cloud Computing. In: CACM. [external page Link]

23. Saxena et al. (2023). The Story of AWS Glue. In: VLDB'23. [external page Link]

24. Shankar, V., Krauth, K., Vodrahalli, K., Pu, Q., et al. (2020). Serverless linear algebra. In: SoCC. [external page Link]

25. Klimovic, A., Wang, Y., Stuedi, P., et al. (2018). Pocket: Elastic Ephemeral Storage for Serverless Analytics. In: OSDI. [external page Link]

26. S Fouladi, F Romero et al. From Laptop to Lambda: Outsourcing Everyday Jobs to Thousands of Transient Functional Containers. ATC, 2019. [external page Link]

27. Ao Wang, Jingyuan Zhang, Xiaolong Ma, Ali Anwar, Lukas Rupprecht, Dimitrios Skourtis, Vasily Tarasov, Feng Yan, Yue Cheng (2020). INFINICACHE: exploiting ephemeral serverless functions to build a cost-effective memory cache. In FAST'20. [external page Link]

 

Invited Talks

TBD

Presentations Tips

 

JavaScript has been disabled in your browser