[pdf version is here](As of 2021-Mar-3)
Keynote Presentations
“虎穴に入らずんば虎子を得ず<High Risk, high return /No Risk, no return>: Domain-specific Processors make for Cool Solutions”
Avi Baum (Hailo)
Abstract: In recent years, domain specific architectures are thriving. One main reason that fuels this trend is the prolific domain of machine learning. In this talk I will briefly survey some of the main approaches and a glimpse into theoretical aspects that underlie their suggested benefit. I will share some observations on present and future developments in the field and share my subjective view on about the possible implications on compute architectures.
Avi Baum is CTO and Co-Founder of Hailo. Prior to this, he served as Texas Instruments’ CTO for Wireless Connectivity, working with the company for over a decade. In this role, he established the connected-MCU product line for IoT and IIoT markets and defined the technological roadmap for products in the IoT ecosystem. He also served as a Technical Team Leader in the Israel Defense Forces’ elite technology unit. Avi holds a B.Sc. in Electrical Engineering from the Technion, Israel Institute of Technology.
“Why Preferred Networks Made MN-Core?”
Yusuke Doi (Preferred Networks)
Abstract: At Preferred Networks, we use deep learning as the core of our technology to contribute to various customers, including those in the manufacturing, biotechnology, and healthcare industries. As efficient computation is a critical differentiator in this field, we are also working on high-efficiency computation using MN-Core, an ASIC that we made. Preferred Networks, which initially started with software and algorithm technology as its core, decided to create MN-Core because of how to utilize the power of software in hardware and the economic aspect of computational optimization. In this talk, I will introduce MN-Core backgrounds and targets and the industrial impacts achieved by vertical integration from software to hardware.
Yusuke Doi is Corporate Officer and VP of Computing Infrastructure, Preferred Networks, Inc.
Joined Preferred Networks in 2016. He worked on the design, management, and operation of the computing infrastructure of Preferred Networks.
“Codesign and System of the Supercomputer “Fugaku””
Mitsuhisa Sato (Riken)
Abstract: We have been carrying out the FLAGSHIP 2020 Project to develop the Japanese next-generation flagship supercomputer, “Fugaku”. We have designed an original manycore processor based on Armv8 instruction sets with the Scalable Vector Extension (SVE), an A64FX processor, as well as a system including interconnect and a storage subsystem with the industry partner, Fujitsu. The “co-design” of the system and applications is a key to making it power efficient and high performance. We determined many architectural parameters by reflecting an analysis of a set of target applications provided by applications teams. As a result, the system has been proven to be a very power-efficient system, and it is confirmed that the performance of some target applications using the whole system is more than 100 times the performance of the K computer. In this talk, the pragmatic practice of our co-design effort for “Fugaku” and its performance will be presented as well as an overview of system software.
Mitsuhisa Sato received the M.S. degree and the Ph.D. degree in information science from the University of Tokyo in 1984 and 1990. From 2001, he was a professor of Graduate School of Systems and Information Engineering, University of Tsukuba. He has been working as a director of Center for computational sciences, University of Tsukuba from 2007 to 2013. Since October 2010, he is appointed to the research team leader of programming environment research team in Advanced Institute of Computational Science (AICS), renamed to R-CCS, RIKEN. Since 2014, he is working as a team leader of architecture development team in FLAGSHIP 2020 project to develop Japanese flagship supercomputer “Fugaku” in RIKEN. Since 2018, he is appointed to a deputy Director of RIKEN Center for Computational Science. He is a Professor (Cooperative Graduate School Program) and Professor Emeritus of University of Tsukuba.
“High-Efficiency Inferencing for Scalable Machine Learning”
Art Swift (Esperanto Technologies)
Abstract: The extraordinary market demand for large-scale machine learning solutions requires more than GPUs, FPGAs, or large multiplier arrays. These approaches deliver high performance, but at high costs: high power consumption, prohibitively complicated programming models, and unacceptable inflexibility. Esperanto Technologies CEO Art Swift will describe the architectural approach and design methodology for the company’s first supercomputer-on-chip solution for ML inferencing acceleration. The ET-SoC-1 combines the traditional flexibility and programmability of CPU cores with the high efficiency of autonomous tensor processing to deliver unmatched system-level efficiency and all-layer ML acceleration. Every element of Esperanto’s integrated solution represents best-in-class technology: the simplicity of the RISC-V instruction set, proprietary instruction-set extensions for machine learning, an on-chip mesh interconnect, a uniquely optimized memory hierarchy, state of the art process technology, and custom low-voltage circuits. In this way, Esperanto delivers more performance per watt than existing products without compromising flexibility.
Art Swift has 30 plus years of executive-level experience in the tech and microprocessor industry, including CEO at low power processor chip-maker Transmeta; President of MIPS, a leading provider of microprocessor IP; CEO of Wave Computing, a pioneer in dataflow computing architectures, as well as CEO of nanotech innovator Unidym. Previously, Art served in executive level positions at Cirrus Logic; in Digital Equipment’s Alpha processor group; and at Sun Microsystems, one of the pioneering companies in networked computing and RISC processing.
Invited presentations
“Architectural Challenges in the Era of New Technologies and Extreme Heterogeneity”
Anastasiia Butko (Lawrence Berkeley Nat’l Lab.)
Abstract: As the end of the Moore’s Law is approaching, we enter the era of new technologies and extreme heterogeneity. Novel architectures bring new challenges in their adoption and integration into larger systems. For example, adopting quantum accelerators hinges on building a classical control hardware pipeline that is scalable, extensible, and provides a real-time response. The physical nature of quantum devices creates non-trivial architectural challenges for control hardware that cannot be solved with the existing approaches. In this talk, we address the architectural challenges related to the adoption of novel accelerators and how these challenges can be addressed with the open-source hardware trends.
Anastasiia Butko, Ph.D. is a Research Scientist in the Computational Research Division at Lawrence Berkeley National Laboratory (LBNL), CA. Her research interests lie in the general area of computer architecture, with particular emphasis on high-performance computing, emerging and heterogeneous technologies, associated programming models and architectural simulation techniques. Her primary research projects address architectural challenges in adopting novel technologies to provide continuing performance scaling in the approaching Post-Moores Law era. Dr. Butko is a chief architect of the custom control hardware stack for the Advanced Quantum Tested at LBNL.
“The CMOS image sensor Advance in key technology and the Introduction of Next-generation image sensor”
Akito Kuwabara (Sony Semiconductor Solutions)
Abstract:The CMOS image sensor is widely used not only in video cameras, digital still cameras and smartphones and security cameras, but also in-vehicles and medical, because its productivity and performance has been improved through the development of basic semiconductor technology and stacked structure technology. In particular, the stacked CMOS image sensor has made it possible to mount various processing circuits on sensor edge and has expanded possibilities of the CMOS image sensor. For example, Intelligent Vision Sensor equipped with CNN(Convolutional Neural Network)processing on sensor edge enables high-speed edge AI processing and extraction of only the necessary data(Metadata), which, when using cloud AI processing, reduces data transmission latency, power consumption and communication costs, and protects privacy and confidential information. In this presentation, we will explain the CMOS image sensor advance in key technology and the features of Intelligent Vision Sensor using stacked structure technology and its architecture.
Akito Kuwabara received a bachelor’s degree and a master’s degree in engineering science from Osaka University, Osaka, Japan in 2017 and 2019. He joined the Sony Semiconductor Solutions Corporation in 2019 and engaged in the research of the CMOS image sensor equipped with AI processing functionality.
Panel Discussion
Topics: “Hot” Techs for “Cool” AI Computing: Do We have Enough Tricks?
Art Swift (Esperanto Technologies, USA)
Mitsuhisa Sato (Riken, Japan)
Abstract: It is often mentioned that data is the new oil in the 21st century. Importantly, oil was able to drive industrial revolution only after the advent of combustion engine. By analogy, data can drive AI revolution only after the right silicon engines, i.e., cool chips. The panel will try to discuss hot topics regarding this important role that cool chips should fulfill for AI computing from various aspects.
Masato Motomura graduated and received Ph.D. from Kyoto University. He was a researcher in NEC central research labs, then became a professor at Hokkaido University. Now he is at Tokyo Institute of Technology leading AI computing research unit. He is actively working on reconfigurable and parallel architectures for deep neural networks, machine learning, annealing machines, and intelligent computing in general. He was a recipient of the IEEE JSSC Annual Best Paper Award, the IPSJ Annual Best Paper Award, the IEICE Achievement Award.
Special Sessions (invited lectures)
“Reducing Errors in Quantum Computation via Program Transformation”
Moinuddin Qureshi (Georgia Institute of Technology)
Abstract: Quantum computing promises exponential speedups for an important class of problems. While quantum computers with few dozens of qubits have been demonstrated, these machines suffer from a high rate of gate errors. Such machines are operated in the Noisy Intermediate Scale Quantum (NISQ) mode of computing where the output of the machine can be erroneous. In this talk, I will discuss some of our recent work that aims to improve the reliability of NISQ computers by developing software techniques to mitigate hardware errors. Our first work exploits the variability in the error rates of qubits to steer more operations towards qubits with lower error rates and avoid qubits that are error-prone. Our second work looks at executing different versions of the programs each crafted to cause diverse mistakes so that the machine becomes less vulnerable to correlated errors. Our third work looks at exploiting the state-dependent bias in measurement errors (state 1 is more error-prone than state 0) and dynamically flips the state of the qubit to perform the measurement in the stronger state. We perform our evaluations on real quantum machines from IBM and demonstrate significant improvement in the overall system reliability. Finally, I will also briefly discuss the hardware aspect of designing large-scale quantum computers, including cryogenic processor and cryogenic memory system.
Moinuddin Qureshi is a Professor of Computer Science at the Georgia Institute of Technology. His research interests include computer architecture, memory systems, hardware security, and quantum computing. He is a member of the Hall of Fame of ISCA, Hall of Fame of MICRO, and Hall of Fame of HPCA. His research has been recognized with the best paper awards at MICRO 2018, CF 2019, and two selections (and three honorable mentions) at IEEE MICRO Top Picks. His ISCA 2009 paper on Phase Change Memory was awarded the 2019 Persistent Impact Prize in recognition of “exceptional impact on the fields of study related to non-volatile memories”. He received the “Outstanding Researcher Award” from Intel (2020) and an “Outstanding Technical Achievement” award from IBM (2011). He was the Program Chair of MICRO 2015 and Selection Committee Co-Chair of Top Picks 2017. He received his PhD (2007) and MS (2003) from the University of Texas at Austin.
“Processor Hardware Security”
Jakub Szefer (Yale University)