2019 VLSI Design/CAD Symposium | Oral 12

線上論文審稿會議地點大會聯絡方式贊助廠商

重要日期

3/22	投稿開始日
6/6	投稿截止日 (至 11:59 AM)
6/28	論文錄取通知日
6/20~7/2	教師及一般人士報名
6/28~7/2	論文保障學生與展示學生報名
6/20~7/2	其它學生報名(一般、參觀學生)
7/3	一般學生入住抽籤結果通知
7/6	繳費截止日
8/6~9	大會期間

Call for Papers

Download Zip

Call for Pictures

Download Zip

Oral 1 Oral 2 Oral 3 Oral 4 Oral 5 Oral 6 Oral 7 Oral 8 Oral 9 Oral 10 Oral 11 Oral 12 Oral 13 Oral 14 Poster 1 Poster 2 Poster 3 Poster 4 Demo

Oral 12: Design and Optimization for Storage, Memory, and System

Session Chair: Chien-Chung Ho (National Chung Cheng University)
Date: Aug. 8 (Thu.), 2019
Time: 11:30 – 12:30
Room: 7F 論語廳+大學廳

順序	口頭報告論文
1	11:30 – 11:42 (SD11) On Enhancing Lifetime and Performance of Fine-tuning Neural Networks Through an Overhead-Reduced Design on NVM-based System Szu-Yu Chen and Chien-Chung Ho National Chung Cheng University Abstract: Convolutional neural network (CNN) which is one class of neural network has become one of the dominated applications in the computer vision field. Due to the needs of reducing training time and improving training performance, the fine-tuning neural network is widely adopted to avoid the time-consuming procedure of training a neural network from scratch. Since the continuously growing data/model size and a large number of supported CNN techniques on the neural network system, it needs to increase the DRAM size significantly. However, it is impractical to scale up the DRAM size on the neural network system since DRAM can incur many issues, such as scaling limitation and leakage power problems. This work aims at exploring a solution of how to resolve issues caused by adopted large scale CNN application on DRAM. To explore a cost-efficient solution for the large scale CNNs without using DRAM, this work proposes to exploit non-volatile memory (NVM) as main memory because of its high scalability, low read latency, and near-zero leakage power. However, the inherent properties of NVM, such as longer write latency and worse endurance, can significantly affect the performance of CNNs and reduce the lifetime of NVM. To improve the performance and lifetime issues of NVM-based CNN system, this work proposes a split-FFT approach to reduce the number of write operation on NVM while fine-tuning neural networks. Besides, this work also proposes a writing strategy based on the characteristic of fine-tuning neural network with our proposed split-FFT approach. To examine the effectiveness of the proposed approaches, a series of experiments were conducted. The experiment results show that the proposed approaches successfully average the bit flip and enhance the lifetime of NVM. To be more specific, compared to the conventional FFT convolution approach, a 1.95x performance improvement, and a 50% reduction of bit flip were achieved.
2	11:42 – 11:54 (SD12) Design a Fault Tolerant System Using System-Level Redundancy Sih-Kai Shen and Peng-Sheng Chen National Chung Cheng University Abstract: In this paper, we design a fault tolerant system using system-level redundant techniques. The whole structure consists of a primary system connected to a redundant system using network socket programming APIs. A heartbeat mechanism checks whether the primary system is alive. GlusterFS, an open-source distributed file system, aggregates the disk storage resources to provide dependable storage. In addition, distributed multithreaded checkpointing (DMTCP) stores the execution states of the application to allow resumption from failure. A GUI tool is also developed to assist users in building up the proposed fault tolerant system. Preliminary experimental results for benchmarking and test situations show that the proposed approach can improve fault tolerance and allow the operation to resume after failure.
3	11:54 – 12:06 (SD13) Pthread’s Spinlock is Unfair Shi-Wu Lo National Chung Cheng University Abstract: Pthread is a standard library defined by POSIX. Most operating systems including BSD, Linux, Solaris, HPUX and Window use pthreads as their thread library. Although a higher-level programming language defines object-oriented or function-based thread library, the underlying layer uses Pthread. For example, Java, Android, and OpenMP use Pthread to implement their multi-thread libraries. In our research, it was found that the implementation of the spinlock in GNU’s Pthread library is unfair in the many core architecture. A few cores have a chance to get a lock several times higher than other cores. We reimplemented the lock and unlock functions of Pthread’s spinlock library and changed it to an algorithm called ticket-lock. The performance of the new spinlock has dropped by 10%, but it guarantees fairness.
4	12:06 – 12:18 (SD14) Real-World Anomaly Detection in Videos Using Spatio-Temporal Autoencoders Po-Ju Lin and Pao-Ann Hsiung National Chung Cheng University Abstract: Surveillance videos capture a variety of realistic anomalies, which are challenging to detect due to the fuzzy definition of anomalous behavior and complex monitoring scenarios. This article proposes a novel spatio-temporal autoencoder network using 3DCNN and ConvLSTM to learn the characteristics of video anomalies. Experimental results show that this method can detect anomalies in the video with at least 2.4% improvement in AUC accuracy compared to the state-of-the-art ConvLSTM.
5	12:18 – 12:30 (SD15) A Novel Approach for Story Generation Wei Lin, Ting-Hsuan Chien, and Rong-Guey Chang National Chung Cheng University Abstract: The sequence transformer models are based on complex recurrent neural network or convolutional networks that include an encoder and a decoder. High-accuracy models are usually represented by used connect the encoder and decoder through an attention mechanism. Neural story generation is an important thing. If we can let computers learn the ability of story-telling, computers can help people do more things. Actually, the squence2squence model combine attention mechanism is being used to Chinese poetry generation. However, it difficult to apply in Chinese story generation, because there are some rules in Chinese poetry generation. Therefore, we trying to use 500 human-labeled summarization of paragraphs from a classic novel named “Demi-Gods and Semi-Devils”(天龍八部) to train the transformer network by the low resource. In our experiment, we got a low loss rate between different epoch.

指導單位：
教育部資訊及科技教育司

主辦單位：
臺灣積體電路設計學會

承辦單位：
國立中正大學電機工程學系
 國立中正大學資訊工程學系

協辦單位：
科技部工程司工程科技推展中心
 國研院台灣半導體研究中心
 中國電機工程學會
 智慧聯網整合推動聯盟中心
 中央研究院資訊科學研究所

贊助單位：
聯發科技股份有限公司
 日月光半導體製造股份有限公司
 威鋒電子股份有限公司
 奇景光電股份有限公司
 荷蘭商益華國際電腦科技(股)公司台灣分公司
 台灣新思科技股份有限公司
 財團法人工業技術研究院
 瑞昱半導體股份有限公司
 聯詠科技股份有限公司
 鈦思科技股份有限公司
 台灣是德科技股份有限公司
 和澄科技股份有限公司
 愛爾蘭商明導股份有限公司台灣分公司
 一元素科技股份有限公司
 力旺電子股份有限公司
 博鑫醫電股份有限公司
 群聯電子股份有限公司
 晶心科技股份有限公司

Powered by Poetry-Life Professional Conference Organizer