WEKO3
アイテム
USING COMPUTING FIRST PRINCIPLES TO IMPROVE THE SYMBIOTIC PERFORMANCE IN ALGORITHMS AND PROCESSORS USED IN LOW-POWERED MACHINE LEARNING
https://tokushima-u.repo.nii.ac.jp/records/2010412
https://tokushima-u.repo.nii.ac.jp/records/2010412881501ed-6c92-4689-a766-cac0cff5e510
名前 / ファイル | ライセンス | アクション |
---|---|---|
k3652_abstract.pdf (51.8 KB)
|
|
|
k3652_review.pdf (33.3 KB)
|
|
|
k3652_fulltext.pdf (3.59 MB)
|
|
Item type | 文献 / Documents(1) | |||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
公開日 | 2022-10-27 | |||||||||||
アクセス権 | ||||||||||||
アクセス権 | open access | |||||||||||
資源タイプ | ||||||||||||
資源タイプ識別子 | http://purl.org/coar/resource_type/c_db06 | |||||||||||
資源タイプ | doctoral thesis | |||||||||||
出版タイプ | ||||||||||||
出版タイプ | NA | |||||||||||
出版タイプResource | http://purl.org/coar/version/c_be7fb7dd8ff6fe43 | |||||||||||
タイトル | ||||||||||||
タイトル | USING COMPUTING FIRST PRINCIPLES TO IMPROVE THE SYMBIOTIC PERFORMANCE IN ALGORITHMS AND PROCESSORS USED IN LOW-POWERED MACHINE LEARNING | |||||||||||
言語 | en | |||||||||||
タイトル別表記 | ||||||||||||
その他のタイトル | コンピューティングの第一原理を使用して、低電力の機械学習で使用されるアルゴリズムとプロセッサの共生パフォーマンスを向上させる | |||||||||||
言語 | ja | |||||||||||
著者 |
ンシンガ, ロバート
× ンシンガ, ロバート
|
|||||||||||
抄録 | ||||||||||||
内容記述タイプ | Abstract | |||||||||||
内容記述 | Using less electric power or speeding up processing is catching the interests of researchers in deep learning. Models have grown in complexity and size using as much precision depth as can be computationally supported regardless of how expensive the minimum required cooling system might cost. Quantization has offered ease of deployment to small devices lacking floating precision capability, but little has been suggested about the floating numbers themselves. This thesis evaluates hardware acceleration for embedded devices that cannot support the energy requirements of floating numbers and proposes solutions to challenge the limits of power consumption and apply them to measure their effectiveness in terms of energy demand and speed capacity. Experts have declared the end of Moore’s law with the current state of nanotechnology coming to terms with its inability to increase the performance per transistor density ratio. Accelerators, although providing a countering measure, have also increased their power needs to unsustainable levels. At the same time there has been sufficient increase in knowledge, such as distributed computing, to branch-off into possibilities that could reduce power demands while maintaining, or possibly increase microprocessor performance. This thesis highlights some important challenges that were born out of the rapid rise of deep learning. We present experimental results showing that low-powered devices can serve as powerful tools in low cost deep learning research. In doing so we are interested in slowing down the ongoing trend that favors expensive investment in deep learning computers. Using known properties in computer architecture, hardware acceleration, and digital arithmetic we implement ways to design algorithms that symbiotically match their performance in accordance with the theoretical limits afforded by the hardware components that run them. Computer processors are utilized based on their ability to execute instructions defined in code or machine-readable format. Some processors are multi-purpose, others are domain-specific, the former being good at a wide range of tasks and the latter only focused for specific tasks. While executing any task an ideal processor should engage all its transistors to ensure that no part is left underutilized. However, in practice it is not always the case, which is why domain-specific processors are optimized to carry only the instructions for which they would fully commit their components. It is considered good practice when algorithms are designed to encourage the maximum use of available capacity for any execution. Our proposed method improves the symbiotic complementarity in peak algorithm performance and theoretical hardware capacity. |
|||||||||||
言語 | en | |||||||||||
キーワード | ||||||||||||
言語 | en | |||||||||||
主題Scheme | Other | |||||||||||
主題 | embedded system | |||||||||||
キーワード | ||||||||||||
言語 | en | |||||||||||
主題Scheme | Other | |||||||||||
主題 | IEEE754-2008 | |||||||||||
キーワード | ||||||||||||
言語 | en | |||||||||||
主題Scheme | Other | |||||||||||
主題 | floating point | |||||||||||
キーワード | ||||||||||||
言語 | en | |||||||||||
主題Scheme | Other | |||||||||||
主題 | digital signal processor | |||||||||||
キーワード | ||||||||||||
言語 | en | |||||||||||
主題Scheme | Other | |||||||||||
主題 | Q format notation | |||||||||||
書誌情報 |
発行日 2022-09-20 |
|||||||||||
備考 | ||||||||||||
言語 | ja | |||||||||||
値 | 内容要旨・審査要旨・論文本文の公開 学位授与者所属 : 徳島大学大学院先端技術科学教育部(システム創生工学専攻) |
|||||||||||
言語 | ||||||||||||
言語 | eng | |||||||||||
報告番号 | ||||||||||||
学位授与番号 | 甲第3652号 | |||||||||||
学位記番号 | ||||||||||||
言語 | ja | |||||||||||
値 | 甲先第436号 | |||||||||||
学位授与年月日 | ||||||||||||
学位授与年月日 | 2022-09-20 | |||||||||||
学位名 | ||||||||||||
言語 | ja | |||||||||||
学位名 | 博士(工学) | |||||||||||
学位授与機関 | ||||||||||||
言語 | ja | |||||||||||
学位授与機関名 | 徳島大学 |