Quinn, Sean and Mileo, Alessandra ORCID: 0000-0002-6614-6462 (2018) Injecting knowledge into deep neural networks. In: Irish Postgraduate Research Conference, 8-9 Nov 2018, Dublin, Ireland.
Abstract
Much of the recent hype around artificial intelligence stems from recent advances in Neural Networks, currently the most widely used algorithm that succeeded where other approaches failed for decades. Neural Networks today can leverage large amounts of data to be trained to perform hard tasks such as recognising objects in an image or translating languages. The process they use to perform these tasks is equivalent to a complex pattern recognition procedure which uses some clever mathematics to expose the underlying structure in a body of data. Humans think in a more conceptual way. We build a mental model of our world. We have the ability to extract relationships such as causality between elements involved in learning to perform a task, and the ability to use background knowledge when learning. One of the key challenges in making more human-like artificial intelligence is incorporating these properties of natural learning into the neural network paradigm. Designing such a system which could utilise background knowledge in learning a new task would enable the networks to be trained on much less data, opening up a new world of opportunities for Neural Networks to be applied to tasks which were previously not feasible due to the scarce availability of data. In identifying these challenges, we have been inspired by recent seminal papers within the Deep Learning community, which call for new approaches to enhance deep representations with (common-sense) background knowledge. This is considered as a key enabler to significantly improve the ability of machines to learn new tasks faster and in a domain invariant way. The main practical challenges involved in this research are finding how best to extract and format relevant knowledge from a trained network, and finding how best to inject this knowledge into an untrained network.
Metadata
Item Type: | Conference or Workshop Item (Poster) |
---|---|
Event Type: | Conference |
Refereed: | Yes |
Subjects: | Computer Science > Artificial intelligence Computer Science > Machine learning |
DCU Faculties and Centres: | DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing Research Initiatives and Centres > INSIGHT Centre for Data Analytics |
Copyright Information: | © 2018 The Authors |
Use License: | This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. View License |
Funders: | Irish Research Council, GOIPG/2018/2501 |
ID Code: | 22953 |
Deposited On: | 28 Jan 2019 09:42 by Sean Quinn . Last Modified 13 Oct 2022 12:15 |
Documents
Full text available as:
Preview |
PDF
- Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
3MB |
Downloads
Downloads
Downloads per month over past year
Archive Staff Only: edit this record