elie.abilahoud@ucd.ie
paul-emmanuel.raoul@ucd.ie
cal.muckley@ucd.ie
Keywords
Financial technology
Knowledge representation
Design science research
Fintech
Regtech
Semantic web
Linked open data
Ontology
RDF graph
SPARQL
Data visualisation
The fintech ecosystem is a highly dynamic environment. Venture investment in fintech grew by 11% to $17.4 billion in 2016, according to Forbes and Pitchbook. A clear picture of this ecosystem at a given point in time is much needed to inform policy activities, as evidenced by the European Commission call for tenders SMART 2016/0042. This paper describes our approach to integrating, enriching and presenting data about the fintech ecosystem. It presents FintechBrowser, a demonstrator comprising an integrated set of dashboards and an exploratory graph-browsing tool.
Over the past decade we have witnessed technology-driven innovations that disrupted the financial services industry and continue to do so. In consumer payments, nascent start-ups are able to bring to market affordable and flexible cross-border payment solutions that outperform those of some of the largest traditional financial institutions. In the funding space, the rise of crowd-funding democratised access to capital, creating a whole new market segment. In investment management, robo-advisors renewed the faith of many in the applications of artificial agents while providing satisfying results at competitive costs. Disruption has caught up even with currencies, one of the oldest concepts in the financial industry, through the proliferation of crypto-currencies using blockchain technologies.
A clear picture of this ecosystem at a given point in time is much needed to inform policy activities, as evidenced by the European Commission call for tenders SMART 2016/0042. Our research is focussed on exploring the fintech ecosystem using ontological modelling. It seeks, inter alia, to improve our understanding of this ecosystem, which is a critically important environment from which solutions to operational risks at financial institutions frequently arise. We combine novel knowledge representation and data modelling techniques to reveal links between concepts represented in the data that would otherwise be inaccessible. We use semantic web standards to free data from tables and spreadsheets and to enrich it with linked open data. We visualise data through the lens of our ontology and present it on a map and in a navigable graph.
The rest of the paper is structured as follows. In Section 2 we describe our approach to building a fintech knowledge graph¹. In Section 3 we present FintechBrowser², a tool for presenting and navigating the fintech knowledge graph. Finally, in Section 4 we conclude the paper and outline on-going work to extend the knowledge graph and FintechBrowser.
Current semantic web specifications [6] [14] [11] and tools [15] [13] [1] allow us to represent a domain by documenting its concepts and the relationships between them, to source data instantiating this representation from heterogeneous sources, and to perform automated reasoning to draw inferences and deductions from data following the domain representation. We build on these capabilities in order to construct the nucleus of a knowledge graph of the fintech ecosystem. Ehrlinger & Wöß [7] outline that a knowledge graph is "somehow superior to a knowledge base" as a knowledge graph "acquires and integrates information into an ontology and applies a reasoner to derive new knowledge". At the heart of this fintech knowledge graph is our OWL fintech ontology. We enrich the latter with custom-built layers of extensions to include additional domain knowledge and by tapping into, and reusing knowledge from, the Linked Open Data Cloud. Section 4 outlines our plan to evolve this knowledge base into a knowledge graph.
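To make the idea of a reasoner deriving new knowledge concrete, the following minimal sketch applies two RDFS-style rules (transitivity of the subclass relation, and propagation of instance types along it) to a handful of triples. All class and instance names (`ex:PaymentStartup`, `ex:AcmePay`, and so on) are hypothetical and are not taken from our fintech ontology; in practice an off-the-shelf OWL reasoner would be used rather than this hand-written loop.

```python
# Illustrative forward-chaining over (subject, predicate, object) triples.
# All "ex:" names are hypothetical, chosen only for this sketch.
SUBCLASS = "rdfs:subClassOf"
TYPE = "rdf:type"


def infer(triples):
    """Apply two RDFS-style rules until no new triple appears."""
    closed = set(triples)
    changed = True
    while changed:
        changed = False
        new = set()
        for (s, p, o) in closed:
            if p not in (SUBCLASS, TYPE):
                continue
            for (s2, p2, o2) in closed:
                # Rule 1: A subClassOf B, B subClassOf C => A subClassOf C
                # Rule 2: x type A,       A subClassOf B => x type B
                if p2 == SUBCLASS and s2 == o:
                    new.add((s, p, o2))
        if not new <= closed:
            closed |= new
            changed = True
    return closed


asserted = {
    ("ex:PaymentStartup", SUBCLASS, "ex:FintechFirm"),
    ("ex:FintechFirm", SUBCLASS, "ex:Organisation"),
    ("ex:AcmePay", TYPE, "ex:PaymentStartup"),
}

# Triples that were never stated but follow from the class hierarchy.
inferred = infer(asserted) - asserted
for triple in sorted(inferred):
    print(triple)
```

In this toy example only one type assertion is made for `ex:AcmePay`, yet the closure also classifies it as an `ex:FintechFirm` and an `ex:Organisation`.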
We follow a Design Science approach in building our ontology [12]. With the objective of maximising our ontology's chances of reuse, we pay special attention to guidelines 2 and 4 on the relevance of the research and the value of the contribution, as described by Hevner et al. To this end, we have assembled an advisory committee of domain experts in financial services, in fintech and in technology [10] to observe, assist and assess. The ontology architecture is described in Figure 1 (Top left). A core module contains definitions of OWL classes, object properties and restrictions representing the main actors of the fintech ecosystem and the relationships among them. Extension modules build on top of the core module to add further concepts as we learn about them or as they become relevant to the applications and queries this ontology underpins. For example, we are designing one extension to represent innovation programs and accelerators and another to represent conferences and industry events. In addition to these two layers, and as our study evolves, the time dimension is becoming increasingly relevant, for example to support queries about people changing roles or investment trends over time. To this end, we are adding time-related information to some concepts, roles and relationships. The ontology metrics of the current version of our core ontology module are highlighted in Figure 1 (Right) in a screenshot from Protégé [15].
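The role of the time dimension can be sketched as follows: each role assertion carries a start date and an optional end date, which is enough to answer a query about people changing roles. The `RoleAssertion` structure, the `people_who_changed_role` helper and all names and dates below are assumptions made for illustration only; they do not reflect how time is actually modelled in the ontology.

```python
from dataclasses import dataclass
from datetime import date
from typing import Optional


@dataclass(frozen=True)
class RoleAssertion:
    """A person holding a role at an organisation over a time interval."""
    person: str
    role: str
    organisation: str
    start: date
    end: Optional[date] = None  # None means the role is still held


def people_who_changed_role(assertions, since):
    """Map each person who took up a further role on or after `since`
    to their chronological list of (role, organisation) pairs."""
    by_person = {}
    for a in sorted(assertions, key=lambda a: a.start):
        by_person.setdefault(a.person, []).append(a)
    changed = {}
    for person, history in by_person.items():
        # A change is any assertion after the first that starts on/after `since`.
        if any(a.start >= since for a in history[1:]):
            changed[person] = [(a.role, a.organisation) for a in history]
    return changed


# Hypothetical instance data.
assertions = [
    RoleAssertion("ex:Alice", "CTO", "ex:AcmePay", date(2014, 1, 1), date(2016, 6, 30)),
    RoleAssertion("ex:Alice", "CEO", "ex:LedgerLabs", date(2016, 7, 1)),
    RoleAssertion("ex:Bob", "Founder", "ex:AcmePay", date(2013, 5, 1)),
]

result = people_who_changed_role(assertions, since=date(2016, 1, 1))
print(result)
```

Keeping the interval on the assertion itself, rather than on the person or the organisation, lets the same individual appear in several roles without overwriting earlier facts.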
In our Design Science approach, and following guidelines 3 and 5 on evaluation and rigour [12], we gathered a set of queries that the first version of the ontology should help answer. This is in line with ontology engineering best practices and widely referenced guidelines [16] [8]. Figure 1 (Bottom left) shows an excerpt of these questions. As we captured these key competency questions, we identified classes and entities, highlighted between quotes in Figure 1 (Bottom left), which made their way into our ontology.
The Linked Open Data community is growing steadily [3] [5], and DBpedia [2] in particular contains knowledge relevant to this study. We link to DBpedia to obtain information about, for example, locations, cities and population sizes. We use this linked data in queries such as the second one described in Figure 1 (Bottom left), where we correlate technologies with services and locations of firms and then filter by the size of the population in these locations. Another source of knowledge we are currently working on linking to is the Financial Industry Business Ontology (FIBO). FIBO is an initiative supported by many institutions [4]. Modules of FIBO are undergoing the Object Management Group's rigorous standardisation process [9]. In particular, FIBO Business Entities and its modules contain knowledge relevant to this study, such as the definitions of entities and corporations, as well as individuals representing a wide array of regulators in different jurisdictions.
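The kind of query described above, correlating technologies, services and firm locations and then filtering by population, can be sketched as a join between local firm facts and externally sourced city data. This is an illustrative sketch only: the firm records, the city identifiers and the population figures are hypothetical placeholders, and the in-memory `population` dictionary merely stands in for values that would be retrieved from DBpedia.

```python
# Hypothetical firm facts as they might be held in the local knowledge base.
firms = [
    {"firm": "ex:AcmePay", "technology": "blockchain",
     "service": "payments", "city": "ex:CityA"},
    {"firm": "ex:LedgerLabs", "technology": "blockchain",
     "service": "payments", "city": "ex:CityB"},
    {"firm": "ex:RoboInvest", "technology": "machine learning",
     "service": "investment management", "city": "ex:CityA"},
]

# Stand-in for linked open data; the figures are placeholders, not real data.
population = {"ex:CityA": 1_200_000, "ex:CityB": 80_000}


def firms_by(technology, service, min_population):
    """Firms using `technology` to deliver `service`, located in a city
    whose (externally sourced) population is at least `min_population`."""
    matches = []
    for f in firms:
        if f["technology"] != technology or f["service"] != service:
            continue
        # Cities without a known population are excluded from the result.
        if population.get(f["city"], 0) >= min_population:
            matches.append((f["firm"], f["city"]))
    return matches


hits = firms_by("blockchain", "payments", min_population=500_000)
print(hits)
```

The population values never need to be stored alongside the firm data; they are looked up at query time through the shared city identifier, which is the benefit of linking to an external dataset.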
In this paper we described on-going research aimed at improving our understanding of the fintech ecosystem. We followed a semantic web approach insofar as we used de facto semantic web standards to guide and express our knowledge representation of this domain. In building the ontology, we followed design science research guidelines on rigour and relevance. We sought advice and feedback from a committee of international domain experts. We also built FintechBrowser, a demonstrator comprising an integrated set of dashboards and a graph navigation tool. FintechBrowser reuses the ontology, enriched with linked open data and populated with a test dataset, to highlight the benefits of this approach and how it contributes to increasing our understanding of the fintech ecosystem. We conducted a set of experiments with FintechBrowser and experienced first-hand the advantages of discovering associations in the knowledge base by visually navigating from node to node, without the need for typical tabular data manipulation such as pivoting and manual cross-referencing. FintechBrowser, along with the fintech ecosystem ontology, also allowed us to demonstrate one major benefit of this approach: one does not need to have all the data about the ecosystem, because the system creates associations between entities using the domain knowledge represented in the ontology.
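As a small illustration of how domain knowledge can supply associations that were never entered as data, the sketch below declares two pairs of inverse properties and uses them to complete the graph before listing the edges leaving a node, which is the basic step behind node-to-node navigation. The property names, the entities and the `INVERSE_OF` table are hypothetical and are not drawn from the fintech ontology or from FintechBrowser.

```python
# Hypothetical domain knowledge: each property and its inverse.
INVERSE_OF = {
    "ex:investsIn": "ex:hasInvestor",
    "ex:employs": "ex:worksFor",
}


def with_inverses(triples):
    """Add the inverse edge for every triple whose property has one."""
    completed = set(triples)
    for s, p, o in triples:
        if p in INVERSE_OF:
            completed.add((o, INVERSE_OF[p], s))
    return completed


def neighbours(triples, node):
    """Outgoing edges of `node` as sorted (property, target) pairs."""
    return sorted((p, o) for s, p, o in triples if s == node)


# Only two facts are entered by hand.
asserted = {
    ("ex:SeedFund", "ex:investsIn", "ex:AcmePay"),
    ("ex:AcmePay", "ex:employs", "ex:Alice"),
}

graph = with_inverses(asserted)
print(neighbours(graph, "ex:Alice"))
print(neighbours(graph, "ex:AcmePay"))
```

Starting from `ex:Alice`, a user can reach `ex:AcmePay` and then `ex:SeedFund` even though no edge leaving either of the first two nodes in that direction was entered manually.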
As it stands, FintechBrowser operates on manually collected data, which we believe prevents it from reaching its full potential and from helping us improve our understanding of the fintech ecosystem. We are exploring a series of algorithms to automatically (or semi-automatically) collect instance data and evolve our knowledge base into a knowledge graph [7]. We are evaluating Named Entity Recognition and other Natural Language Processing techniques to populate the graph from different sources, including company websites and news. With more data continuously being added and time-stamped, we expect that this study will deepen our understanding of the ecosystem by exploring the impact of criteria such as "education", "tax rates", etc. on fintech performance, expressed in terms of number of "deal values", "patents", etc. at a given "location", and, in a subsequent step, by exploring scenarios of future development of the fintech ecosystem.
This work is supported by Enterprise Ireland and IDA Ireland under the Government of Ireland Technology Centre Programme. It is also supported by University College Dublin, College of Business Research Support scheme 2017.
[1] Apache Jena: project website. https://jena.apache.org/ (Last visited June 2017).
[2] Auer, S., Bizer, C., Kobilarov, G., Lehmann, J., Cyganiak, R. and Ives, Z. (2007). DBpedia: A nucleus for a web of open data. The semantic web, pp.722-735.
[3] Bauer, F. and Kaltenböck, M. (2011). Linked open data: The essentials. Edition mono/monochrom, Vienna.
[4] Bennett, M. (2013). The financial industry business ontology: Best practice for big data. Journal of Banking Regulation, 14(3-4), pp.255-268.
[5] Bizer, C., Heath, T., Idehen, K. and Berners-Lee, T. (2008). Linked data on the web (LDOW2008). In Proceedings of the 17th International Conference on World Wide Web (pp. 1265-1266). ACM.
[6] Brickley, D. and Guha, R.V. (2004). RDF vocabulary description language 1.0: RDF schema. W3C recommendation.
[7] Ehrlinger, L. and Wöß, W. (2016). Towards a definition of knowledge graphs. SEMANTiCS.
[8] Fernández-López, M., Gómez-Pérez, A. and Juristo, N. (1997). Methontology: from ontological art towards ontological engineering.
[9] Financial Services Standards: FIBO, Object Management Group website. http://www.omg.org/hot-topics/finance.htm (Last visited June 2017).
[10] Fintech Ecosystem: project website. www.fintechecosystem.net (Last visited July 2017).
[11] Harris, S., Seaborne, A. and Prud’hommeaux, E. (2013). SPARQL 1.1 query language. W3C recommendation, 21 March 2013.
[12] Hevner, A.R., March, S.T., Park, J. and Ram, S. (2004). Design science in information systems research. Management Information Systems Quarterly, 28(1), pp.75-105.
[13] McBride, B. (2001). Jena: Implementing the RDF model and syntax specification. In Second International Conference on Semantic Web-Volume 40 (pp. 23-28).
[14] McGuinness, D.L. and Van Harmelen, F. (2004). OWL web ontology language overview. W3C recommendation, 10 February 2004.
[15] Musen, M.A. (2015). The Protégé project: A look back and a look forward. AI Matters. Association for Computing Machinery Special Interest Group in Artificial Intelligence, 1(4), June 2015. DOI: 10.1145/2757001.2757003.
[16] Uschold, M. and Gruninger, M. (1996). Ontologies: Principles, methods and applications. The knowledge engineering review, 11(02), pp.93-136.
¹ The ontology will be made available at: https://github.com/FinTechEcosystem.
² Demonstrations of FintechBrowser are available via: http://fintechecosystem.net.