Seminar: Scan-vs-BIM for monitoring in construction

Title:Scan-vs-BIM for monitoring in construction

Speaker: Frédéric Bosché, Associate Professor in Construction Informatics,
Director of the Institute for Sustainable Building Design (ISBD), and Leader of the CyberBuild Lab. Heriot-Watt University

Date: 11:15 on 4 March 2019

Location: CM F.17, Heriot-Watt University

Abstract: When Laser Scanning and Building Information Modelling (BIM) technologies were emerging, the construction industry showed significant interest in what were to be eventually called “Scan-to-BIM”: the process of using a laser scanned point clouds to develop BIM models of existing assets. However, with the use of BIM for design, another important use of these technologies is what some have called “Scan-vs-BIM”: the comparison of reality capture 3D point clouds (capturing the as-is states of constructions) to BIM models (representing the as-designed states of constructions). Scan-vs-BIM offers significant opportunities for further automation in construction project delivery for example for progress or quality control.

This talk will present the Scan-vs-BIM concept, illustrate its process and benefits. The talk will then expand on the subject of using the output of Scan-vs-BIM processing to enhance dimensional quality control with a view to evolve dimensional quality control from a traditionally point-based measurement process to a surface-based measurement process.

Bio: Frédéric graduated holds a PhD in Civil Engineering, but also worked as a PostDoc in the Computer Vision group of ETH Zurich, Switzerland for 2.5 years. He is currently Associate Professor in the School of the Energy, Geoscience, Infrastructure and Society (EGIS). Frédéric leads the CyberBuild Lab (http://cyberbuild.hw.ac.uk/), and his research covers two main areas:

  1. Processing of reality capture data to enhance asset construction and life cycle management.
  2. Development and use of virtual and mixed reality technology, to support collaborative and engaging design, construction and engineering works, as well as training.

Frédéric has published over 70 peer-reviewed papers in internationally-recognised journal and conferences, and his research has received a few international research and innovation awards, including two CIOB International Research & Innovation awards in 2016, and the IAARC Tucker-Hasegawa Award in 2018 for “distinguished contributions to the field of automation and robotics in construction”. Frédéric is a member of the Executive Committee of the International Association for Automation and Robotics in Construction (IAARC), and he is Associate Editor of Automation in Construction (Elsevier).

Seminar: The Challenges of Automated Ontology Debugging: Experiences and Ideas

Title: The Challenges of Automated Ontology Debugging: Experiences and Ideas

Speaker: Juan Casanova, University of Edinburgh

Date: 11:15 on 18 February 2019

Location: CM F.17, Heriot-Watt University

Abstract: Some of the principal attractive aspects of semantic automated reasoning methods (logic) are, at the same time, what fights against it becoming widely spread and easily usable for the management of large amounts of data coming from multiple sources (ontologies). Ontology debugging is a fundamental subfield to master if automated ontology-based technologies are to be in charge of large data and knowledge management systems.

I am still a PhD student, and relatively new to the field, but during my work on ontology debugging techniques I feel I have come to identify a few fundamental challenges that we need to be aware of, such as the need for additional information, how big the issue of (local) inconsistency in ontologies can be, and the problem of efficiently finding relevant justifications for inferences.

In this talk, I’ll be briefly explaining my work on automated fault detection using meta-ontologies, in the context of which I have identified and battled with these challenges, and I’ll be presenting my opinions on where these challenges are coming from and what could be done to tackle them. It is likely that some of you will disagree with some of my claims or think that they are obvious, and that is precisely why I think this talk should incentivize some useful and interesting discussion.

Biohackathon 2018 -Paris

Last November I had the privilege to be one of 150 participants at the Biohackathon organised by ELIXIR. The hackathon was organised into 29 topics, many of which were related to Bioschemas and one directly focused on Bioschemas. For the Bioschemas topic we had up to 30 people working around three themes. The first theme […]

Bioschemas at the Biohackathon

Last November I had the privilege to be one of 150 participants at the Biohackathon organised by ELIXIR. The hackathon was organised into 29 topics, many of which were related to Bioschemas and one directly focused on Bioschemas. For the Bioschemas topic we had up to 30 people working around three themes.

The first theme was to implement markup for the various life sciences resources present. Representatives from ELIXIR Core Data Resources and node resources from the UK and Switzerland were there to work on this thanks to the staff exchange and travel fund. By the end of the week we had new live deploys for 11 additional resources and examples for many more.

The second theme was to refine the types and profiles that Bioschemas has been developing based on the experiences of deploying the markup. Prior to the hackathon, Bioschemas had moved from a minimal Schema.org extension of a single BioChemEntity type to collection of types for the different life science resources, e.g. Gene, Protein, and Taxon. Just before the hackathon a revised set of types and profiles were released. This proved to be useful for discussion, but it very quickly became clear that there was need for further refinement. During the hackathon we started new profiles for DNA, Experimental Studies, and Phenotype, and the Chemical profile was split into MolecularEntity and ChemicalSubstance. Long discussions were held about the types and their structure with early drafts for 17 types being proposed. These are now getting to a state where they are ready for further experimentation.

The third theme was to develop tooling to support Bioschemas. Due to the intensity of the discussions on the types and profiles, there was no time to work on this topic. However, the prototype Bioschemas Generator was extensively tested during the first theme and improvements fed back to the developer. There were also refinements made to the GoWeb tool.

Overall, it was a very productive hackathon. The venue proved to be very conducive to fostering the right atmosphere. During the evenings there were opportunities to socialise or carry on the discussions. Below are two of the paintings that were produced during one of the social activities that capture the Bioschemas discussions.

And there was the food. Wow! Wonderful meals, three times a day.

Seminar: Environmental Health Research in the Era of the ‘Exposome’

Title: Environmental Health Research in the Era of the ‘Exposome’

Speaker: Miranda Loh, Sc.D., Senior Scientist at the Institute of Occupational Medicine

Date: 11:15 on February 2019

Location: CM F.17, Heriot-Watt University

Abstract: In 2015, an estimated 9 million premature deaths were caused by pollution, with air pollution as the leading environmental risk factor. The potential environmental burden of disease could be even larger, as there are still many unknown causes of disease. Much of this uncertainty around the cause of diseases comes from poor description of environmental and occupational exposures in epidemiological studies. Current research into characterising the exposome, the sum total of all exposures through an individual’s lifetime, aims at improving exposure science and our understanding of the relationships between environment and health. There has been great interest in the exposome community in using sensors and smart technologies to further assessment of environmental, behavioural, and health information for individuals. This seminar will explore current interests in the use of technology in exposome research.

Seminar: Exploiting Semantic Web Technologies in Open-domain Conversational Agents

Title: Exploiting Semantic Web Technologies in Open-domain Conversational Agents

Speaker: Alessandro Suglia, Heriot-Watt University

Date: 11:15 on 10 December 2018

Location: CM F.17, Heriot-Watt University

Abstract: The Amazon Alexa Prize is an international competition organised to foster the development of sophisticated open-domain conversational agents. During the competition, the systems should be able to support a conversation with a user about several topics ranging from movies to the news in an engaging and coherent way. In order to understand the entities mentioned by the user and to be able to provide interesting information about those, we relied on several Semantic Web Technologies such as the Amazon Neptune cluster for high-performance SPARQL query execution on a customised large-scale knowledge base composed of Wikidata and a fragment of the DBpedia ontology. In this talk, I will provide an overview of the system and I will describe how our system leverages the power of Linked Data in several components of the architecture.

ISWC 2018

ISWC 2018 Trip Report Keynotes There were three amazing and inspiring keynote talks, all very different from each other. The first was given by Jennifer Golbeck (University of Maryland). While Jennifer did her PhD on the Semantic Web in the early days of social media and Linked Data, she now focuses on user privacy and […]

ISWC 2018 Trip Report

Keynotes

There were three amazing and inspiring keynote talks, all very different from each other.

The first was given by Jennifer Golbeck (University of Maryland). While Jennifer did her PhD on the Semantic Web in the early days of social media and Linked Data, she now focuses on user privacy and consent. These are highly relevant topics to the Semantic Web community and something that we should really be considering when linking people’s personal data. While the consequences of linking scientific data might not be as scary, there are still ethical issues to consider if we do not get it right. Check out her TED talk for an abridged version of her keynote.

She also suggested that when reading a companies privacy policy, you should replace the work “privacy” with “consent” and see how it seems then.

The talk also struck an accord with the launch of the SOLID framework by Tim Berners-Lee. There was a good sales pitch of the SOLID framework from Ruben Verborgh in the afternoon of the Decentralising the Semantic Web Workshop.

The second was given by Natasha Noy (Google). Natasha talked about the challenges of being a researcher and engineering tools that support the community. Particularly where impact may only be detect 6 to 10 years down the line. She also highlighted that Linked Data is only a small fraction of the data in the world (the tip of the iceberg), and it is not appropriate to expect all data to become Linked Data.

Her most recent endeavour has been the Google Dataset Search Tool. This has been a major engineering and social endeavour; getting schema.org markup embedded on pages and building a specialist search tool on top of the indexed data. More details of the search framework are in this blog post. The current search interface is limited due to the availability of metadata; most sites only make title and description available. However, we can now start investigating how to return search results for datasets and what additional data might be of use. This for me is a really exciting area of work.

Later in the day I attended a talk on the LOD Atlas, another dataset search tool. While this gives a very detailed user interface, it is only designed for Linked Data researchers, not general users looking for a dataset.

The third keynote was given by Vanessa Evers (University of Twente, The Netherlands). This was in a completely different domain, social interactions with robots, but still raised plenty of questions for the community. For me the challenge was how to supply contextualised data.

Knowledge Graph Panel

The other big plenary event this year was the knowledge graph panel. The panel consisted of representatives from Microsoft, Facebook, eBay, Google, and IBM, all of whom were involved with the development of Knowledge Graphs within their organisation. A major concern for the Semantic Web community is that most of these panelists were not aware of our community or the results of our work. Another concern is that none of their systems use any of our results, although it sounds like several of them use something similar to RDF.

The main messages I took from the panel were

  • Scale and distribution were key

  • Source information is going to be noisy and challenging to extract value from

  • Metonymy is a major challenge

This final point connects with my work on contextualising data for the task of the user [1, 2] and has reinvigorated my interest in this research topic.

Final Thoughts

This was another great ISWC conference, although many familiar faces were missing.

There was a great and vibrant workshop programme. My paper [3] was presented during the Enabling Open Semantic Science workshop (SemSci 2018) and resulted in a good deal of discussion. There were also great keynotes at the workshop from Paul Groth (slides) and Yolanda Gil which I would recommend anyone to look over.

I regret not having gone to more of the Industry Track sessions. The one I did make was very inspiring to see how the results of the community are being used in practice, and to get insights into the challenges faced.

The conference banquet involved a walking dinner around the Monterey Bay Aquarium. This was a great idea as it allowed plenty of opportunities for conversations with a wide range of conference participants; far more than your standard banquet.

Here are some other takes on the conference:

I also managed to sneak off to look for the sea otters.

[1] [doi] Colin R. Batchelor, Christian Y. A. Brenninkmeijer, Christine Chichester, Mark Davies, Daniela Digles, Ian Dunlop, Chris T. A. Evelo, Anna Gaulton, Carole A. Goble, Alasdair J. G. Gray, Paul T. Groth, Lee Harland, Karen Karapetyan, Antonis Loizou, John P. Overington, Steve Pettifer, Jon Steele, Robert Stevens, Valery Tkachenko, Andra Waagmeester, Antony J. Williams, and Egon L. Willighagen. Scientific Lenses to Support Multiple Views over Linked Chemistry Data. In The Semantic Web – ISWC 2014 – 13th International Semantic Web Conference, Riva del Garda, Italy, October 19-23, 2014. Proceedings, Part I, page 98–113, 2014.
[Bibtex]
@inproceedings{BatchelorBCDDDEGGGGHKLOPSSTWWW14,
abstract = {When are two entries about a small molecule in different datasets the same? If they have the same drug name, chemical structure, or some other criteria? The choice depends upon the application to which the data will be put. However, existing Linked Data approaches provide a single global view over the data with no way of varying the notion of equivalence to be applied.
In this paper, we present an approach to enable applications to choose the equivalence criteria to apply between datasets. Thus, supporting multiple dynamic views over the Linked Data. For chemical data, we show that multiple sets of links can be automatically generated according to different equivalence criteria and published with semantic descriptions capturing their context and interpretation. This approach has been applied within a large scale public-private data integration platform for drug discovery. To cater for different use cases, the platform allows the application of different lenses which vary the equivalence rules to be applied based on the context and interpretation of the links.},
author = {Colin R. Batchelor and
Christian Y. A. Brenninkmeijer and
Christine Chichester and
Mark Davies and
Daniela Digles and
Ian Dunlop and
Chris T. A. Evelo and
Anna Gaulton and
Carole A. Goble and
Alasdair J. G. Gray and
Paul T. Groth and
Lee Harland and
Karen Karapetyan and
Antonis Loizou and
John P. Overington and
Steve Pettifer and
Jon Steele and
Robert Stevens and
Valery Tkachenko and
Andra Waagmeester and
Antony J. Williams and
Egon L. Willighagen},
title = {Scientific Lenses to Support Multiple Views over Linked Chemistry
Data},
booktitle = {The Semantic Web - {ISWC} 2014 - 13th International Semantic Web Conference,
Riva del Garda, Italy, October 19-23, 2014. Proceedings, Part {I}},
pages = {98--113},
year = {2014},
url = {http://dx.doi.org/10.1007/978-3-319-11964-9_7},
doi = {10.1007/978-3-319-11964-9_7},
}
[2] [doi] Alasdair J. G. Gray. Dataset Descriptions for Linked Data Systems. IEEE Internet Computing, 18(4):66–69, 2014.
[Bibtex]
@article{Gray14,
abstract = {Linked data systems rely on the quality of, and linking between, their data sources. However, existing data is difficult to trace to its origin and provides no provenance for links. This article discusses the need for self-describing linked data.},
author = {Alasdair J. G. Gray},
title = {Dataset Descriptions for Linked Data Systems},
journal = {{IEEE} Internet Computing},
volume = {18},
number = {4},
pages = {66--69},
year = {2014},
url = {http://dx.doi.org/10.1109/MIC.2014.66},
doi = {10.1109/MIC.2014.66},
}
[3] Alasdair J. G. Grayg. Using a Jupyter Notebook to perform a reproducible scientific analysis over semantic web sources. In Enabling Open Semantic Science, Monterey, California, USA, 2018. Executable version: https://mybinder.org/v2/gh/AlasdairGray/SemSci2018/master?filepath=SemSci2018%20Publication.ipynb
[Bibtex]
@InProceedings{Gray2018:jupyter:SemSci2018,
abstract = {In recent years there has been a reproducibility crisis in science. Computational notebooks, such as Jupyter, have been touted as one solution to this problem. However, when executing analyses over live SPARQL endpoints, we get different answers depending upon when the analysis in the notebook was executed. In this paper, we identify some of the issues discovered in trying to develop a reproducible analysis over a collection of biomedical data sources and suggest some best practice to overcome these issues.},
author = {Alasdair J G Grayg},
title = {Using a Jupyter Notebook to perform a reproducible scientific analysis over semantic web sources},
OPTcrossref = {},
OPTkey = {},
booktitle = {Enabling Open Semantic Science},
year = {2018},
OPTeditor = {},
OPTvolume = {},
OPTnumber = {},
OPTseries = {},
OPTpages = {},
month = oct,
address = {Monterey, California, USA},
OPTorganization = {},
OPTpublisher = {},
note = {Executable version: https://mybinder.org/v2/gh/AlasdairGray/SemSci2018/master?filepath=SemSci2018%20Publication.ipynb},
url = {http://ceur-ws.org/Vol-2184/paper-02/paper-02.html},
OPTannote = {}
}

First steps with Jupyter Notebooks

At the 2nd Workshop on Enabling Open Semantic Sciences (SemSci2018), colocated at ISWC2018, I presented the following paper (slides at end of this post): Title: Using a Jupyter Notebook to perform a reproducible scientific analysis over semantic web sources Abstract: In recent years there has been a reproducibility crisis in science. Computational notebooks, such as […]

At the 2nd Workshop on Enabling Open Semantic Sciences (SemSci2018), colocated at ISWC2018, I presented the following paper (slides at end of this post):

Title: Using a Jupyter Notebook to perform a reproducible scientific analysis over semantic web sources

Abstract: In recent years there has been a reproducibility crisis in science. Computational notebooks, such as Jupyter, have been touted as one solution to this problem. However, when executing analyses over live SPARQL endpoints, we get different answers depending upon when the analysis in the notebook was executed. In this paper, we identify some of the issues discovered in trying to develop a reproducible analysis over a collection of biomedical data sources and suggest some best practice to overcome these issues.

The paper covers my first attempt at using a computational notebook to publish a data analysis for reproducibility. The paper provokes more questions than it answers and this was the case in the workshop too.

One of the really great things about the paper is that you can launch the notebook, without installing any software, by clicking on the binder button below. You can then rerun the entire notebook and see whether you get the same results that I did when I ran the analysis over the various datasets.

Summer of Good News

This has been a good summer, not just because the British weather has been somewhat more summery than usual. Qianru’s Graduation In June, my first PhD student graduated. Dr Qianru Zhou investigated the use of an ontology to enable a software defined network. Her PhD thesis is “Ontology-driven knowledge based autonomic management for telecommunication networks: Theory, […]

Picture of Qianru on her graduation day

This has been a good summer, not just because the British weather has been somewhat more summery than usual.

Qianru’s Graduation

In June, my first PhD student graduated. Dr Qianru Zhou investigated the use of an ontology to enable a software defined network. Her PhD thesis is “Ontology-driven knowledge based autonomic management for telecommunication networks: Theory, implementation and applications”,

Promotion

As of today (1 August 2018), I am now an Associate Professor (equivalent to Senior Lecturer in traditional British universities).

Grant Success

Today saw the start of a collaboration with VisionWare, a company based in Glasgow who specialise in record linkage, and is funded as an Interface Voucher. We are investigating combining the data corruption framework that Ahmad has been developing with the synthetic data that VisionWare have been generating. The purpose is to enable us to evaluate, and thus improve, record linkage.

 

Report on the 6th Scottish Linked Data Interest Group Workshop

On 29 May, the Heriot-Watt Semantic Web Lab hosted the sixth Scottish Linked Data Interest Group workshop (SLiDinG6), sponsored by the SICSA Data Science theme. There were 30 attendees from academia, government, and industry.

The main focus of the day was to share interests and knowledge around Semantic Web and Linked Data research and use across Scotland, with a view to fostering collaboration and interaction.  We asked all attendees to fill out a well-sorted form two weeks prior to the event.  Well Sorted allows attendees of events to put in titles and short descriptions of their interests.  Once these are all received, attendees group them according to their own view of how they interact. The Well Sorted algorithm then takes all sortings and creates themed groups.  Attendees were allowed to submit up to two responses. We received 17 responses, which were sorted into three groups as shown in the figure below. The full well-sorted results can be found here.

The full schedule for the day can be found on the event webpage.  We began with lightning talks – 17 in total – to give attendees a feel for the breadth and focus of interest across the group.  After lunch we had the first group session, based on the well-sorted groups. All attendees could see the topics in each group and attend whichever they felt suited them best.  The groups discussed the first three questions outlined on the event webpage, with a focus on their interests within their particular group:

  1. What are the killer apps of the Semantic Web?
  2. What are the challenges from industry and government?
  3. Where are the synergies in Semantic Web research in Scotland?

Feedback was given by each group before the coffee break, and on this basis some attendees moved between groups.

After the coffee break, group discussions continued, with more focus on using the outcome of the earlier session to discuss the fourth question:

  1. Can we identify projects to push forward for funding?

At the end of the day, we summarised across the meeting what had come out of these groups, and various potential avenues which could be developed further were identified.

We discussed when we would like to hold the next event and agreed that holding SLiDInG events more often would be preferable.  We agreed that the next one would be held around Easter next year (10 months away), possibly in conjunction with the UK Ontology Network Meeting, venue to be confirmed.

Throughout the day attendees were encouraged to contribute to the live blog of the meeting, which can be found here.  This gives a much more thorough overview of the discussion of the day.

We are very grateful to SICSA for the support, which enabled this useful event to take place.

SLiDInG 6

Today, the Semantic Web Lab hosted the 6th Scottish Linked Data Interest Group workshop at Heriot-Watt University. The event was sponsored by the SICSA Data Science Theme. The event was well attended with 30 researchers from across Scotland (and Newcastle) coming together for a day of flash talks and discussions. Live minutes were captured during the […]

Today, the Semantic Web Lab hosted the 6th Scottish Linked Data Interest Group workshop at Heriot-Watt University. The event was sponsored by the SICSA Data Science Theme. The event was well attended with 30 researchers from across Scotland (and Newcastle) coming together for a day of flash talks and discussions. Live minutes were captured during the day and can be found here.

I gave a talk on the successes and challenges of FAIR data. My slides are embedded below.