Kevin Miller

Finding Items In Imagery Has Never Been Easier

Twenty years ago, my paternal grandfather passed away. While we were all dealing with the grief of the situation and planning his funeral, my mother handed me a stack of what seemed like a million old family photographs.

She, being a genealogy buff, instructed me to “Find some good photographs of your grandfather” and “Digitize every photo using this fancy scanner”. She also told me to label every person who appeared in each photo, along with where it was taken and what was going on in the photo. Some of the photos dated back to when I was a five-year-old kid running around with my cousins on a hot summer’s day. Others were only a few years old, taken when we were visiting Grandma and Grandpa and eating homemade chicken noodles and mashed potatoes that my grandma made from scratch.

At the time, going through these photos was a very emotional and tedious task. Off and on over the course of the week, I sat there slowly putting a photo in the scanner one-by-one, hitting the scan button, and writing in a journal all the information I could extract out of the photo. I know I was probably making a few mistakes along the way, but I didn’t care, I just wanted this long process to end.

Today, every single photograph taken is natively stored in a digital form in the camera and in many cases, stored online immediately after it is taken. Throughout the Internet, we all have access to millions of photographs of people, our friends, pets, food, buildings, flowers, and landscapes. These are all logged automatically using artificial intelligence. I can open my photos app on my smart phone, select a photo and it automatically shows me all the photos I have that have the same person in it. I can simply search for “car” and find all the pictures of my old red sports car that I used to drive to university every day. I can digitally erase the photobomber out of the picture that I don’t want in that otherwise perfect shot I took. It’s quite amazing that technology exists so readily, right in our hands. You don’t need a Ph.D. in computer vision to be able to do this sort of work, you just need the right application to help you out.

Here at Solv3d, our customers collect thousands upon thousands of miles (or kilometers) of LiDAR, orthomosaics, and panoramic images every year. They are mapping the world and creating a digitized version of it. This digitalization requires them to identify areas (or objects) of interest, such as street signs.

For example, a municipality may want to quickly identify the location of all their stop signs and yield signs, and then cross reference this against a list of dates the signs were installed. They may need to use this information for maintenance; a maintenance manager could use this information to estimate costs to upgrade all these signs throughout the entire city boundary.

We set out to provide a solution that will allow our customers to cost effectively identify objects of interest in their datasets in a timely manner. We knew that whatever we came up with had to produce results consistent between datasets. We also set ourselves a goal to develop a solution that was both secure (the data and the model had to remain local to the user) and flexible (the user should be able to identify whatever is important to them, not what we thought was important). Our solution is a deep learning-based tool that our customers can use to produce their own model and then use it to identify objects of interest in their data.

The process is simple and very flexible. Users load a recent set of panoramic images into Engine that they then use to train a model. They do this by using Engine to draw polygons around 10 to 20 examples of the object they are interested in, such as a stop sign or a yield sign [Figure 1].

Engine labelling process for stop-sign — Figure 1: Labeling process for a stop-sign

They then press a button to have Engine “teach” the model about the example. As Engine is teaching the model, they can monitor the accuracy of the model as it works through its learning process [Figure 2].

TensorBoard training chart — Figure 2: Viewing accuracy of training

When Engine has finished training the model, users then use it to apply the model to their data. Engine will show the results on the screen.

Figure 3: Results in Engine – 300+ observations found for each object

Finally, Engine can export the results, including an estimate of the locations of each object, in multiple formats such as CSV. This allows users to share the results with others, or use the results in other applications such as SOLV3D Encompass or Esri ArcGIS. Figure 4 shows results inside of Encompass.

Figure 4: Results in Encompass. Teal pins = stop signs, magenta pins = yield signs.

Once you have trained the model, you can use that model any time on other data to identify the same object of interest. The model is very cost effective to use on subsequent datasets, and much faster when compared to a manual process.

Because Engine is a local application, you have complete control over your data and your model. Both the model and your data stay with you. The model is yours and is not shared with anyone else, which means you can use your data strategically by training it to identify unique items that your competitors may not have data for.

Because the model is computer based, the results are consistent, and Engine will provide you with a statistical estimation of its effectiveness at identifying the objects – something difficult (if not impossible) to do with a manual process.

Do I wish I had this technology 20 years ago when I was cataloging all those photographs for my mother?

I sure do!

What I find even more amazing is that this technology is only a stepping stone towards what is yet to come with automatic analysis of geospatial datasets. In the very near future, using the combination of imagery and point cloud datasets, we plan to deliver more value by aggregation of results of different datasets to improve the accuracy of detection and to automate workflows even further.

We developed an intensive, hands-on workshop to introduce you to deep learning and how it can be applied to geospatial data like imagery. To learn more, please attend our workshop.

Workshop Information

Additionally, if you would like more information on how Engine can help identify items you need to find in your data, contact us at sales@solv3d.com.

Automated Site Plan from Point Cloud

Kevin Miller · ·

Recently we have been experimenting with the creation of a top-down floorplan view from point cloud data. In our latest version of SOLV3D engine™ (Engine), we have released a tool called “Generate Floorplan”. Through the examination of point density across a sparse matrix, this tool maps the results into a transparent georeferenced tiff file ready for consumption in SOLV3D encompass™ (Encompass), or other software like Esri ArcGIS. This is extremely useful in remote or construction sites where the user would like a highly accurate and quick to produce floor plan.

In this example, which uses terrestrial scanner data, a user can clearly see the layout of the indoor facility. The tanks, wall edges, catwalk and other features are clearly visible.

Pushing this a bit further, by changing different parameters, the user can fine tune the results to make some things clearer to see.

Moving to an outdoor mobile mapping project, we can see that the building edges are greatly enhanced. By removing the ground classification and lowering the percentile, nearby features such as trees and poles become more visible.

Use cases for this tool could include:

Viewing topography changes.
Checking positional correctness of building walls or foundations as well as any changes when compared to design.
Visibly see extent of LiDAR data on a computer that may not have the tools in place to view.

Future enhancements may include:

Extraction of 2D linework of captured building edges or breaklines.
Comparing the generated image to UAV/orthophotos for disaster analysis.
Pair the generated image with Engine’s object identification algorithms to automatically identify trees or other unique objects picked up within the dataset.

The latest release of Engine (3.0.5) also includes some enhanced blurring of vehicles and people within images, a change detection function showing spatial differences between two-point clouds, as well as several cleanup utilities for converting/scaling different attributes of extra information within point cloud data.

To use this feature, or any other of our 70+ algorithms, please contact us to access a free 7-day trial of Engine.

PDF Copy

Sharing Geospatial Data Effectively

Kevin Miller · ·

Whether apparent or not, almost every data set and application today have a geographic component built in. It could be as simple as an X,Y location of your computer mouse within a web-application, the monitoring of assets and their locations, construction progress over time including how components fit together, or perhaps a more advanced self-driving car that needs to make sure it isn’t running into that parent pushing their baby across the street. All of these examples include some sort of geospatial information, a standard that identifies the data having an implicit or explicit association with a location relative to Earth.

Over the past several years, the amount of spatial data available has exploded. Before the internet, thirty years ago, no one really had any detailed sense of where things were located except surveyors and geographers. It was extremely costly, and in many cases impossible, to track your assets based on location. Geographic dataset coverage was a challenge which was solved when internet services started becoming available in remote areas and not just in major centers. Within the last 10-15 years, we have seen systems start talking to each other, allowing users to bring increased information together, including location. Device coverage was also an issue which we are now solving as systems are getting smaller and less costly, allowing more and more “things” to record and provide all sorts of data. The common term for this development and architecture is known as the Internet of Things (IoT). Along with this, there are now even some companies working on building microchips that can essentially capture 3D points (like LiDAR) – small enough to fit within a mobile phone! There remains the question of how should a company manage and share all of this information from all of these devices to the rest of their team/stakeholders?

Our team at SOLV3D, deals with extremely large and complex geospatial datasets.
Most of the time, our clients’ data lives on computer servers, or hard drives somewhere and is utilized by one person or a small team of people (usually within the same organization), preventing this typically expensive data from being shared and used effectively. We are looking to change this and provide a method for sharing the information effectively over the internet.

Regardless of the sharing method of choice, many of the organizations we work with aren’t in the day-to-day business of developing software or managing big IT software projects. This means they simply do not understand the architecture behind hosting and sharing data of this magnitude.

After speaking with dozens of individuals at companies throughout North America
and Europe, it seems there some consistent thoughts in regards to the needs of users:

They typically do not need to access all of the project data sets on a daily basis.
Rather, they just need to access sub-sets or single tile/polygon of an area. They do not want to wait in order to see the latest data in an area. They want to be able to see it right away.
They want to compare versions of data for the same area to see how the area has changed over time.

To meet these needs, the data needs to be:

On the internet, accessible from a web browser and no special software needed.
Organized, or merged, in a way that a user can use a map and easily find out their data coverage.
Organized in a chronological fashion, and accessible with associated metadata that lets a user know what they are looking at, when the data was recorded, and allows the user to easily compare datasets of various vintages.
Easily accessible, to allow analysis on the data such as measurements or comparisons. Downloadable for use in other off-line programs.
Provided in the format of a known common file format and properly georeferenced.
Accessible by as many users as needed, and not restricted to those with technical expertise.
While the hardware and skill sets of data collectors have evolved rapidly over the past number of years, to the point where they can collect data with millimetre accuracy and properly geo-reference it even if it is several kilometres under the ground, they often revert back to archaic practices when they attempt to deliver and disseminate this data. Typically, their data ends up on a USB hard-drive and mailed across the country to their client! Let’s not get into the problems that can arise when a hard-drive is mailed across the country, but trust me, you’re definitely taking a risk doing so.

We also see most data collectors employing different workflows for each hardware type they employ. This results in datasets that are all over the place in terms of consistency, especially when it comes to combining this data with other datasets. All of that hardwon data accuracy is for nothing if the data is lost in transit, or if the data is not consistent from dataset to dataset.

By sharing data over the internet, data collectors can dramatically reduce the risk of losing their data when they send it to their clients. This method also usually means that their clients can access their data immediately, and not several days later.

Sharing geospatial data over the Internet does come with its set of challenges however:

Too much data can slow things down. Data typically needs to be formatted, indexed or split up to allow for quick access.
Data needs to be in a common format. To be accessible by many people, the data needs to be organized and presented in a format that is easily consumable – i.e. not everyone has a high-end computer running some CAD software.
This all boils down to the user experience on how to visualize and interact with the data.
The data needs to be accessible. Systems should be in place to allow people to collaborate/comment/work with other team members without too much hassle
The data need to be secure, protected and only accessible to the people who have permissions to access it. The data needs to be available for users when it is needed. You can’t have any downtime, or have the data slow to access.

As an organization, you may choose to take on these challenges yourself, however, this typically involves forming an entire professional development team. Instead, I recommend looking at companies or products that specialize in dealing with and managing these types and magnitude of data. By doing so, as they expand their product offerings to solve use-cases for their users, they can also take advantage of these features.

Companies that specialize in creating professional software, have the team and expertise to help you effectively manage your data. They are the ones that can help you deal with different types of data, along with making it easier for you and your clients to collaborate.

PDF Copy

Metadata and the quest for the Holy Grail

Kevin Miller · ·

Metadata. When spoken of in the geospatial context, we often think of GIS data and the information which describes it. Having attributes such as the date of creation, method of digitization, and even the color prescribed for a feature, has become an integral part of the way our industry does business.

What about similar information related to remote sensing data?

It is infrequent that field collection is performed on a project and an end product not created. Whether it is a topographic plan, an engineering model or simply a data dump onto a hard drive, users need to know and have access to, some of the same information.

When was the data collected? Which type of sensor was used? What was the temperature during collection? As unimportant as it may seem, these types of information may have a meaningful impact on the downstream products and without the answers, the resulting decisions are easily questioned.

Professional surveyors have traditionally captured this information on the cover page of field notes. Things such as the collection date, the weather, the instrument serial number and who performed the work are standard requirements. These sheets are rarely sent to clients, instead they receive plans which capture this information in the title block. The original field notes, a legal document, are stored for eternity by the surveyors.

With so much remotely sensed data now being collected outside traditional survey firms, it is hard to know how many of these practices remain. How is someone supposed to know what projection a point cloud file is in? What was the instrument height of the 360° camera? Although this information may not have the same legal implications as legal boundaries, it is no less important.

With fewer and fewer physical deliverables in the form of plans with title blocks, this information seems lost, or if not lost, at least hidden. From proprietary projection files, to overly complex folder structures, the “field collection metadata” is often there, you just need to dig for it, and it helps if you know exactly what you’re looking for.

Why not make this easier for everyone?
Is it really necessary to keep this information concealed?

The use of something as simple as a project deliverables sheet can prove invaluable when delivering raw data files and ensures the intent of the data remains with it. Without this information, data can easily be used for the wrong purpose. Examples of this include using full earth for DEM creation, procuring sparsely posted point cloud information for engineering grade modeling and employing outdated feature location information for as-built tie-in.

To assist with this handover of information, we have developed a simple geospatial Data Sheet which can be used for these submissions. We hope that this will alleviate many of the struggles we see with finding metadata associated with the transfer of geospatial data and reduce search times.

Although some enjoy the search for information which may, or may not, even be
present, not everyone wants to undertake the adventure of an expedition with it’s
undoubted trials and tribulations to find the “field collection metadata”, or as I have
started to call this, the data Holy Grail.

PDF Copy