Enabling Scalable Smart-Building Analytics

Arka Bhattacharya

EECS Department
University of California, Berkeley
Technical Report No. UCB/EECS-2016-201
December 15, 2016

http://www2.eecs.berkeley.edu/Pubs/TechRpts/2016/EECS-2016-201.pdf

Modern buildings are being integrated with myriad (often >1000s) networked sensors to improve convenience, occupant comfort accessibility and energy-efficient operations. These technological improvements hold the promise of significant advances in centralized operation and management, fault diagnosis, and integration to an emerging smart grid. As of 2012, 14% of the buildings in the U.S. deployed Building Management Systems (BMS) to provide some kind of programmatic interface to the the sensors, actuators, and historical data management. Innovations in "Internet of Things" (IoT) devices have further led to connected lights, power meters, occupancy sensors and appliances that are capable of interfacing with the underlying BMS systems used in building automation. New buildings are installed with a BMS by design, and older buildings are being continuously retrofitted with networked systems for improved efficiency.

However, whether provided by novel sensor networks or legacy instrumentation, extracting meaningful information from sensor data and taking actions based on that data depends fundamentally on the metadata available to interpret it. There are more than 5 million commercial buildings in the US, with the sensors in each building set up with customized and obscure metadata. One cannot achieve scalable deployments of software analytics and applications across buildings if deploying them requires vendors and domain experts spending spending 100s of hours fixing each building. Today even well-established applications do not get deployed at scale because of this very reason. Thus, the major challenge is scalability , i.e a paradigm where an application can be written once and deployed on 100s or 1000s of buildings.

This thesis evaluates the challenges with existing metadata of sensors in smart-buildings and proposes ways to normalize it to uniform standard that would allow scalable (write-once and deploy everywhere) application development. We develop three empirical criteria for successful metadata schemas --- (a) completeness, or the ability to capture all sensors, (b) ability to capture all relationships between sensors required by state-of-the-art applications, and (c) flexibility in incorporating novel sensors and applications, and usability. We empirically demonstrate that no existing smart-building sensor metadata schema satisfy these properties and develop a schema based on an underlying graphical data model, Brick, that does. We validate Brick across 6 large and diverse commercial buildings (comprising more than 17,000 sensors) in two different continents and set up by different BMS vendors.

We also develop a human-in-the-loop synthesis technique which uses syntactic and data-driven steps to parse legacy metadata into a common schema. This technique allows building-experts, who might not be conversant with sophisticated regular expression programs, to parse more than 70% of the legacy metadata in a building to a common schema by providing example parses of only about 1% of the sensors. We also show how to use active perturbations of subsystems in a building to construct functional relationships between subsystems that may have been missing or incorrectly captured in legacy metadata with an accuracy of ~80%. Finally, we demonstrate the power of our normalized smart-building metadata schema paradigm by using the standardized sensor and relationship representations to implement both simple (e.g. finding errant zones, identifying inefficient air handling subsystems to save energy) and sophisticated applications (e.g finding rough occupancy estimates) that scale across real-world smart-buildings.

Advisor: David E. Culler


BibTeX citation:

@phdthesis{Bhattacharya:EECS-2016-201,
    Author = {Bhattacharya, Arka},
    Title = {Enabling Scalable Smart-Building Analytics},
    School = {EECS Department, University of California, Berkeley},
    Year = {2016},
    Month = {Dec},
    URL = {http://www2.eecs.berkeley.edu/Pubs/TechRpts/2016/EECS-2016-201.html},
    Number = {UCB/EECS-2016-201},
    Abstract = {
Modern buildings are being integrated with myriad (often >1000s) networked sensors to improve convenience, occupant comfort accessibility and energy-efficient operations. These technological improvements hold the promise of significant advances in centralized operation and management, fault diagnosis, and integration to an emerging smart grid. 
As of 2012, 14% of the buildings in the U.S. deployed Building Management Systems (BMS) to provide some kind of programmatic interface to the the sensors, actuators, and historical data management. Innovations in "Internet of Things" (IoT) devices have further led to connected lights, power meters, occupancy sensors and appliances that are capable of interfacing with the underlying BMS systems used in building automation.  New buildings are installed with a BMS by design, and older buildings are being continuously retrofitted with networked systems for improved efficiency.

However, whether provided by novel sensor networks or legacy instrumentation, extracting meaningful information from sensor data and taking actions based on that data depends fundamentally on the metadata available to interpret it. There are more than 5 million commercial buildings in the US, with the sensors in each building set up with customized and obscure metadata. One cannot achieve scalable deployments of software analytics and applications across buildings if deploying them requires vendors and domain experts spending spending 100s of hours fixing each building. Today even well-established applications do not get deployed at scale because of this very reason. Thus, the major challenge is scalability , i.e a paradigm where an application can be written once and deployed on 100s or 1000s of buildings. 

This thesis evaluates the challenges with existing metadata of sensors in smart-buildings and proposes ways to normalize it to uniform standard that would allow scalable (write-once and deploy everywhere) application development. We develop three empirical criteria for successful metadata schemas --- (a) completeness, or the ability to capture all sensors, (b) ability to capture all relationships between sensors required by state-of-the-art applications, and (c) flexibility in incorporating novel sensors and applications, and usability. We empirically demonstrate that no existing smart-building sensor metadata schema satisfy these properties and develop a schema based on an underlying graphical data model, Brick, that does. We validate Brick across 6 large and diverse commercial buildings (comprising more than 17,000 sensors) in two different continents and set up by different BMS vendors. 

We also develop a human-in-the-loop synthesis technique which uses syntactic and data-driven steps to parse legacy metadata into a common schema. This technique allows building-experts, who might not be conversant with sophisticated regular expression programs, to parse more than 70% of the legacy metadata in a building to a common schema by providing example parses of only about 1% of the sensors. We also show how to use active perturbations of subsystems in a building to construct functional relationships between subsystems that may have been missing or incorrectly captured in legacy metadata with an accuracy of ~80%. Finally, we demonstrate the power of our normalized smart-building metadata schema paradigm by using the standardized sensor and relationship representations to implement both simple (e.g. finding errant zones, identifying inefficient air handling subsystems to save energy) and sophisticated applications (e.g finding rough occupancy estimates) that scale across real-world smart-buildings.}
}

EndNote citation:

%0 Thesis
%A Bhattacharya, Arka
%T Enabling Scalable Smart-Building Analytics
%I EECS Department, University of California, Berkeley
%D 2016
%8 December 15
%@ UCB/EECS-2016-201
%U http://www2.eecs.berkeley.edu/Pubs/TechRpts/2016/EECS-2016-201.html
%F Bhattacharya:EECS-2016-201