Future Cities Hackathon London
Future Cities Hackathon: Open Urban Data for The Citizens - Saturday, October 5, 2013 9:00 AM - The Hub - 80 Haymarket
Organisers / Catapult
- "Dog fouling, parking violations, anti social behaviour - more interesting than ultra smart sensors???"
- Open data...
- is available raw
- in modifiable form
- is redistributable
- can be re-used
- anyone can access,
- is free
- "We have to be aware that the data scientists at the hack day can analyse the info freely to come up with patterns, scenarios or conclusions outside of our control"
- Future Cities Catapult is funded by Technology Strategy Board - organiser of this event
- Catapult's main role sounds like it is neutral, matching projects to funding and grants, finding spaces, cleaning up data, making datasets available, etc
Paul Egan from Neul
- NEUL - paul egan - "do machines dream of their own network?" - machine-2-machine communication, frictionless IoT / NaaS Network as a Service / Sensor deployment
- in 2013 the OeD has added IoT into the dictionary.
- Internet of Things: "A proposed development of the internet in which everyday objects have network connectivity, allowing them to send and receive data."
- is this to be the rise of the machines? but machines are different to humans and in some cases very different. machines communicate differently to humans.
- first of all there will be more of them than us.
- most will communicate on a scheduled basis, periodically
- they wont have much to say... they might only speak once a week or something, whatever they've been programmed to do
- GE predict that the 3rd industrial revolution - see "industrial internet: pushing the boundaries of minds and machines / peter evans and marco annunziata"
- 1st industrial revolution = mechanisation and economics of scale
- 2nd industrial revolution = electronic comms / www
- 3rd industrial revolution - the industrial internet enabled at scale by IoT - an opportunity for UK plc?
- Possible smart city applications:
- adaptive lighting
- smart waste
- pest control
- hygiene management
- transport and traffic management
- smart grid
- asset tracking
- water is the next oil, hygiene is the next big thing
- "the industrial revolution has already started, only the scale and timing is being debated. the internet of things is real and is no longer hype, this will change every human's life, and is one of the few truly global opportunities of economic growth"
Yodit Stanton from Opensensors.io
- aim of opensensors.io to make device data as easy to publish instantly like a blog in real time
- data is property and should be accessible to us all
- but there is also communal, some private
- publishers of the data choose the data license
- MQTT - MQTT is a machine-to-machine (M2M)/"Internet of Things" connectivity protocol. It was designed as an extremely lightweight publish/subscribe messaging transport. It is useful for connections with remote locations where a small code footprint is required and/or network bandwidth is at a premium.
- http://nodered.org/ (mqtt + xively + nodered)
Tez Deep Dive with Geolocation Data, Chris Harris
- Example with Truck data / Road Trip data
- what is Hadoop?
- i want to store large amt of data
- HDFS (Hadoop File System) Storage Framework > MapReduce (Computational Framework
- Hadoop MapReduce Tutorial: http://hadoop.apache.org/docs/stable/mapred_tutorial.html
- Hadoop 1 and Hadoop 2 - Hadoop is a free, Java-based programming framework that supports the processing of large data sets in a distributed computing environment. It is part of the Apache project sponsored by the Apache Software Foundation. Hadoop makes it possible to run applications on systems with thousands of nodes involving thousands of terabytes. Its distributed file system facilitates rapid data transfer rates among nodes and allows the system to continue operating uninterrupted in case of a node failure. This approach lowers the risk of catastrophic system failure, even if a significant number of nodes become inoperative. Hadoop was inspired by Google's MapReduce, a software framework in which an application is broken down into numerous small parts. Any of these parts (also called fragments or blocks) can be run on any node in the cluster
- Tez - "Empowering end users" - expressive dataflow definitions apis - to enable definition of complex data flow pipelines using simple graph connections. tez expands the logical plan at run time. targeted towards data processing apps like hive/pig.
- The aim of the model is to have a library of inputs, outputs and processors that can be programmatically composed to generate useful tasks.
- Sandbox: http://hortonworks.com/sandbox
- Aside: I googled Hadoop for an image that might illustrate its architecture and it was like AAaaaaaaaaa
- And in case you're wondering, Doug Cutting, Hadoop's creator, named the framework after his child's stuffed toy elephant.
- What is Hadoop? - http://strata.oreilly.com/2011/01/what-is-hadoop.html
- name of team
- brief desc solution
- send code + links + github
- sunday 1pm
- http://figshare.com/ - how to share academic data without losing credit for it