Working with small data that you dare to share
AW1.120 | Day 2 | 16:00 - 16:30 | Speakers: Ulrika Vincent, Mikael Kullberg
Abstract
How to work with toxic data? In our project we work with DNS query streams, which contain a lot of data that may expose single users and their browsing behaviour.
This talk covers how we have built a large scale statistics platform while preserving the user’s privacy and still being able to find important observations. We cover which algorithms and methods we use to gather the data in a cloud platform and run advanced analytics without touching individual user data. We share how to go from big data sets to small aggregated and minimised sets.
We believe the approach of "small data" is applicable to any field where you want to use and share sensitive data. We also invite the audience to audit our work and help build a privacy-first internet statistics platform as one good example.
Speakers
Ulrika is one of the core analysts in the DNS TAPIR project and a consultant since many years in Agical AB. During the years she has been working with development, product management, agile coaching, training and now data analytics. She lives in Stockholm, Sweden and has a strong passion for Internet privacy. Ulrika is a member of the board of DFRI, a Swedish non-profit organisation focused on privacy and integrity on the Internet.
Mikael is one of the founders of the DNS TAPIR project and the core architect with a focus on the data analytics platform. Mikael has years of experience in digging in large sets of DNS query data for the dark side. Today he is a strong proponent for personal privacy on the Internet, working hard to establish DNS query analytics based on open data and Open Source platforms. Mostly because of the challenge, but also to create transparent systems that help individuals rather than selling their souls to commercial actors with dubious motives. Mikael lives in Stockholm with his family, and a dog who also guards its privacy carefully.
Links
External Links
Notice: The placeholder video image is licensed under CC BY-SA 4.0. The original image can be found hereChanges made to the image are: Cropped the image to a new ratio, part of the image was cut off.
