Fabian Hueske

Technische Universität Berlin
FG DIMA, Sekr. EN-7
Einsteinufer 17
10687 Berlin, Germany

fabian.hueske [at] tu-berlin.de

I am a Ph.D. student in the Database Systems and Information Management (DIMA) group at Technische Universität Berlin. My advisor is Volker Markl.
I am working on the Stratosphere research project, mainly focusing on (robust) optimization of parallel data flows.
In the summer of 2013, I was an intern in the DMX group at Microsoft Research, working with Bolin Ding and Surajit Chaudhuri.

I received a master's degree in computer science from Ulm University and wrote my master's thesis at SAP Research, Karlsruhe, in 2008. During my undergraduate studies at the University of Cooperative Education Stuttgart, I was employed full-time by IBM Germany and completed two internships at the IBM Almaden Research Center in 2005 and 2006.

My research interests include query optimization, robust optimization, query processing, and massively parallel data processing.
Further details are available in my CV.

Publications

  • Alexander Alexandrov, Rico Bergmann, Stephan Ewen, Johann-Christoph Freytag, Fabian Hueske, Arvid Heise, Odej Kao, Marcus Leich, Ulf Leser, Volker Markl, Felix Naumann, Mathias Peters, Astrid Rheinländer, Matthias J. Sax, Sebastian Schelter, Mareike Höger, Kostas Tzoumas, Daniel Warneke
    The Stratosphere Platform for Big Data Analytics
    VLDB Journal 2014, Paper: [Link]

  • Fabian Hueske, Volker Markl
    Optimization of Massively Parallel Data Flows
    in Large Scale Data Analytics, Springer, 2014, Editors: A. Gkoulalas-Divanis, A. Labbi, Chapter: [Link]

  • Fabian Hueske, Mathias Peters, Aljoscha Krettek, Matthias Ringwald, Kostas Tzoumas, Volker Markl, Johann-Christoph Freytag
    Peeking into the Optimization of Data Flow Programs with MapReduce-style UDFs (Demo)
    ICDE 2013, Paper: [PDF], Poster: [PDF], Video: [YouTube]

  • Fabian Hueske, Aljoscha Krettek, Kostas Tzoumas
    Enabling Operator Reordering in Data Flow Programs Through Static Code Analysis
    XLDI Workshop (2012), affiliated with ICFP, Paper: [PDF]

  • Fabian Hueske, Mathias Peters, Matthias J. Sax, Astrid Rheinländer, Rico Bergmann, Aljoscha Krettek, Kostas Tzoumas
    Opening the Black Boxes in Data Flow Optimization
    PVLDB 5(11): pp. 1256-1267, (2012), Paper: [PDF], Slides: [PDF]

  • Alexander Alexandrov, Stephan Ewen, Max Heimel, Fabian Hueske, Odej Kao, Volker Markl, Erik Nijkamp, Daniel Warneke
    MapReduce and PACT - Comparing Data Parallel Programming Models
    BTW 2011: pp. 25-44, Paper: [PDF]

  • Alexander Alexandrov, Dominic Battré, Stephan Ewen, Max Heimel, Fabian Hueske, Odej Kao, Volker Markl, Erik Nijkamp, Daniel Warneke
    Massively Parallel Data Analysis with PACTs on Nephele (Demo)
    PVLDB 3(2): pp. 1625-1628, (2010), Paper: [PDF], Poster: [PDF]

  • Dominic Battré, Stephan Ewen, Fabian Hueske, Odej Kao, Volker Markl, Daniel Warneke
    Nephele/PACTs: A Programming Model and Execution Framework for Web-Scale Analytical Processing
    SoCC 2010: pp. 119-130, Paper: [PDF]

  • Leonardo Weiss Ferreira Chaves, Erik Buchmann, Fabian Hueske, Klemens Böhm
    Towards Materialized View Selection for Distributed Databases
    EDBT 2009: pp. 1088-1099, Paper: [PDF]

  • Alexander Löser, Fabian Hueske, Volker Markl
    Situational Business Intelligence
    BIRTE 2008 (Informal Proceedings), Website: [Link]

  • Peter J. Haas, Fabian Hueske, Volker Markl
    Detecting Attribute Dependencies from Query Feedback
    VLDB 2007: pp. 830-841, Paper: [PDF]

My publications are also listed on DBLP [Link] and Google Scholar Citations [Link].

Talks, Posters, Demos, and Seminars

  • 2013/06: DB Seminar @ Microsoft Research, Redmond
    Talk: Opening the Black Boxes in Data Flow Optimization

  • 2012/08: Dagstuhl Seminar "Robust Query Processing", Website: [Link]
    Organizers: G. Graefe, W. Guy, G. Paulley

  • 2012/02: Berlin Apache Hadoop Get-Together, Berlin
    Talk: Large-Scale Data Analysis Beyond MapReduce, Slides: [PDF]

  • 2011/11: Google Developer Day 2011, Berlin
    Poster & Demo: Big Data Analytics Beyond MapReduce, Poster: [PDF]
    (together with Stephan Ewen and Kostas Tzoumas)

  • 2011/09: CloudDay 2011, Swedish Institute of Computer Science (SICS), Stockholm
    Talk: Large-Scale Data Analytics Beyond Map/Reduce, Slides: [PDF]

  • 2011/08: Dagstuhl Seminar "Information Management in the Cloud", Website: [Link]
    Organizers: A. Ailamaki, M. J. Carey, D. Kossmann, S. Loughran, V. Markl, R. Ramakrishnan
    Demo: The Stratosphere System
    (together with Astrid Rheinländer and Daniel Warneke)

  • 2010/06: Berlin Buzzwords 2010, Berlin
    Talk: Massively Parallel Analytics Beyond Map/Reduce, Slides: [PDF]

  • 2010/03: IBM Almaden Research Center, San Jose
    Talk: Massively Parallel Analytics Beyond Map/Reduce, Slides: [PDF]
    (Joint talk with Stephan Ewen and Daniel Warneke)

  • 2010/03: Yahoo Research, Santa Clara
    Talk: Massively Parallel Analytics Beyond Map/Reduce, Slides: [PDF]
    (Joint talk with Stephan Ewen and Daniel Warneke)

  • 2010/03: HP Research, Palo Alto
    Talk: Massively Parallel Analytics Beyond Map/Reduce, Slides: [PDF]
    (Joint talk with Stephan Ewen and Daniel Warneke)

Service

Committee Membership

  • Member of the Repeatability and Workability Committee for the
    SIGMOD 2010 Repeatability and Workability Evaluation, Website: [Link]

Journal Reviews

  • VLDB Journal: 2013

External Reviews

  • SIGMOD: 2008, 2009, 2011, 2012, 2013
  • VLDB: 2008, 2010, 2011
  • ICDE: 2011, 2013
  • EDBT: 2010
  • SoCC: 2012