Dr. Adrian Ulges

 
  Contact     Short CV     Research     Demos     Teaching and Talks     Patents     Publications     Activities     Honors     Other Links  
 

I am a senior researcher with the Multimedia Analysis and Data Mining (MADM) Group at the German Research Center for Artificial Intelligence (DFKI) GmbH based in Kaiserslautern/Germany. My major research interests are in image and video processing, pattern recognition, multimedia forensics, and machine learning.

 
   Contact

Dr. Adrian Ulges
Multimedia Analysis and Data Mining (MADM) Group
DFKI GmbH
Trippstadter Str. 122
D-67663 Kaiserslautern/Germany
phone: ++49 (631) 20575-4190
e-mail: adrian.ulges at dfki dot de


   Short CV

I received a diploma degree in computer science (with honors) in 2005, and a PhD degree in computer science in 2009, both from the University of Kaiserslautern, Germany. Since then, I have been working with the German Research Center for Artificical Intelligence (DFKI) GmbH, where I am currently a senior researcher with the Multimedia Analysis and Data Mining (MADM) Group. I have also worked with Google Inc in 2005 (internship in Mountain View, California) and 2011 (guest scientist in Zurich, Switzerland).
My research interests are in image and video analysis, multimedia forensics, and machine learning. I have developed visual recognition systems that learn autonomously from web-based image and video content (project MOONVID) and image and video analysis technology for fighting child sexual abuse (projects iCOP, FIVES, and INBEKI). I have published over 30 scientific papers and am also active as a reviewer and committee member for several international journals, conferences, and workshops. Finally, I also have a strong background in teaching, including the lecture Multimedia Information Retrieval, 13 student theses (bachelor's, master's, diploma), and several guest lectures, lab courses, tutorials, and seminars.

   Research

   Visual Learning from the Web


Web-based portals like YouTube and Flickr offer huge amounts of images and videos. In the project MOONVID, we develop visual recognition approaches that autonomously learn from such web content, constructing visual models for semantic concepts, like objects ("Eiffel Tower"), activities ("soccer"), or locations ("desert"). These concepts can then be automatically detected in image and video databases, which opens up exciting applications such as semantic search, recommendation, content filtering, and a content-based targeting of ads. Our research has been awarded with a Google Research Award and a McKinsey Business Technology Award for Damian Borth.
[MOONVID project website] [demo]

   Multimedia Analysis for Fighting Child Porn


Parallel to the rapid overall growth of multimedia collections, the spread of child ponography increases at alarming rates. To support police forces with the detection of child pornographic images and videos in (frequently several hundred thousands of) questioned items, we develop content analysis technology in the projects iCOP, FIVES, and INBEKI. Our approaches includes automatic porn detection and child porn detection as well as similarity search, object recognition, and intelligent video summarization.
[iCOP project website] [demo]

   Motion Segmentation and Object Recognition


Another interest of mine is recognition that makes use of motion aspects in video data. For example, motion allows to segment moving objects from their background, based on which incorrect correspondences due to clutter can be discarded and the robustness and generalization of recognition methods can be improved significantly. This work has also been conducted in the project MOONVID.
[MOONVID project website]

   Camera-based Document Capture

    
Digital cameras offer a cheap, quick, or even ubiquitious alternative for capturing documents. However, information extraction from the resulting images is hampered by low resolution, noise, and distorsion. For curled pages of books, we perform a dewarping to a flat, upright representation, using a (pseudo-)depth model of the document surface. This representation is fed to a special optical character recognition (OCR) system for noisy, warped, and low-resolution text called DIVER. In combination, this technology allows you to "google" your camera-captured document snapshots. This work was done as part of the project IPeT.
[DIVER] [dewarping demo]



   Demos

Some web demos we created. For a full list of demos from our lab, please refer to madm.dfki.de.


TubeTagger is a concept-based video retrieval system that automatically learns from YouTube. The demo allows you to search a video collection using keywords, whereas all videos have been indexed completely automatically (no manual tagging was done). Smart Video Buddy is a video recommender based on computer vision technology. Semantic concepts (like "soccer game") are detected as the user is watching, and this information is used to "make videos smarter", i.e. to enrich them with adapted news, advertisements, or interesting links. Smart Video Buddy was presented at CeBIT 2010 and is the Silver Award winner at the EuroITV'10 Competition Grand Challenge. TubeFiler is an automatic genre categorizer for YouTube videos. Given a tagged YouTube clip, the system automatically sorts it into a hierarchy of genres (such as "travel->skiing"). These categories are refined further based on a visual clustering. We participated with TubeFiler in the ACM Multimedia Grand Challenge 2009.

Navidgator allows a structural browsing of image/video databases based on visual appearance. You can put certain images in your current focus of interest, zoom into the image collection, or pan over it just like a map. The demo can be run for user photographs or video frames.

This page dewarping demo illustrates how camera-captured document images can be prepared for optical character recognition. For curved book surfaces, a dewarping is applied which generates an upright, straightened representation of the page.


   Teaching and Talks

   Lectures

Lecture Multimedia Information Retrieval, winter term 2011/12.

Lecture Multimedia Information Retrieval, summer term 2010.

   Guest Lectures and Other Talks

   Student Theses I Supervised

For more information on all theses (including pdfs), please refer to madm.dfki.de.

   Other Teaching (Seminars, Lab Courses, Tutorials)

Some more teaching I did at the University of Kaiserslautern during my undergrad and PhD time



   Patents

US Patent (PCT/US2007072578) Recognizing Text in Images (work during 2005 internship patented by Google).



   Publications

For more information on all publications (including pdfs), please refer to madm.dfki.de.

   PhD Thesis

   2012

   2011

   2010

   2009

   2008

   2007

   2006

   2005

   2004



   Activities

   Committee Memberships

   Peer Reviewing



   Honors



   Other Links

Handball @ TV Bad Ems

Summer School SSIP 05 @ Szeged / Hungary

Rebecca (waving her engagement ring) and me

My dog

Some web videos I like:
[M. Welsch: An anthropological Introduction to YouTube] [Rainald Grebe] [i-rack] [Doppelter Windsor] [Und wer bist duuhuhu] [Hinton: Deep Learning] [Nessun Dorma] [Fergus: Small Codes...] [Matt Mays: Terminal Romance] [Gimme Hope Joachim] [Ridley: When Ideas have Sex] [Photosynth] [Hay: Waiting for my Real Life to Begin] [must see - Inception] [The "Life in a Day" Project] [Human Mirror]

My favorite web comic: Order of the Stick

Last update: 21/10/2010