Education

Hasso Plattner Institute, Germany 03/2010 - 11/2013

I have finished my PhD study at Hasso Plattner Institut, University Potsdam in 2013. My PhD thesis "Automatic Video Indexing and Retrieval Using Video OCR Technology" achieved the highest scores of a PhD thesis in Germany "summa cum laude" (<5%).
My major research interests are:

  • Multimedia processing
  • Computer vision
  • Machine learning
  • Deep learning, deep neural networks
  • Video indexing and trieval
  • Video OCR (Optical Character Recognition)
  • Content-based Video Search
  • Real-time computer vision applications

Technische Universität Ilmenau, Germany 10/2002 - 01/2008

I have got my Dipl.-Ing degree (Diplom Studium) at Technical University Ilmenau. My college major is digital-media technology.

  • I have written my Diploma Thesis "music visualization using graphic card technology" at Fraunhofer Institut für Digital Medien Technologie (IDMT)

Course 2000 - 2001

  • Preparatory Course in Ilmenau, Germany 05/2001 - 03/2002
  • German Course, in Peking 09/2000 - 03/2001

School, in Zheng Zhou, China09/1988 - 07/2000

Technical Expertise

Major Programming Languages C#, Java, C++, C, Python
Web Programming HTML, CSS, XML, MS-Silverlight, JSP, PHP, ASP.Net, XSLT, VRML, SVG, SMIL, Javascript, AJAX, WCF-Web Service etc.
IDE MS-Visual Studio, Eclipse, NetBeans, Adobe Premiere, Maya
Graphical Programming OpenGL
Audio Programming Java-Sound API, Base.Net
Image Processing OpenCV
Programming Libraries and Development Software Git, TortoiseSVN, .Net Framework 3.5, CCTrees, DevExpress, DotnetBar, GoDragram, TXTextControl, TreeGX, FlexCel, Boorst Lib, NHibernat, Ocropus, Tesseract, CMU Sphinx
Operation Systems Windows, Linux, Mac OS X
Datenbase MS-SQL Server, MySQL, PostgreSQL
Documentation Latex, MS-Office, iWork

Work Experiences

My startup company

SemaMediaData is alreay online! It is focusing on multimedia analysis technologies, automatic metadata creation for videos and images. Video OCR, video segmentation, image OCR, lecture video analysis, image concept detection etc.

Current Projects

Senior Research Fellow and team lead, Hasso Plattner Institute 11/2013 -

Research in multimedia retrieval (MIR) with the focus on deep learning technologies:
  • Multimedia processing
  • Computer vision
  • Machine learning
  • Deep learning, deep neural networks
  • Video indexing and trieval
  • Video OCR (Optical Character Recognition)
  • Content-based Video Search
  • Real-time computer vision applications

tele-TASK - tele-TASK (tele-Teaching Anywhere Solution Kit) is an advanced mobile system for the production of Internet streaming videos and podcasts featuring a new and drastically simplified technology.

Previous Projects

Research Assistant/PhD Student, Hasso Plattner Institute 03/2010 - 11/2013

My work focus on multimedia analysis technology, video search, semantic information extraction, semantic web technology etc.

  • Research and development of a video OCR software-framework
  • Research and development of a video ASR software-framework
  • Development and integration of video analysis framework into web lecture video portal www.tele-task.de
  • Research and development of a context based framework for automated video analysis
  • Teaching assistance for related lectures and seminars.

MEDIAGLOBE - The digital archive is part of the THESEUS research program initiated by the German Federal Ministry of Economy and Technology (BMWi). MEDIAGLOBE deals with digitization, analysis, and semantic retrieval of historical, documentary audiovisual content.

Semantic Media Explorer (SEMEX) - The Semantic Media Explorer is a demonstrator that combines the latest media analysis processes to provides optimal access to video content.

Software engineer, company Andagon GmbH09/2008 - 03/2010

Developement of QA-software aqua
  • development environment .Net 2.0, .Net 3.5
  • Windows form developement in C#
  • software structure construction and implementation
  • Microsoft SQL server, object-relational mapping solution NHibernet
  • Using development software CCTrees, TortoiseSVN and other programming libraries
  • Writing unit test program

Web developer, company db-Central GmbH03/2008 - 06/2008

Programming in PHP5, Javascript, CSS, MySQL database

Diploma Thesis at Fraunhofer IDMT06/2007 - 01/2008

Thesis Title "Visualisierung von Musikdaten mit Hilfe moderner Grafikprozessoren" ("music visualization using graphic card technology")

  • Research on Graphic-Card technology
  • Research on theory and applications in music-visualization domain
  • Implemenation of a software library for music-visualization (Programming in OpenGL)
  • Migration of visualization software in music-analysis framework

Internship at Volkswagen AG11/2006 - 04/2007

  • Implementation of XPath Application in Adobe Flash
  • Research on SCROM
  • Migration of Online-Productiontraining system in E-Learning-Standard SCROM (Sharable Content Object Reference Model)
  • Maintained the VW Intranet

Publications

2016

  • Haojin Yang, Cheng Wang, Christian Bartz, Christoph Meinel "SceneTextReg: A Real-Time Video OCR System", ACM international conference on Multimedia (ACM MM 2016), system demonstration, 15-19 October 2016, Amsterdam, The Netherlands

  • Cheng Wang, Haojin Yang, Christian Bartz, Christoph Meinel "Image Captioning with Deep Bidirectional LSTMs", ACM international conference on Multimedia (ACM MM 2016), 15-19 October 2016, Amsterdam, The Netherlands

  • Sheng Luo, Haojin Yang, Cheng Wang, Xiaoyin Che and Christoph Meinel, "Real-time action recognition in surveillance videos using ConvNets", in the 23rd International Conference on Neural Information Processing (ICONIP 2016), in Kyoto (Japan), 16th-21th of October 2016

  • Sheng Luo, Haojin Yang, Cheng Wang, Xiaoyin Che, and Christoph Meinel, "Action Recognition in Surveillance Video Using ConvNets and Motion History Image", International Conference on Artificial Neural Networks (ICANN 2016), Barcelona Spain, 6th-9th of September 2016

  • Xiaoyin Che, Sheng Luo, Haojin Yang and Christoph Meinel, "Sentence Boundary Detection Based on Parallel Lexical and Acoustic Models", INTERSPEECH 2016, San Francisco, California, USA in September 8-12, 2016

  • Xiaoyin Che, Thomas Staubitz, Haojin Yang and Christoph Meinel, "Pre-Course Key Segment Analysis of Online Lecture Videos", 16th IEEE International Conference on Advancing Learning Technologies (ICALT-2016), Austin, Texas, USA, July 25-28, 2016

  • Cheng Wang, Haojin Yang and Christoph Meinel, "Exploring Multimodal Video Representation for Action Recognition", the annual International Joint Conference on Neural Networks (IJCNN 2016), Vancouver, Canada, July 24-29, 2016

  • Haojin Yang, "Real-Time Video OCR System", system demonstration at 41st IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2016), Show&Tell session, Shanghai China, 20-25 March 2016

  • Xiaoyin Che, Cheng Wang, Haojin Yang and Christoph Meinel, "Punctuation Prediction for Unsegmented Transcript Based on Word Vector", "the 10th International Conference on Language Resources and Evaluation (LREC 2016)", Portorož (Slovenia), 23-28 May 2016

  • Cheng Wang, Haojin Yang and Christoph Meinel, "A Deep Semantic Framework for Multimodal Representation Learning", International Journal of MULTIMEDIA TOOLS AND APPLICATIONS (MTAP), DOI: 10.1007/s11042-016-3380-8, online ISSN:1573-7721, Print ISSN:1380-7501, Special Issue: "Representation Learning for Multimedia Data Understanding", March 2016

2015

  • Cheng Wang, Haojin Yang, Xiaoyin Che and Christoph Meinel, "Concept-Based Multimodal Learning for Topic Generation", the 21st MultiMedia Modelling Conference (MMM2015), Sydney, Australia, Jan 5 to Jan 7, 2015

  • Sheng Luo, Haojin Yang and Christoph Meinel, "Reward-based Intermittent Reinforcement in Gamification for E-learning", 7th International Conference on Computer Supported Education (CSEDU), Lisbon, Portugal, Mai 23-25, 2015

  • Haojin Yang, Cheng Wang, Xiaoyin Che and Christoph Meinel. “An Improved System For Real-Time Scene Text Recognition”, ACM International Conference on Multimedia Retrieval (ICMR), Shanghai, June 23-26, 2015

  • Cheng Wang, Haojin Yang and Christoph Meinel, "Does Multilevel Semantic Representation Improve Text Categorization?", the 26th International Conference on Database and Expert Systems Applications (DEXA 2015), Valencia, Spain, September 1-4, 2015

  • Cheng Wang, Haojin Yang and Christoph Meinel, "Visual-Textual Late Semantic Fusion Using Deep Neural Network for Document Categorization", the 22nd International Conference on Neural Information Processing (ICONIP2015), Istanbul, Turkey, November 9-12, 2015

  • Cheng Wang, Haojin Yang, Christoph Meinel, "Deep Semantic Mapping for Cross-Modal Retrieval", the 27th IEEE International Conference on Tools with Artificial Intelligence (ICTAI 2015), Vietri sul Mare, Italy, November 9-11, 2015

  • Xiaoyin Che, Haojin Yang and Christoph Meinel, "Adaptive E-Lecture Video Outline Extraction Based on Slides Analysis", the 14th International Conference on Web-based Learning (ICWL 2015), Guangzhou, China, November 5-8, 2015

  • Xiaoyin Che, Haojin Yang and Christoph Meinel, "Table Detection from Slide Images", 7th Pacific Rim Symposium on Image and Video Technology (PSIVT2015), 23-27 November, 2015, Auckland, New Zealand

2014

  • Haojin Yang, Christoph Meinel, "Content Based Lecture Video Retrieval Using Speech and Video Text Information", IEEE TRANSACTIONS ON LEARNING TECHNOLOGIES (TLT), vol. 7, no. 2, pp. 142-154, April-June 2014, doi:10.1109/TLT.2014.2307305, online ISSN: 1939-1382, Publisher: IEEE Computer Society and IEEE Education Society

  • Xiaoyin Che, Haojin Yang, Christoph Meinel, "The Automated Generation and Further Application of Tree-Structure Outline for Lecture Videos with Synchronized Slides", International Journal of Technology and Educational Marketing (IJTEM), vol. 4, no.1, pp. 34-50, 2014, publisher: IGI Global

  • Bernhard Quehl, Haojin Yang and Harald Sack, "Improving text recognition by distinguishing scene and overlay text", the 7th International Conference on Machine Vision (ICMV 2014), Milan, Italy, November 19-21, 2014

2013

  • PhD thesis, Haojin Yang, "Automatic Video Indexing and Retrieval Using Video OCR Technology", 2013. ("summa cum laude")

  • Xiaoyin Che, Haojin Yang, Christoph Meinel, "Lecture Video Segmentation by Automatically Analyzing the Synchronized Slides", The 21st ACM International Conference on Multimedia, Grand Challenge: "Temporal Segmentation and Annotation Grand Challenge" October 21-25, 2013, Barcelona, Spain.

  • Xiaoyin Che, Haojin Yang, Christoph Meinel, "Tree-Structure Outline Generation for Lecture Videos with Synchronized Slides", The Second International Conference on E-Learning and E-Technologies in Education (ICEEE2013), 23-25th September 2013, Lodz Poland.

  • Haojin Yang, Franka Grünewald, Matthias Bauer, Christoph Meinel, "Lecture Video Browsing Using Multimodal Information Resources", 12th International Conference on Web-based Learning (ICWL 2013), 6 - 9th October 2013, Kenting, Taiwan. Springer lecture notes, 2013.

  • Franka Grünewald, Haojin Yang, Christoph Meinel, "Evaluating the Digital Manuscript Functionality - User Testing For Lecture Video Annotation Features", 12th International Conference on Web-based Learning (ICWL 2013), 6 - 9th October 2013, Kenting, Taiwan. Springer lecture notes, 2013.(best student paper award)

  • Franka Grünewald, Haojin Yang, Elnaz Mazandarani, Matthias Bauer and Christoph Meinel, "Next Generation Tele-Teaching: Latest Recording Tech- nology, User Engagement and Automatic Metadata Retrieval", International Conference on Human Factors in Computing and Informatics (southCHI), Lecture Notes in Computer Science (LNCS) Springer, 01–03 July, 2013 Maribor, Slovenia

2012

  • Haojin Yang, Bernhard Quehl and Harald Sack, "A Framework for Improved Video Text Detection and Recognition", Int. Journal of MULTIMEDIA TOOLS AND APPLICATIONS (MTAP), Print ISSN:1380-7501, online ISSN:1573-7721, Publicher: springer Netherlands, DOI: http://dx.doi.org/10.1007/s11042-012-1250-6.

  • Haojin Yang, Harald Sack, Christoph Meinel, "Lecture Video Indexing and Analysis Using Video OCR Technology", International Journal of Multimedia Processing and Technologies (JMPT), Volume: 2, Issue:4, pp. 176-196, Print ISSN: 0976-4127, Online ISSN: 0976-4135, Dec. 2011

  • Haojin Yang, Christoph Oehlke and Christoph Meinel, "An Automated Analysis and Indexing Framework for Lecture Video Portal", 11th International Conference on Web-based Learning (ICWL 2012), 2 - 4th September 2012, Sinaia, Romania. Springer lecture notes, Volume 7558, 2012. (best student paper award)

  • Haojin Yang, Bernhard Quehl, Harald Sack, "A skeleton based binarization approach for video text recognition", 13th International Workshop on Image analysis for multimedia interactive services (WIAMIS 2012, H-index: 13), 23rd - 25th May 2012, Dublin Ireland

  • C. Hentschel, J. Hercher, M. Knuth, J. Osterhoff, B. Quehl, H. Sack, N. Steinmetz, J. Waitelonis, H.Yang: "Open Up Cultural Heritage in Video Archives with Mediaglobe", 12th International Conference on Innovative Internet Community Services (I2CS 2012), June 13-15, 2012, Trondheim (Norway) (best paper award)

  • Haojin Yang, Franka Gruenewald and Christoph Meinel, "Automated extraction of lecture outlines from lecture videos: a hybrid solution for lecture video indexing", 4th Int. Conf. on Computer Supported Education (CSEDU), SciTePress, Porto, Portugal, April. 16-18

  • Haojin Yang, Bernhard Quehl and Harald Sack, "Text detection in video images using adaptive edge detection and stroke width verification" 19th Int. Conf. on Systems, Signals and Image Processing (IWSSIP), IEEE Press, Vienna, Austria, April. 11-13, 2012

2011

  • Haojin Yang, Maria Siebert, Patrick Lühne, Harald Sack and Christoph Meinel, "Automatic Lecture Video Indexing Using Video OCR Technology" IEEE Int. Symposium on Multimedia 2011 (ISM 2011), Dana Point, CA, USA, Dec. 5-7, 2011

  • Haojin Yang, Maria Siebert, Patrick Lühne, Harald Sack and Christoph Meinel, "Lecture Video Indexing and Analysis Using Video OCR Technology", 7th Int. Conf. on Signal Image Technology and Internet Based Systems (SITIS 2011), Track Internet Based Computing and Systems, Dijon (France), Nov.28 - Dec. 1, 2011

  • Haojin Yang, Christoph Oehlke and Christoph Meinel, "A Solution for German Speech Recognition for Analysis and Processing of Lecture Videos" 10th IEEE/ACIS International Conference on Computer and Information Science (ICIS 2011) , Sanya, Heinan Island, China, May 2011

Contact

Dr. Haojin Yang

To find my work place in google map