International Workshop on Document Analysis Systems

June 9th-11th, 2010

Cambridge, MA, USA

List of Accepted Papers

Full Paper - Oral
Xujun Peng, Srirangaraj Setlur, Venu Govindaraju and Ramachandrula Sitaram. Overlapped Text Segmentation Using Markov Random Field and Aggregation
Andreas Fischer, Emanuel Indermühle, Horst Bunke, Gabriel Viehhauser and Michael Stolz. Ground Truth Creation for Handwriting Recognition in Historical Documents
Oleg Golubitsky, Vadim Mazalov and Stephen Watt. Towards Affine Recognition of Handwritten Mathematical Characters
Faisal Shafait and Ray Smith. Table Detection in Heterogeneous Documents
Marcus Liwicki, Seiichi Uchida, Masakazu Iwamura, Shinichiro Omachi and Koichi Kise. Data-Embedding Pen - Augmenting Ink Strokes with Meta-Information
Masakazu Iwamura, Tomohiko Tsuji and Koichi Kise. Memory-Based Recognition of Camera-Captured Characters
Xiaoli Zhang, Jie Zou, Daniel X. Le and George Thoma. Investigator Name Recognition from Medical Journal Articles: A Comparative Study of SVM and Structural SVM
Marçal Rusiñol and Josep Lladós. Efficient Logo Retrieval Through Hashing Shape Context Descriptors
Jayant Kumar, Wael Abd-Almageed, Le Kang and David Doermann. Handwritten Arabic Text Line Segmentation using Affinity Propagation
Mudit Agrawal and David Doermann. Context-Aware and Content-Based Dynamic Voronoi Page Segmentation
Reza Farrahi Moghaddam, Mohamed Cheriet, Mathias Adankon, Kostyantyn Filonenko and Robert Wisnovsky. IBN SINA: A database for research on processing and understanding of Arabic manuscripts images
Antonio Clavelli, Dimosthenis Karatzas and Josep Lladós. Assessment of Text Extraction Algorithms on Complex Colour Images
Oleg Golubitsky and Stephen Watt. Improved Classification through Runoff Elections
Pingping Xiu and Henry Baird. Analysis of Whole-Book Recognition
Sharad Seth, Ramana C. Jandhyala, Mukkai Krishnamoorthy and George Nagy. Analysis and Taxonomy of Column Header Categories for Web Tables
Thai V. Hoang and Salvatore Tabbone. Text Extraction From Graphical Document Images Using Sparse Representation
Su Bolan, Lu Shijian and Tan Chew Lim. Binarization of Historical Document Images Using the Local Maximum and Minimum
Emanuel Indermühle, Marcus Liwicki and Horst Bunke. IAM-OnDo-database: an Online Handwritten Document Database with Non-uniform Contents
Elisa H. Barney Smith. An analysis of binarization ground truthing
Sébastien Macé, Ernest Valveny, Hervé Locteau and Salvatore Tabbone. A System to Detect Rooms in Architectural Floor Plan Images
Daniel Lopresti, George Nagy and Elisa Barney Smith. Document Analysis Issues in Reading Optical Scan Ballots
Asif Shahab, Faisal Shafait, Thomas Kieninger and Andreas Dengel. An open approach towards the benchmarking of table structure recognition systems
Lanlan Chang, Jun Sun, Misako Suwa, Hiroaki Takebe and Yuan He. Occluded text restoration and recognition
Partha Roy, Umapada Pal and Josep Lladós. Query Driven Word Retrieval in Graphical Documents
Jin Chen, Huaigu Cao, Rohit Prasad, Anurag Bhardwaj and Premkumar Natarajan. Gabor Features for Offline Arabic Handwriting Recognition
Evgeniy Bart and Prateek Sarkar. Information extraction by finding repeated structure
Syed Saqib Bukhari, Mayce Ibrahim Ali, Faisal Shafait and Thomas Breuel. Document Image Segmentation using Discriminative Learning over Connected Components
Pramod Kompalli, C. V. Jawahar and R. Manmatha. Nearest Neighbor based Collection OCR

Full Paper - Poster
Koichi Kise, Megumi Chikano, Kazumasa Iwata, Masakazu Iwamura, Seiichi Uchida and Shinichiro Omachi. Expansion of Queries and Databases for Improving the Retrieval Accuracy of Document Portions
Marcus Liwicki, Hassan Mohamed Abou Eisha and Andreas Dengel. Improving Handwriting Recognition by the Use of Semantic Information
Albert Gordo, Alicia Fornés, Ernest Valveny and Josep Lladós. A Bag of Notes Approach to Writer Identification in Old Handwritten Music Scores
Farshad Nourbakhsh, Dimosthenis Karatzas and Ernest Valveny. A Polar-based Logo Representation based on Topological and Colour Features
Raman Jain and C. V. Jawahar. Towards More Effective Distance Functions for Word Image Matching
Karthika Mohan and C. V. Jawahar. A Post-Processing Scheme for Malayalam using Statistical Sub-character Language Models
Ying Li and Ching Y. Suen. Typeface Personality Traits and Their Design Characteristics
Joost van Beusekom, Faisal Shafait and Thomas M. Breuel. Document Inspection Using the Text-Line Alignment
Hervé Déjean and Jean-Luc Meunier. Methodological Considerations on the INEX Structure Extraction Competition
Sebastian Colutto. Introducing a New Image Dissimilarity Measure with an Application to Character Image Clustering in Degraded Historical Documents
Giorgos Vamvakas, Nikolaos Stamatopoulos, Basilis Gatos and Stavros Perantonis. Automatic Unsupervised Parameter Selection for Character Segmentation
Dawei Yin, Chang An and Henry Baird. Safely Selecting Subsets of Training Data
Ehry MacRostie, Rohit Prasad, Stephen Rawls, Matin Kamali, Huaigu Cao and Premkumar Natarajan. The BBN Document Analysis Service: A Platform for Multilingual Document Translation
Vu Nguyen and Michael Blumenstein. Techniques for Static Handwriting Trajectory Recovery: A Survey
Iyad Abu Doush and Enrico Pontelli. Detecting and Recognizing Tables in Spreadsheets
Josef Baker, Alan Sexton and Volker Sorge. Faithful Mathematical Formula Recognition from PDF Documents
Albert Gordo, Jaume Gibert, Ernest Valveny and Marçal Rusiñol. A Kernel-based Approach to Document Retrieval
Markus Diem, Florian Kleber and Robert Sablatnig. Document Analysis Applied to Fragments: Feature Set for the Reconstruction of Torn Documents
Trung Quy Phan, Palaiahnakote Shivakumara and Chew Lim Tan. A Skeleton-Based Method for Multi-Oriented Text Detection
Shivakumara Palaiahnakote, Anjan Dutt, Chew Lim Tan and Umapada Pal. A New Wavelet-Median-Moment based Method for Multi-Oriented Video Text Detection
Gudila P. Moshi, Fumitaka Kimura, Tetsushi Wakabayashi, Wataru Ohyama and Lazaro S.P. Busagala. An Impact of Linguistic Features on Automated Classification of OCR Texts
Stefano Ferilli, Teresa M.A. Basile and Floriana Esposito. A Histogram-based Technique for Automatic Threshold Assessment in a Run Length Smoothing-based Algorithm
Anurag Bhardwaj, Manavender Malgireddy, Srirangaraj Setlur, Venu Govindaraju and Sitaram Ramachandrula. Latent Dirichlet Allocation Based Writer Identification in Offline Handwriting
Shusen Zhou, Qingcai Chen and Xiaolong Wang. HIT-OR3C: An Opening Recognition Corpus for Chinese Characters
Benjamin Seidler, Markus Ebbecke and Michael Gillmann. smartFIX Statistics - Towards Systematic Document Analysis Performance Evaluation and Optimization
Ehtesham Hassan, Santanu Chaudhury, M Gopal and Jignesh Dholakia. Use of MKL as Symbol Classifier for Gujarati Character Recognition
De Cao Tran, Patrick Franco and Jean-Marc Ogier. Form Recognition from ink strokes on tablet
Martin Lettner and Robert Sablatnig. Higher Order MRF for Foreground-Background Separation in Multispectral Images of Historical Manuscripts
Marcus Liwicki, Saher Mohamed El-Neklawy and Andreas Dengel. Touch & Write - A Multi-Touch Table with Pen-Input
Linlin Li and Chew Lim Tan. Associating Figures with Descriptions for Patent Documents
Nikolaos Stamatopoulos, Basilis Gatos and Thodoris Georgiou. Page Frame Detection for Double Page Document Images
Koji Nakagawa, Akio Fujiyoshi and Masakazu Suzuki. Ground-Truthed Dataset of Chemical Structure Images in Japanese Published Patent Applications
D. S. Guru, S. Manjunath, Shivakumara Palaiahnakote and Chew Lim Tan. An Eigen Value Based Approach for Text Detection in Video
Karim Hadjar and Rolf Ingold. Improving XED extracting Arabic Documents

Short Paper - Oral
George Nagy, Mukkai Krishnamoorthy, Raghav Padmanabhan, Ramana C. Jandhyala and William Silversmith. Table Metadata: Headers, Augmentations and Aggregates
Emilie Philippot, Yolande Belaïd and Abdel Belaïd. Bayesian networks for online form classification
Akio Fujiyoshi, Koji Nakagawa and Masakazu Suzuki. Robust Recognition Method of Chemical Structure Images for Japanese Published Patent Applications
Christian Reuschling, Stefan Agne and Andreas Dengel. DynaQ - Faceted Search for Document Retrieval
David Doermann and Elena Zotkina. GEDI - A Groundtruthing Environment for Document Images

Short Paper - Poster
Sheikh Faisal Rashid, Faisal Shafait and Thomas Breuel. Connected Component level Multiscript Identification from Ancient Document Images
Igor Filippov, Marc Nicklaus and John Kinney. Improvements in Optical Structure Recognition Application
Masakazu Fujio, Takeshi Nagasaki and Toshikazu Takahashi. Bibliographic Information Extraction of Document Title Pages by Combining FDA's N-best Results and Layout DP-Matching
Carlos Mello. A Visual Perception Approach to Segment Images of Historical Documents
Afef Kacem, Kawther Khazri and Abdel Belaïd. A system for the Recognition of single Arabic Mathematical Symbols
Nibal Nayef and Thomas M. Breuel. A Branch and Bound Algorithm for Graphical Symbol Recognition in Document Images
Henry F. Korth, Dezhao Song and Jeff Heflin. Metadata for Structured Document Datasets
Scott MacLean and George Labahn. Elastic matching in linear time and constant space
Syed Saqib Bukhari, Faisal Shafait and Thomas M. Breuel. Performance Evaluation and Benchmarking of Curled Textlines Segmentation Algorithms
Smita Vemulapalli and Monson H. Hayes III. Using Audio Based Disambiguation for Improving Handwritten Mathematical Content Recognition in Classroom Videos




Workshop and Program Co-Chairs

David Doermann, University of Maryland
Venu Govindaraju, University at Buffalo, SUNY
Daniel Lopresti, Lehigh University
Prem Natarajan, Raytheon BBN Technologies

Publication and WWW Chair

Srirangaraj Setlur, University at Buffalo, SUNY

Finance and Local Arrangements Committee

Rohit Prasad, Raytheon BBN - Chair
David Frampton, Raytheon BBN
Laura Stephens, Raytheon BBN

Program Committee

Gady Agam (USA)
Adel M. Alimi (Tunisia)
Apostolos Antonacopoulos (UK)
Henry Baird (USA)
Elisa Barney Smith (USA)
Abdel Belaïd (France)
Kathrin Berkner (USA)
Thomas Breuel (Germany)
Horst Bunke (Switzerland)
Mohamed Cheriet (Canada)
Andreas Dengel (Germany)
Xiaoqing Ding (China)
Hiromichi Fujisawa (Japan)
Tin Kam Ho (USA)
Jianying Hu (USA)
Rolf Ingold (Switzerland)
Masakazu Iwamura (Japan)
Dimosthenis Karatzas (Spain)
Soo-Hyung Kim (Korea)
Koichi Kise (Japan)
Laurence Likforman-Sulem (France)
Rafael Dueire Lins (Brazil)
Cheng-Lin Liu (China)
Marcus Liwicki (Germany)
R. Manmatha (USA)
Simone Marinai (Italy)
Masaki Nakagawa (Japan)
Satoshi Naoi (Japan)
Il-Seok Oh (Korea)
Shinichiro Omachi (Japan)
Umapada Pal (India)
Hiroshi Sako (Japan)
Venkata Subramaniam (India)
Jun Sun (China)
Kazem Taghva (USA)
Chew Lim Tan (Singapore)
George Thoma (USA)
Karl Tombre (France)
Seiichi Uchida (Japan)
Berin Yanikoglu (Turkey)

