View: session overviewtalk overview
Keynote - John Collomosse
Poster Lightning Talks
10:30 | Towards a Model and a Textual Representation for Location-based Games SPEAKER: unknown ABSTRACT. Location-Based Mobile Games (LBMGs) are a subclass of pervasive games that make use of location technologies to consider the players' geographic position in the game rules and mechanics. This research presents LEGaL, a language to model and represent the structure and multimedia contents (e.g., video, audio, 3D objects, etc.) of LBMGs. LEGaL is an extension of NCL (Nested Context Language) that allows the modelling and representation of mission-based games by supporting spatial and temporal relationships between game elements. |
10:32 | SketchTab3d: An hybrid sketch library using tablets and immersive 3d environments SPEAKER: Johann Habakuk Israel ABSTRACT. This paper proposes a 2d sketching tool and an immersive 3d sketch library as an approach to easily create and access documents (i.e. sketches). The sketch library allows users to store, arrange and assemble their own sketches and others’ in theoretically unlimited space. A user can get an idea about the general activities of all users since the sketch library is updated whenever changes are made. The system provides 2d and 3d means to access the sketch library. Whereas the 2d interfaces offers a standard dash board, the 3d environment provides unrestricted spatial access to the sketch library. Furthermore, a 2d sketching interfaces is provided in order to create sketch-based documents. Possible application areas are in the fields of engineering, design, public displays, shared knowledge applications, and art. The system was evaluated among eight participants regarding its pragmatic and hedonic qualities as well as searching performance. The results suggest that the users appreciate the particular combination of 2d and 3d technologies in SketchTab3d and requested for improvement in the 3d interaction technique. No significant differences were found in the search performance, however the physical demand during searching was perceived significantly higher in the 3d condition than in the 2d condition. |
10:34 | A tool for mixing XML annotations SPEAKER: Bertrand Gaiffe ABSTRACT. XML documents, in particular critical editions are usually very heavily annotated. They usually represent abbreviated part of words, variant readings, edition operations etc. Among such documents, a part of the PCDATA\footnote{Character contents of the text} is the actual edition of the text. Very often, one wants to run automatic tools on this ``simple'' text and thereafter re-embed the result into the original file. The tool we present here is dedicated to this embedding of annotations. In order to achieve this, the tool sets the problem as an ambiguous input and parses that ambiguous input by the grammar of the XML language. It then proposes those solutions that are syntactically correct. In case there are none, the input is modified and reparsed until at least a solution is found. The tool is available at https://github.com/bgaiffe/XMLMixer. |
10:36 | Authenticity in a digital era: Still a document process. The case of laboratory notebooks SPEAKER: unknown ABSTRACT. Asymmetric cryptography brings the ability for anyone on the earth to check the signature of a digital object (Diffie & Hellman, 1976). From there, trusted timestamping of a digital object provides very strong evidence of its author or inventor and integrity (Haber, 1991). 26 years later, trusted timestamping could have replaced, for example, traditional paper laboratory notebooks a long time ago. It has not happened yet. In this paper, we argue that the reason is because authenticity is a document process: while trusted timestamping remains a necessary part of the process, a digital object must be involved in a sociotechnical process in order to become a document. We fisrt point out the gap, intractable with paper, between the strict administrative workflow required to create strong evidence, and the fluidity of collaborative authoring needed for creativity. This gap is relevant to laboratory notebooks, as they are commonly used by inventors to attest that they discovered elements at a specific time, in a specific context. Then we explain the design and implementation of our software system, according to document theory (Buckland, 1997), in order to reinvent the whole process to minimize the administrative burden, while preserving its well-known and valuable properties. |
10:38 | Fast Binarization with Chebyshev Inequality SPEAKER: unknown ABSTRACT. In order to enhance the binarization result of degraded document images with smudged and bleed-through background, we present an fast binarization technique that applies the Chebyshev theory in the image preprocessing. We introduce the Chebyshev filter which uses the Chebyshev inequality in the segmentation of objects and background. Our result shows that the Chebyshev filter is not only effective, but also simple, robust and easy to implement. Because of its simplicity, our method is sufficiently efficient to process live image sequences in real-time. We have implemented and compared with the Document Image Binarization Contest datasets (H-DIBCO 2014) for testing and evaluation. The experimental outcomes have demonstrated that this method achieved good result in this literature. |
10:40 | Post-Processing OCR Text using Web-Scale Corpora SPEAKER: unknown ABSTRACT. We introduce a (semi-)automatic OCR post-processing system that utilizes web-scale linguistic corpus in providing high-quality correction. This paper is a comprehensive system overview with the focus on the computational procedure, applied linguistic analysis, and batch processing optimization. |
10:42 | Qqmbr and indentml: extensible mathematical publishing for web and paper SPEAKER: Ilya Schurov ABSTRACT. We present qqmbr, novel publishing system aimed on preparation of high-quality mathematical publications. One source can be converted to a single interactive webpage, multipage website or PDF (via LaTeX). The markup language behind qqmbr entitled indentml is designed to be both human-readable and machine-readable (easily parsable). It is possible to extend basic qqmbr markup with custom tags that enrich its semantics and build plugins and applications that query qqmbr documents, extract information from them and process it in arbitrary way without much efforts. |
10:44 | The Common Fold: Utilizing the Four-Fold to Dewarp Printed Documents from a Single Image SPEAKER: unknown ABSTRACT. Handheld cameras are currently the device of choice for performing document digitization, due to their convenience, ubiquity and high performance at low cost. Software methods process a captured image, to rectify distortions and reconstruct the original document. Existing methods struggle to reconstruct a flattened version given a single image of a document distorted by folding. We propose a novel non-parametric page dewarping approach from a single image based on deep learning to identify creases due to folds on the paper. Our method then performs a 2D boundary method based on polynomial regression, and a Coons patch, to get a flattened reconstruction. We found our method improves OCR word accuracy by nearly 2.5 times when compared to the original distorted image. |
10:46 | Improving Version-Aware Word Documents SPEAKER: unknown ABSTRACT. Coakley et al. described how they developed Version Aware Word Documents, which is an enhanced document representation that includes a detailed version history that is self-contained and portable. However, they were not able to adopt the unique-ID-based techniques that have been shown to support efficient merging and differencing algorithms. This application note describes how it is possible to adapt existing features of MS Word's OOXML representation to provide a system of unique element IDs suitable for those algorithms. This requires taking over Word's Revision Save ID (RSID) system and also defining procedures for specifying ID values for elements that do not support the RSID mechanism. Important limitations remain but appear surmountable. |
10:48 | Classification of MathML Expressions Using Multilayer Perceptron SPEAKER: unknown ABSTRACT. MathML (Mathematical Markup Language) consists of two sets of elements: Presentation Markup and Content Markup. The former is more widely used to display math expressions in Web pages, while the latter is more suited to the calculation of math expressions. In this paper, we consider classifying math expressions in Presentation Markup. In general, a math expression in Presentation Markup cannot be uniquely converted into the corresponding expression in Content Markup. If the class of a given math expression can be identified automatically, such conversions can be done more appropriately. Moreover, identifying the class of a given math expression is useful for text-to-speech of math expression. In this paper, we propose a method for classifying math expressions in Presentation Markup by using a kind of deep learning; multilayer perceptron. Experimental results show that our method classifies math expressions with high accuracy. |
Document Analysis: classification and similarity
Document Analysis: content analysis
Sigweb Presentation