PROGRAM
Monday, September 23rd
10:30-11:00 Coffee Break
12:30-14:00 Lunch Break (Doctoral Lunch)
15:30-16:00 Coffee Break
Tuesday, September 24th
10:30-11:00 Coffee Break
11:00-12:30 Session 8
Chair:
11:00 | An Effective Scheme for Generating an Overview Report over a Very Large Corpus of Documents
11:25 | The CNN-Corpus: a Large Textual Corpus for Single-Document Extractive Summarization
11:50 | A Cell-Detection-Based Table Structure Recognition Method
12:10 | XLIndy: Interactive Recognition and Information Extraction in Spreadsheets
12:30-14:00 Lunch Break (Steering Committee Lunch)
14:00-15:30 Session 9
Chair:
14:00 | Augmenting Music Sheets with Harmonic Fingerprints
14:25 | Writer Characterization and Identification in Short Modern and Historical Documents: Reconsidering Paleographic Tables
14:45 | Digital Degree Certificates for Higher Education in Brazil
15:10 | An Exploratory Analysis of Precedent Relevance in the Brazilian Supreme Court Rulings
15:30-16:00 Coffee Break
16:45-17:30 Session 12: Lightning Talks
Chair:
16:45 | Semi-Automatic LaTeX-Based Labeling of Mathematical Objects in PDF Documents: MOP Data Set
16:45 | A Hybrid AI Tool to Extract Key Performance Indicators from Financial Reports for Benchmarking
16:45 | Combining Word Embeddings with Taxonomy Information for Multi-Label Document Classification
16:45 | The CNN-Corpus in Spanish: a Large Corpus for Extractive Text Summarization in the Spanish Language
16:45 | Enhancing Document-Camera Images
16:45 | The next Millennium Document Format
16:45 | Towards Automated Auditing with Machine Learning
Wednesday, September 25th
09:40-10:30 Session 15
Chair:
09:40 | Searching and Ranking Questionnaires: an Approach to Calculate Similarity Between Questionnaires
10:05 | Multi-Layered Edits for Meaningful Interpretation of Textual Differences
10:30-11:00 Coffee Break
11:00-12:30 Session 16
Chair:
11:00 | Predictable and Consistent Information Extraction
11:25 | Prediction of Mathematical Expression Declarations Based on Spatial, Semantic, and Syntactic Analysis
11:50 | An Algorithm for Extracting Shape Expression Schemas from Graphs
12:10 | Multi-Context Information for Word Representation Learning
12:30-14:00 Lunch Break (BoF Session)
14:00-15:30 Session 17
Chair:
14:00 | Searching Document Repositories Using 3D Model Reconstruction
14:25 | Text Localization in Scientific Figures Using Fully Convolutional Neural Networks on Limited Training Data
14:50 | Automatic Identification and Normalisation of Physical Measurements in Scientific Literature
15:10 | Generating Digital Libraries of M.Sc. and Ph.D. Theses
15:30-16:00 Coffee Break
16:00-17:30 Session 18
Chair:
16:00 | PaperWork: Exploring the Potential of Electronic Paper on Office Work
16:25 | TRIVIR: a Visualization System to Support Document Retrieval with High Recall
16:50 | Globally Optimal Page Breaking with Column Balancing – a Case Study
17:10 | A Vision for User-Defined Semantic Markup
Thursday, September 26th
09:30-11:00 Session 21
Chair:
09:30 | On the Expressive Power of Declarative Constructs in Interactive Document Scripts
09:55 | Modeling Multimodal-Multiuser Interactions in Declarative Multimedia Languages
10:20 | Sentiment Classification Improvement Using Semantically Enriched Information
10:40 | Impact of In-domain Vector Representations on the Classification of Disease-related Tweets: Avian Influenza Case Study
11:00-11:30 Coffee Break
11:30-13:00 Session 22
Chair:
11:30 | Using Knowledge Base Semantics in Context-Aware Entity Linking
11:55 | Multi-Objective GP Strategies for Topical Search Integrating Wikipedia Concepts
12:20 | Enhanced Automated Policy Enforcement eXchange Framework (eAPEX)
12:40 | Enhanced Document Retrieval and Discovery Based on a Combination of Implicit and Explicit Document Relationships
13:00-14:00 Lunch Break