View: session overviewtalk overview
National University of Singapore
Title: Sustainable Innovation in Digital Libraries: Perspectives from NUS Libraries
Abstract: Digital libraries play an increasingly pivotal role in addressing challenges related to sustainability, including but not limited to: ensuring that data remains accessible and usable to communities, minimising energy consumption, addressing DEI (diversity, equity and inclusion) issues, and managing financial feasibility. In this talk, I will highlight key initiatives and innovations at NUS Libraries aimed at reducing the carbon footprints of our digital projects, as well as how digital collections are used to foster community engagement. For librarians and educators alike, the talk aims to provide key guiding principles in developing sustainable digital libraries that are beneficial to both communities and the environment.
Online participants: Zoom link
Online participants: Zoom link
11:00 | Metadata-Assisted Topic Modeling for Patent Analysis ABSTRACT. Patent documents serve as a valuable source of information for companies and researchers when making decisions. Therefore, there is a demand for classifying patent documents and analyzing them. Topic modeling is a major approach to automatically extracting topics from documents, and it can be used for document classification. Although we can apply existing topic modeling methods to patent documents for our intended purpose, most models do not fully take into account the relationships between patent documents and their metadata, such as titles, abstracts, and classification codes. This paper explores a topic modeling approach that considers the relationships between patent documents and their metadata, by developing and evaluating a novel method, named Metadata-Assisted Topic Modeling (MATopic). MATopic (1) generates more coherent and diverse topics by considering both the metadata of patents and their context, and (2) infers evidence about what components make up the topics by representing the detected topics as nodes in a knowledge graph. The experiments using real-world patent document datasets show that MATopic outperforms existing methods in both topic diversity and coherence. |
11:20 | How Knowledge Recombination Fuels Technological Innovation? Insights from IPC Co-occurrence Networks ABSTRACT. As knowledge recombination increasingly shapes technological advancement, scholarly efforts have focused on how integrating knowledge elements can enhance technological innovation performance. Our study con-tributes to this literature by adopting a network perspective and assessing the predictive value of various knowledge linkage features. Using granted patents in the pharmaceutical field, we constructed the annual International Patent Classification (IPC) co-occurrence network and extracted the real-time linkage features of pairwise co-occurring IPCs, including tie strength, knowledge distance, node assortativity, and betweenness centrality. Based on explainable machine learning, we found that XGBoost outperformed in predicting both patent impact and patent disruptiveness. More importantly, feature interpretation based on the Shapley value illustrates that patent impact and patent disruptiveness have different determinants, implicating different knowledge combinative strategies. Specifically, we found that betweenness centrality and node assortativity contribute significantly in predicting patent impact, suggesting that localized search combining hotspot and marginal knowledge components are more likely to produce impactful inventions. Conversely, tie strength between combined knowledge components is the most important predictor for patent disruptiveness, indicating that deeper exploitation along existing technology portfolios can effectively enhance patent disruptiveness. These results provide new insights into how knowledge recombination fuels technological innovation and offer practitioners valuable strategies for developing target technologies. |
11:40 | Managing Comprehensive Research Instrument Descriptions within a Scholarly Knowledge Graph PRESENTER: Muhammad Haris ABSTRACT. In research, measuring instruments play a crucial role in producing the data that underpin scientific discoveries. Information about instruments is essential in data interpretation and, thus, knowledge production. However, if at all available and accessible, such information is scattered across numerous data sources. Relating the relevant details, e.g. instrument specifications or calibrations, with associated research assets (data, but also operating infrastructures) is challenging. Moreover, understanding the (possible) use of instruments is essential for researchers in experiment design and execution. To address these challenges, we propose a Knowledge Graph (KG) based approach for representing, publishing, and using information, extracted from various data sources, about instruments and associated scholarly artefacts. The resulting KG serves as a foundation for exploring and gaining a deeper understanding of the use and role of instruments in research, discovering relations between instruments and associated artefacts (articles and datasets), and opens the possibility to quantify the impact of instruments in research. |
12:00 | Developing an AI-enhanced Conversation Application on DSpace: Technical Procedure and Details PRESENTER: Le Yang ABSTRACT. Open repository projects serve as valuable resources for scholarly communication and knowledge dissemination. However, interacting with these repositories can be challenging due to the vast amount of data, the restriction of browsing features, and the limitation of relational database queries. This paper presents a technical procedure and coding details that leverage Retrieval-augmented Generation (RAG), Large Language Models (LLMs), embeddings, and LangChain to develop an AI-enhanced conversation application tailored for communication with open repository systems, specifically focusing on DSpace. The chatbot uses RAG to enhance response generation by integrating relevant information retrieved from repository structural data and text documents. LLMs form the chatbot's core generation capabilities, ensuring coherent and contextually appropriate interactions. Embeddings are used to semantically enrich queries and responses, thereby enhancing understanding and relevance. LangChain coordinates among these components, managing the information flow and interaction between users and repositories. |
12:15 | Exploring GenAI’s Role in Digital Cultural Memory at Museums and Art Galleries in Indonesia: AR and VR Perspectives ABSTRACT. With the growing prevalence of generative artificial intelligence (GenAI), its impact on digital libraries and archives has become increasingly significant. This research explores the role of GenAI in enhancing digital cultural memory initiatives at museums and art galleries in Indonesia, aligning with the theme of "Sustainability and Empowerment." The study investigates how the integration of augmented reality (AR) and virtual reality (VR) technologies can advance the preservation, accessibility, and engagement of cultural artifacts and exhibitions. ImersifA, a 360-degree permanent video mapping installation at the National Museum of Indonesia, exemplifies these efforts. Utilizing AR mode, ImersifA animates cultural artifacts, providing contextual information and interactive elements that deepen visitor understanding and experience. Additionally, virtual tours of permanent exhibitions at the National Gallery of Indonesia enable remote exploration of artworks and cultural heritage, overcoming geographical barriers and expanding public access. Employing qualitative methods with direct observation, this study reveals that AR and VR significantly contribute to the sustainability and empowerment of cultural preservation, accessibility, and audience engagement in the digital age. |
Online participants: Zoom link
Online participants: Zoom link
15:30 | A BERT-Based Method of Named Entity Recognition for Ukiyo-e Titles ABSTRACT. Named entity recognition (NER) is a particularly challenging task, especially for historical documents that lack extensive annotated datasets. The titles of ukiyo-e, a genre of Japanese artworks, contain a significant number of entities and are composed of short texts rich in historical information. The complexity and brevity of these titles pose considerable challenges for analysis. This paper presents the construction of an ukiyo-e NER dataset and introduces a BERT-based NER methodology that achieves notable success in entity recognition within ukiyo-e titles. The study underscores the effectiveness of BERT and its derivative models on the ukiyo-e NER dataset, proposing a viable NER solution for historical documents. The proposed approach demonstrates the capability to perform NER tasks on ukiyo-e titles using pre-trained models on a relatively small annotated dataset, achieving an accuracy exceeding 80\%. Furthermore, this paper examines the distinct characteristics of BERT and its derivative models, optimizing their application to the ukiyo-e NER dataset. |
15:50 | Digitization in Indonesian libraries and archives: Mapping cultural heritage across the archipelago ABSTRACT. Despite the country’s cultural wealth, Indonesia is still struggling to identify and map its cultural heritage collections across the dispersed islands, as well as digitize and make them accessible online. This paper aims to identify and examine the provision and digitization of cultural collections carried by state libraries and archives in the 38 provinces across the archipelago. Out of 32 institutions, we obtained 65.6% responses which show that state libraries and archives in Indonesia hold a diverse range of cultural heritage collections that are being digitized and provide valuable resources which may not be found elsewhere. However, multiple layers of constraints ranging from policy and copyright issues, traditional beliefs and distrust of the local communities, and the lack and unequal development of digital infrastructures and skilled staffs challenge many digitization projects. Overcoming this complex situation requires a collaborative approach from different stakeholders to ensure these cultural resources are more widely accessible, and therefore beneficial to the public. |
16:10 | Beyond the Scroll: Eye-tracking Evaluations on the National Gallery of Indonesia’s Instagram PRESENTER: Rahmi Rahmi ABSTRACT. Eye-tracking studies focused on art galleries and museums within the realm of Instagram, especially in the Indonesian context, are notably scant. This investigation elucidates the user interaction and experience of the art gallery’s Instagram account, @galerinasional. Employing a quantitative approach, we implemented eye-tracking experiments using the well-known online eye-tracking software RealEye. The study participants were followers of the National Gallery of Indonesia’s (Galnas) Instagram profile, seven women and three men, in two age brackets: 18–24 years (n = 5) and 25–34 years (n = 5). The experimental protocol required participants to execute five tasks centered on locating exhibition details and artwork on the Galnas Instagram platform, with a duration of 100 seconds allocated to each task. The Participant Quality Grade (PQG), examination duration, optimal browser dimensionality, K-factor mean, gaze and fixation frequencies, and heat maps were determined for the participants. The key findings are as follows. (1) The participants’ PQG scores were predominantly between 4 and 6. (2) Investigative browsing typically exceeded the 100-second limit. (3) The consensus on the ideal browser dimensions hovered around 1,368.6 × 814.8 px. (4) The K-factor mean across participants tended toward K > 0. (5) The average gaze occurrence was 27 items per second for most participants. (6) A noticeable disparity in fixation counts was observed between the two age groups, with individuals aged 25–34 demonstrating prolonged engagement with Galnas’ content. (7) Heat maps accentuated a pronounced concentration on visual content, encompassing photographs, videos, and their associated captions. |
16:25 | Retrospectively Mining War Impacts upon Collective Moral Motivation from Historical Newspaper Archives PRESENTER: Liang Zhao ABSTRACT. Shall people be more moral during the war? By mining large-scale historical newspaper archives, we provide promising evidence through big data analysis to answer this question. Taking the American Civil War as the scenario, we exploited the historical newspaper database Chronicling America as corpus together with widely-used moral lexicons, and retrace the diurnal dynamics of collective moral motivation of American society from 1849 to 1876. Sufficient statistics revealed that compared with Pre-war and Post-war, collective moral motivation of the society increased significantly during wartime. Regression analysis suggested the situational sensitivity of moral motivation, i.e., during the first half of wartime the moral motivation boosted while then decreased in the second half when the war entered a stalemate. Moreover, the two component dimensions, agency and communion, also increased during wartime. |
16:40 | Does Oral Knowledge Belongs to Library: Library Professionals' Perspective ABSTRACT. Certain knowledge is transfer from one generation to the next through spoken word, a process known as oral tradition or oral knowledge transfer. Oral knowledge refers to the sharing and preservation of information, stories, and experiences through verbal communication. Oral knowledge are major sources of information which includes cultural and traditional beliefs, historical events, ancestral histories, developmental journey, myths, legends, and skills, among other things. It portrays a geographical and political landscape of a community or a social group. At the same time, since it passes from generation to generation orally and there is no documented form of these information. And at this point, the question arises whether these information’s are in a vulnerable phase of missing out? Proper documentation, archiving is the necessity of the hour to preserve these knowledges for sustainable use by budding generations. The study focuses on perspective of library professionals towards oral traditions and their roles and responsibilities in preserving cultural heritages where the initiatives taken by various institutions to preserve the oral knowledge is studied. To collect data of the study, a structured questionnaire was constructed and distributed among the library professionals of North East India through social media platforms. The results states that library professionals’ holds the responsibilities of collecting, documenting, preserving, and disseminating oral traditions, where technologies like Artificial Intelligence (AI) can be involved. Unfortunately, awareness of oral traditions among important stockholders like library professionals is not satisfactory. In this regard, culturally rich site like north east India is still in a position where education, awareness about oral knowledge is in theoretical form; without proper practical implementation of theories of creating awareness among people, preservation of our oral heritages for sustainable use will remain a myth. |