PROGRAM
Days: Friday, December 13th Saturday, December 14th
Friday, December 13th, 2024
View this program: with abstractssession overviewtalk overview
08:40-09:20 Session 1: Keynote I: Gopal Ramchurn (University of Southampton, United Kingdom)
Chair:
Location: Ballroom
09:20-10:00 Session 2: Kenote II: Nitesh V Chawla (University of Notre Dame, United States)
Chair:
Location: Ballroom
10:10-10:40 Session 3: Poster session I
A hybrid multifactorial evolutionary algorithm for the minimum s-Club cover problem (abstract) |
Leveraging Dynamic Graph Word Embedding for Efficient Contextual Representations (abstract) |
H-LSHADE: An Efficient Hybrid Approach for Solving Heterogeneous Target Coverage in Visual Sensor Networks (abstract) |
Developing a Mobile Virtual Assistant using Large Language Models for Task Automation (abstract) |
BKCrawler: A Scalable Web Data Extraction System Using Weak Supervision (abstract) |
Border Fuzzy C-Means Clustering Algorithm (abstract) |
On the Effects of Training Objectives of Multi-agent Reinforcement Learning for Energy Consumption in Residential Buildings (abstract) |
Enhancing Software Fault Localization with Variational Autoencoder and Residual Neural Networks (abstract) |
GCGE: GAN+CFM-powered Data Augmentation and GBT Ensemble Learning for Improving Diabetes Mellitus Prediction (abstract) |
Analysis of Behavioral Facilitation Information During Typhoon Period Based on Victim Attributes (abstract) |
Towards a Unified Delegated Authorization Framework for Microservice-based ERP Systems (abstract) |
Power and Subcarrier Optimization for Heterogeneous QoS Requirement in Wireless Sensor Networks (abstract) |
Improving Quality of Vietnamese to Khmer Neural Machine Translation Using Multi-stage Fine-tuning Strategy (abstract) |
Developing A Vietnamese Regional Voice Dataset and Benchmark For Region Recognition Based On Speech (abstract) |
Enhancing Retrieval Augmented Generation with Hierarchical Text Segmentation Chunking (abstract) |
A Novel Gradient-based Defense Method against Model Poisoning Attacks in Federated Learning (abstract) |
Dual-Domain Reconstruction Network for Enhancing Sparse-View and Low-Dose CT Imaging (abstract) |
10:40-12:00 Session 4A: Networking and Communication Technologies & Software Engineering
Chair:
Location: Danang 1
10:40 | An Evaluation of HTTP/3 and WebTransport over QUIC in Live Low Latency Video Streaming (abstract) |
11:00 | A MAC Protocol for multi-cluster scheduling based on geographical segmentation and Precoloring Extension (abstract) |
11:20 | CoverNexus: Multi-Agent LLM System for Automated Code Coverage Enhancement (abstract) |
11:40 | Optimizing Winograd-based Convolution on GPUs (abstract) |
10:40-12:00 Session 4B: AI Foundations and Big Data
Chair:
Location: Danang 2
10:40 | ASC: Aggregating Sentence-level Classifications for Multi-label Long Text Classification (abstract) |
11:00 | VSum-HB: A Vietnamese Text Summarization Dataset For Reinforcement Learning From Human Feedback (abstract) |
11:20 | Exploring Vegan Dining Experiences: Insights from User-Generated Content Analysis (abstract) |
11:40 | Impact of Style Transfer Approaches on Synthetic Data for Military Camouflaged Object Detection (abstract) ![]() PRESENTER: Thi-Thu-Hang Truong |
10:40-12:00 Session 4C: Multimedia Processing
Chair:
Location: Danang 3
10:40 | KidRisk: Benchmark Dataset for Children Dangerous Action Recognition (abstract) |
11:00 | An Attempt to Develop a Neural Parser based on Simplified Head-Driven Phrase Structure Grammar on Vietnamese (abstract) PRESENTER: Duc-Vu Nguyen |
11:20 | DanceDuo: Bridging Human Movement and AI Choreography (abstract) |
11:40 | Language-Guided Video Object Segmentation (abstract) |
10:40-12:00 Session 4D: Recent Advances in Cyber Security
Chair:
Location: Son Tra
10:40 | A User Privacy Risk - Driven Approach to Web Cookie Classification (abstract) |
11:00 | MADFuzz: A Study on Automatic Exploitation of Smart Contract Vulnerabilities Using Multi-Agent Reinforcement Learning-guided Fuzzing (abstract) ![]() |
11:20 | TL-SOINN: A Transfer Learning-Enhanced Self-Organizing Incremental Neural Network for Network Intrusion Detection (abstract) |
11:40 | SeFed-IDS: A Collaborative Intrusion Detection System Utilizing Semi-Supervised Federated Learning and Data Augmentation (abstract) PRESENTER: Quyen Nguyen Huu |
13:30-14:10 Session 5: Keynote III: Zhou Minghui (Peking University, China)
Chair:
Location: Ballroom
14:50-15:20 Session 7: Poster session II
Hybrid Compression: Integrating Pruning and Quantization for Optimized Neural Networks (abstract) |
Enhancing Unsupervised Person Re-identification with Multi-View Image Representation (abstract) |
OSA: FPGA-based Octa-core SPHINCS+ Accelerator for IoT Security Applications (abstract) |
Decoding Deepfakes: Caption Guided Learning for Robust Deepfake Detection (abstract) |
AYO-GAN: A novel GAN-based adversarial attack on YOLO object detection models (abstract) |
Distortion-Resilient DIBR for Novel View Synthesis from a Single Image (abstract) |
DehazeCLNet: A Contrastive Learning Framework with Advanced Feature Extraction for Image Dehazing (abstract) |
A Lightweight End-to-End Multi-task Learning System for Vietnamese Speaker Verification (abstract) |
Boosting Image Super-Resolution: Incorporating Locally-enhanced FFN and Data Augmentation in the Swin Transformer architecture (abstract) |
Distribution-Guided Object Counting with Optimal Transport and DINO-Based density Refinement (abstract) |
FDE-Net: Lightweight Depth Estimation for Monocular Cameras (abstract) |
Minimalist Preprocessing Approach for Image Synthesis Detection (abstract) |
A Novel Reversible Data Hiding for JPEG Images Based on Zero AC Coefficients Shifting (abstract) |
Motion Analysis in Static Images (abstract) |
AI-Generated Image Recognition via Fusion of CNNs and Vision Transformers (abstract) |
Diffusion-Based Purification for Adversarial Defense in Medical Image Classification (abstract) |
15:20-16:40 Session 8A: AI Applications
Chairs:
Location: Danang 1
15:20 | A combination of YOLO and OSNet Re-ID neuronal networks for tracking abnormalities in Upper Gastrointestinal Endoscopy Videos (abstract) |
15:40 | Integrating Graph and Transformer-Based Models for Enhanced Chemical-Drug Relation Extraction in Document-Level Contexts (abstract) |
16:00 | MedGraph-RPE: Graph-Based Medical Segmentation Enhanced by Novel Relative Positioning Encoding (abstract) |
16:20 | Predicting Bee Swarming: Leveraging Machine Learning and Audio Feature Extraction (abstract) |
15:20-16:40 Session 8B: AI Foundations and Big Data
Chair:
Location: Danang 2
15:20 | BSRBF-KAN: A combination of B-splines and Radial Basis Functions in Kolmogorov-Arnold Networks (abstract) |
15:40 | Diverse Adversarial Samples for Text-to-Image Generation via Quality-Diversity Optimization (abstract) |
16:00 | Contour-enhanced Segmentation: A Novel Approach for Ambiguous Boundary in Polyp Segmentation (abstract) |
16:20 | Adversarial Robustness of Medical Image Classifiers via Denoised Smoothing (abstract) |
15:20-16:40 Session 8C: Multimedia Processing
Chair:
Location: Danang 3
15:20 | Towards Unsupervised Speaker Diarization System for Multilingual Telephone Calls Using Pre-trained Whisper Model and Mixture of Sparse Autoencoders (abstract) |
15:40 | Domain Generalization in Vietnamese Dependency Parsing: A Novel Benchmark and Domain Gap Analysis (abstract) |
16:00 | TI-JEPA: An Innovative Energy-based Joint Embedding Strategy for Text-Image Multimodal Systems (abstract) |
16:20 | Unifying Convolution and Self-Attention for Liver Lesion Diagnosis on Multi-phase Magnetic Resonance Imaging (abstract) |
15:20-16:40 Session 8D: Recent Advances in Cyber Security
Chair:
Location: Son Tra
15:20 | Log-based Representation Transferable Learning for Cross-System Anomaly Detection (abstract) |
15:40 | A Deep Learning Approach to Early Identification of Remote Access Trojans (abstract) |
16:00 | Privacy Challenges in Genomic Data: A Scoping Review of Risks, Mitigation Strategies, and Research Gaps (abstract) |
16:20 | An Efficient Explainable Unsupervised Machine Learning Approach for Network Intrusion Detection in IoMT (abstract) |
16:40-17:40 Session 9A: AI Applications
Chairs:
Location: Danang 1
16:40 | Post-Correction of Handwriting Recognition Using Large Language Models (abstract) |
17:00 | A Proposed Large Language Model-Based Smart Search for Archive System (abstract) |
17:20 | SCA-DS: Face Anti-Spoofing Leveraging Enhanced Spatial and Channel-wise Attention and Depth Supervision (abstract) |
16:40-17:40 Session 9B: AI Foundations, Big Data, and Multimedia Processing
Chair:
Location: Danang 2
16:40 | DOLG-CNet: Deep Orthogonal Fusion of Local and Global Features combined with Contrastive Learning and Deep Supervision for Polyp Segmentation (abstract) |
17:00 | VisChronos: Revolutionizing Image Captioning Through Real-Life Events (abstract) |
17:20 | ViEduQA: A New Vietnamese Dataset for Question Answer Generation in Education (abstract) |
16:40-17:40 Session 9C: Multimedia Processing
Chair:
Location: Danang 3
16:40 | VOI-VR:Voice-driven Object Interaction in Virtual Reality with Large Language Models (abstract) |
17:00 | Towards Real-Time Open World Instance Segmentation (abstract) ![]() |
17:20 | MythraGen: Two-Stage Retrieval Augmented Art Generation Framework (abstract) |
16:40-17:40 Session 9D: Recent Advances in Cyber Security
Chairs:
Location: Son Tra
16:40 | A Study On Explainable Graph Presentation Learning With Semantic Features Embedding For Windows Malware Detection (abstract) |
17:00 | A Study on Efficient Provenance-Based Intrusion Detection System using Few-shot Graph Representation Learning (abstract) ![]() |
17:20 | An Approach of Fine-Tuning Language Models and Handling Long Sequences for Efficiently API Call Analysis in Uncovering Windows Malware (abstract) ![]() PRESENTER: Le Tran Gia Bao |
Saturday, December 14th, 2024
View this program: with abstractssession overviewtalk overview
08:30-09:10 Session 10: Keynote V: Timothy Baldwin (MBZUAI, United Arab Emirates and The University of Melbourne, Australia)
Chair:
Location: Ballroom
09:10-09:50 Session 11: Keynote VI: Yasuyuki Matsushita (Osaka University, Japan)
Chair:
Location: Ballroom
09:50-10:20 Session 12: Poster session III
MAVERICS: Multimodal Advanced Visual Event Retrieval with Integrated CPU-Optimized Search (abstract) |
Forecasting Traffic Flow under Uncertainty: A Case Study in Da Nang (abstract) |
Enhanced Video Retrieval System: Leveraging GPT-4 for Multimodal Query Expansion and Open Image Search (abstract) |
ReViMM: Enhanced Video Retrieval with Reweighting Mechanism for Multi-Modal Queries (abstract) |
LLM-Powered Video Search: A Comprehensive Multimedia Retrieval System (abstract) |
Interactive Video Retrieval System for AI Challenge 2024 Using CLIP, RAM++, and LLM-Enhanced Tag Matching (abstract) |
Transforming Video Search: Leveraging Multimodal Techniques and LLMs for Optimal Retrieval (abstract) PRESENTER: Truong Dinh |
Real-Time Multi-User Multimedia Event Retrieval Application System Using WebSocket Protocol (abstract) |
Application of the SFE Feature Selection Method for Multi-Omic Biomarker Discovery in Brain Cancer Subtyping (abstract) |
Enhancing Video Retrieval via Synergized Image Embeddings and RAG (abstract) |
A Comprehensive Video Event Retrieval System for Vietnamese News: Integrating CLIP ViT, TASK-former, Transcripts, and OCR (abstract) |
LameFrames: Optimizing Video Event Retrieval Through Strategic Integration and Individual Strategy Enhancement (abstract) |
MMMSVR: An Advanced Video Retrieval and Question Answering System (abstract) |
CLIP-Enhanced Lifelog Retrieval System: Robust Multi-Modal Media Search with Real-Time Performance (abstract) |
Enhanced Video Event Retrieval through Adaptive Multi-Model Fusion with Large Language Models (abstract) |
"MAVEN: Video Retrieval System using A Multi-Agent Visual Exploration Network" (abstract) |
Can Image Generative Models be Considered Experts? (abstract) |
10:20-12:00 Session 13A: Generative AI
Chairs:
Tho Quan and Uichin Lee
Location: Danang 1
10:20 | Improving Vietnamese Legal Document Retrieval using Synthetic Data (abstract) |
10:40 | A Diffusion Model for Personalized Text-to-Image Generation (abstract) |
11:00 | Enhancing Neural Machine Translation with Direct Preference Optimization Using Human Feedback (abstract) |
11:20 | A Stable Diffusion Pipeline for Diverse Procedural Painting via Text Prompts (abstract) |
11:40 | Enhancing Image Authenticity in the Age of Generative AI: an Autoencoder-Driven Fourier Transform based Approach (abstract) |
10:20-12:00 Session 13B: Lifelog and Multimedia Event Retrieval
Chairs:
Location: Danang 2
10:20 | Event Retrieval from Large Video Collection in Ho Chi Minh City AI Challenge 2024 (abstract) |
10:40 | Fustar: Divide and Conquer Query in Video Retrieval System (abstract) |
11:00 | NewsInsight2.0: An Enhanced Version Integrating Large Language Model-based Query Optimization with Advanced Temporal Mechanisms (abstract) |
11:20 | AViSearch: A Multimodal Video Event Retrieval System via Query Enhancement and Optimized Keyframes (abstract) |
11:40 | An Optimized And Interactive Video Event Retrieval System With An Improved Temporal Algorithm (abstract) |
10:20-12:00 Session 13D: Applied Operations Research and Optimization
Chair:
Location: Son Tra
10:20 | Constraint Programming-Based Cutting Plane Algorithm for a Combination of Orienteering and Maximum Capture Problem (abstract) PRESENTER: Hoang Giang Pham |
10:40 | Cost Optimization in Competitive Facility Location under General Demand Model (abstract) PRESENTER: Ba Luat Le |
11:00 | Influence Maximization with Fairness Allocation Constraint (abstract) |
11:20 | A Reputation Scoring Framework for Lending Protocols using the PageRank Algorithm (abstract) |
11:40 | A method combining the reference information of the adaptive adjustment method and the decision maker of multi-objective evolutionary algorithms (abstract) |
14:10-14:40 Session 15: Poster session IV
Flow Velocity Analysis of Rivers Using Farneback Optical Flow and STIV Techniques with Drone Data (abstract) |
Faster, Larger, Stronger: Optimally Solving Employee Scheduling Problems with Graph Neural Networks (abstract) |
Advancing Geopolitical Map Analysis: An Intelligent System for Territorial Integrity Verification (abstract) |
Improving Human Action Recognition Using Quaternion Discrete Fourier Transform in Transfer Learning (abstract) |
Cardio Care: A Vision Transformer Cardiac Classification based on Electrocardiogram Images and Signals (abstract) |
A Tool for Preventing Consanguineous Marriages Using Vietnam's National Residents Database (abstract) |
Optimizing Smart Grids with Reinforcement Learning for Enhanced Energy Efficiency (abstract) |
Benchmarking Real-Time Object Detection: Evaluating YOLO and RT-DETR on Speed, Accuracy, and Efficiency (abstract) PRESENTER: Cao Vu Bui |
Progressive Retention Sampling for Sequence Generation-based Scene Text Spotting (abstract) |
Development of an Edge-Computing-Based Intelligent Service Framework for Smart Camera Applications (abstract) |
MedCapNet: A Novel Approach to Medical Image Captioning (abstract) |
Contrastive Perturbation Enhancement for LLM-Based Machine Translation (abstract) |
Traffic Anomaly Detection under Extreme Weather from Aerial Images (abstract) |
URAG: Implementing a Unified Hybrid RAG for Precise Answers in University Admission Chatbots - A Case Study at HCMUT (abstract) |
EPC-YOLOv7: The Proposed One-stage Detector for Aerial Scenario Detection (abstract) |
A Low-Cost EEG-Based System for Measuring and Forecasting Levels of Alertness with Long Short-Term Memory (abstract) |
Real-Time Multi-Face Emotion Recognition for Enhancing Student Engagement in Classroom Environments Using Low-Power IoT Devices (abstract) |
MEPC: Multi-level Product Category Recognition Image Dataset (abstract) |
14:40-16:00 Session 16A: Secured and Intelligent Multimedia Systems
Chair:
Location: Danang 1
14:40 | EPEdit: Redefining Image Editing with Generative AI and User-Centric Design (abstract) |
15:00 | A Simple Approach towards Frame Filtering for Efficient Gaussian Splatting (abstract) |
15:20 | CESE: A Clip-based Event Search Engine for AI Challenge HCMC 2024 (abstract) |
15:40 | GeoSI: An Interesting Interactive System for Retrieving and Mapping News from Multiple Online Sources (abstract) |
14:40-16:00 Session 16B: Lifelog and Multimedia Event Retrieval
Chairs:
Location: Danang 2
14:40 | Addressing Ambiguous Queries in Video Retrieval with Advanced Temporal Search (abstract) |
15:00 | SnapSeek: A Multimodal Video Retrieval System with Context Awareness for AI Challenge 2024 (abstract) |
15:20 | ArtemisSearch: A Multimodal Search Engine for Efficient Video Log-Life Event Retrieval Using Time-Segmented Queries and Vision Transformer-based Feature Extraction (abstract) PRESENTER: Hoang-Phuc Nguyen |
15:40 | KPI: Knowledge-based Processing for Interactive Video Retrieval (abstract) |
14:40-16:00 Session 16C: Human Computer Interaction
Chair:
Location: Danang 3
14:40 | MRClassroom: A Mixed-Reality Interface for Improving Remote Students' Presence in Hybrid Classrooms (abstract) |
15:00 | Multi-Agent Chatbot for Efficient Interaction with Blockchain APIs (abstract) |
15:20 | Evaluation of AI-Based Assistant Representations on User Interaction in Virtual Explorations (abstract) |
15:40 | A Novel Simulation-Driven Data Enrichment Approach to Improve Machine Learning Algorithm Performance (abstract) |
14:40-16:00 Session 16D: Applied Operations Research and Optimization
Chair:
Location: Son Tra
14:40 | Exemplar-Embed Complex Matrix Factorization with Elastic Net Penalty: An Advanced Approach for Data Representation (abstract) PRESENTER: Manh Quan Bui |
15:00 | Modeling Information Diffusion in Bibliographic Networks using Pretopology (abstract) |
15:20 | Optimizing Credit Scoring Models for Decentralized Financial Applications (abstract) |
15:40 | A Historical GPS Trajectory-Based Framework for Predicting Bus Travel Time (abstract) |
16:00-17:20 Session 17A: Secured and Intelligent Multimedia Systems
Chair:
Location: Danang 1
16:00 | Media Certificate Authority: A System to Ensure Media Content Originality for Daily Lifelog Media Collection (abstract) |
16:20 | Mouse Paw Inflammation Evaluation with Segment Anything and Lightness Classification (abstract) |
16:40 | Knowledge Distillation for Lumbar Spine X-ray Classification (abstract) |
17:00 | Exploring Prompt Injection: Methodologies and Risks with an Interactive Chatbot Demonstration (abstract) |
17:20 | Motorcycle Helmet Detection Benchmarking (abstract) |
16:00-17:20 Session 17B: Lifelog and Multimedia Event Retrieval
Chairs:
Location: Danang 2
16:00 | RAPID: Retrieval-Augmented Parallel Inference Drafting for Text-Based Video Event Retrieval (abstract) |
16:20 | A Hybrid Video Retrieval System Using CLIP and BEiT-3 for Enhanced Object and Contextual Understanding (abstract) |
16:40 | VizQuest: Enhanced Video Event Retrieval Using Fusion and Temporal Modeling (abstract) |
17:00 | Unveiling Peripheral Information: A Context-Aware Video Retrieval Approach (abstract) |
16:00-17:20 Session 17C: Human Computer Interaction
Chairs:
Location: Danang 3
16:00 | Now I Know What I am Eating: Real-time Tracking and Nutritional Insights Using VietFood67 to Enhance User Experience (abstract) |
16:20 | Towards Enabling Tangible Interaction with Physical Objects in Virtual Reality Desktop Workplaces (abstract) |
16:40 | Meal Plan App: Personalized meal plans based on personal unique needs. (abstract) |
17:00 | Budget-Aware Keyboardless Interaction (abstract) |