SOICT 2024: THE 13TH INTERNATIONAL SYMPOSIUM ON INFORMATION AND COMMUNICATION TECHNOLOGY
PROGRAM

Days: Friday, December 13th Saturday, December 14th

Friday, December 13th, 2024

View this program: with abstractssession overviewtalk overview

10:10-10:40 Session 3: Poster session I
A hybrid multifactorial evolutionary algorithm for the minimum s-Club cover problem (abstract)
Leveraging Dynamic Graph Word Embedding for Efficient Contextual Representations (abstract)
H-LSHADE: An Efficient Hybrid Approach for Solving Heterogeneous Target Coverage in Visual Sensor Networks (abstract)
Developing a Mobile Virtual Assistant using Large Language Models for Task Automation (abstract)
BKCrawler: A Scalable Web Data Extraction System Using Weak Supervision (abstract)
Border Fuzzy C-Means Clustering Algorithm (abstract)
On the Effects of Training Objectives of Multi-agent Reinforcement Learning for Energy Consumption in Residential Buildings (abstract)
Enhancing Software Fault Localization with Variational Autoencoder and Residual Neural Networks (abstract)
GCGE: GAN+CFM-powered Data Augmentation and GBT Ensemble Learning for Improving Diabetes Mellitus Prediction (abstract)
Analysis of Behavioral Facilitation Information During Typhoon Period Based on Victim Attributes (abstract)
Towards a Unified Delegated Authorization Framework for Microservice-based ERP Systems (abstract)
Power and Subcarrier Optimization for Heterogeneous QoS Requirement in Wireless Sensor Networks (abstract)
Improving Quality of Vietnamese to Khmer Neural Machine Translation Using Multi-stage Fine-tuning Strategy (abstract)
Developing A Vietnamese Regional Voice Dataset and Benchmark For Region Recognition Based On Speech (abstract)
Enhancing Retrieval Augmented Generation with Hierarchical Text Segmentation Chunking (abstract)
A Novel Gradient-based Defense Method against Model Poisoning Attacks in Federated Learning (abstract)
Dual-Domain Reconstruction Network for Enhancing Sparse-View and Low-Dose CT Imaging (abstract)
10:40-12:00 Session 4A: Networking and Communication Technologies & Software Engineering
Location: Danang 1
10:40
An Evaluation of HTTP/3 and WebTransport over QUIC in Live Low Latency Video Streaming (abstract)
11:00
A MAC Protocol for multi-cluster scheduling based on geographical segmentation and Precoloring Extension (abstract)
11:20
CoverNexus: Multi-Agent LLM System for Automated Code Coverage Enhancement (abstract)
11:40
Optimizing Winograd-based Convolution on GPUs (abstract)
10:40-12:00 Session 4B: AI Foundations and Big Data
Location: Danang 2
10:40
ASC: Aggregating Sentence-level Classifications for Multi-label Long Text Classification (abstract)
11:00
VSum-HB: A Vietnamese Text Summarization Dataset For Reinforcement Learning From Human Feedback (abstract)
11:20
Exploring Vegan Dining Experiences: Insights from User-Generated Content Analysis (abstract)
11:40
Impact of Style Transfer Approaches on Synthetic Data for Military Camouflaged Object Detection (abstract)
10:40-12:00 Session 4C: Multimedia Processing
Location: Danang 3
10:40
KidRisk: Benchmark Dataset for Children Dangerous Action Recognition (abstract)
11:00
An Attempt to Develop a Neural Parser based on Simplified Head-Driven Phrase Structure Grammar on Vietnamese (abstract)
PRESENTER: Duc-Vu Nguyen
11:20
DanceDuo: Bridging Human Movement and AI Choreography (abstract)
11:40
Language-Guided Video Object Segmentation (abstract)
10:40-12:00 Session 4D: Recent Advances in Cyber Security
Location: Son Tra
10:40
A User Privacy Risk - Driven Approach to Web Cookie Classification (abstract)
11:00
MADFuzz: A Study on Automatic Exploitation of Smart Contract Vulnerabilities Using Multi-Agent Reinforcement Learning-guided Fuzzing (abstract)
11:20
TL-SOINN: A Transfer Learning-Enhanced Self-Organizing Incremental Neural Network for Network Intrusion Detection (abstract)
11:40
SeFed-IDS: A Collaborative Intrusion Detection System Utilizing Semi-Supervised Federated Learning and Data Augmentation (abstract)
PRESENTER: Quyen Nguyen Huu
14:50-15:20 Session 7: Poster session II
Hybrid Compression: Integrating Pruning and Quantization for Optimized Neural Networks (abstract)
Enhancing Unsupervised Person Re-identification with Multi-View Image Representation (abstract)
OSA: FPGA-based Octa-core SPHINCS+ Accelerator for IoT Security Applications (abstract)
Decoding Deepfakes: Caption Guided Learning for Robust Deepfake Detection (abstract)
AYO-GAN: A novel GAN-based adversarial attack on YOLO object detection models (abstract)
Distortion-Resilient DIBR for Novel View Synthesis from a Single Image (abstract)
DehazeCLNet: A Contrastive Learning Framework with Advanced Feature Extraction for Image Dehazing (abstract)
A Lightweight End-to-End Multi-task Learning System for Vietnamese Speaker Verification (abstract)
Boosting Image Super-Resolution: Incorporating Locally-enhanced FFN and Data Augmentation in the Swin Transformer architecture (abstract)
Distribution-Guided Object Counting with Optimal Transport and DINO-Based density Refinement (abstract)
FDE-Net: Lightweight Depth Estimation for Monocular Cameras (abstract)
Minimalist Preprocessing Approach for Image Synthesis Detection (abstract)
A Novel Reversible Data Hiding for JPEG Images Based on Zero AC Coefficients Shifting (abstract)
Motion Analysis in Static Images (abstract)
AI-Generated Image Recognition via Fusion of CNNs and Vision Transformers (abstract)
Diffusion-Based Purification for Adversarial Defense in Medical Image Classification (abstract)
15:20-16:40 Session 8A: AI Applications
Location: Danang 1
15:20
A combination of YOLO and OSNet Re-ID neuronal networks for tracking abnormalities in Upper Gastrointestinal Endoscopy Videos (abstract)
15:40
Integrating Graph and Transformer-Based Models for Enhanced Chemical-Drug Relation Extraction in Document-Level Contexts (abstract)
16:00
MedGraph-RPE: Graph-Based Medical Segmentation Enhanced by Novel Relative Positioning Encoding (abstract)
16:20
Predicting Bee Swarming: Leveraging Machine Learning and Audio Feature Extraction (abstract)
15:20-16:40 Session 8B: AI Foundations and Big Data
Location: Danang 2
15:20
BSRBF-KAN: A combination of B-splines and Radial Basis Functions in Kolmogorov-Arnold Networks (abstract)
15:40
Diverse Adversarial Samples for Text-to-Image Generation via Quality-Diversity Optimization (abstract)
16:00
Contour-enhanced Segmentation: A Novel Approach for Ambiguous Boundary in Polyp Segmentation (abstract)
16:20
Adversarial Robustness of Medical Image Classifiers via Denoised Smoothing (abstract)
15:20-16:40 Session 8C: Multimedia Processing
Location: Danang 3
15:20
Towards Unsupervised Speaker Diarization System for Multilingual Telephone Calls Using Pre-trained Whisper Model and Mixture of Sparse Autoencoders (abstract)
15:40
Domain Generalization in Vietnamese Dependency Parsing: A Novel Benchmark and Domain Gap Analysis (abstract)
16:00
TI-JEPA: An Innovative Energy-based Joint Embedding Strategy for Text-Image Multimodal Systems (abstract)
16:20
Unifying Convolution and Self-Attention for Liver Lesion Diagnosis on Multi-phase Magnetic Resonance Imaging (abstract)
15:20-16:40 Session 8D: Recent Advances in Cyber Security
Location: Son Tra
15:20
Log-based Representation Transferable Learning for Cross-System Anomaly Detection (abstract)
15:40
A Deep Learning Approach to Early Identification of Remote Access Trojans (abstract)
16:00
Privacy Challenges in Genomic Data: A Scoping Review of Risks, Mitigation Strategies, and Research Gaps (abstract)
16:20
An Efficient Explainable Unsupervised Machine Learning Approach for Network Intrusion Detection in IoMT (abstract)
16:40-17:40 Session 9A: AI Applications
Location: Danang 1
16:40
Post-Correction of Handwriting Recognition Using Large Language Models (abstract)
17:00
A Proposed Large Language Model-Based Smart Search for Archive System (abstract)
17:20
SCA-DS: Face Anti-Spoofing Leveraging Enhanced Spatial and Channel-wise Attention and Depth Supervision (abstract)
16:40-17:40 Session 9B: AI Foundations, Big Data, and Multimedia Processing
Location: Danang 2
16:40
DOLG-CNet: Deep Orthogonal Fusion of Local and Global Features combined with Contrastive Learning and Deep Supervision for Polyp Segmentation (abstract)
17:00
VisChronos: Revolutionizing Image Captioning Through Real-Life Events (abstract)
17:20
ViEduQA: A New Vietnamese Dataset for Question Answer Generation in Education (abstract)
16:40-17:40 Session 9C: Multimedia Processing
Location: Danang 3
16:40
VOI-VR:Voice-driven Object Interaction in Virtual Reality with Large Language Models (abstract)
17:00
Towards Real-Time Open World Instance Segmentation (abstract)
17:20
MythraGen: Two-Stage Retrieval Augmented Art Generation Framework (abstract)
16:40-17:40 Session 9D: Recent Advances in Cyber Security
Location: Son Tra
16:40
A Study On Explainable Graph Presentation Learning With Semantic Features Embedding For Windows Malware Detection (abstract)
17:00
A Study on Efficient Provenance-Based Intrusion Detection System using Few-shot Graph Representation Learning (abstract)
17:20
An Approach of Fine-Tuning Language Models and Handling Long Sequences for Efficiently API Call Analysis in Uncovering Windows Malware (abstract)
PRESENTER: Le Tran Gia Bao
Saturday, December 14th, 2024

View this program: with abstractssession overviewtalk overview

09:50-10:20 Session 12: Poster session III
MAVERICS: Multimodal Advanced Visual Event Retrieval with Integrated CPU-Optimized Search (abstract)
Forecasting Traffic Flow under Uncertainty: A Case Study in Da Nang (abstract)
Enhanced Video Retrieval System: Leveraging GPT-4 for Multimodal Query Expansion and Open Image Search (abstract)
ReViMM: Enhanced Video Retrieval with Reweighting Mechanism for Multi-Modal Queries (abstract)
LLM-Powered Video Search: A Comprehensive Multimedia Retrieval System (abstract)
Interactive Video Retrieval System for AI Challenge 2024 Using CLIP, RAM++, and LLM-Enhanced Tag Matching (abstract)
Transforming Video Search: Leveraging Multimodal Techniques and LLMs for Optimal Retrieval (abstract)
PRESENTER: Truong Dinh
Real-Time Multi-User Multimedia Event Retrieval Application System Using WebSocket Protocol (abstract)
Application of the SFE Feature Selection Method for Multi-Omic Biomarker Discovery in Brain Cancer Subtyping (abstract)
Enhancing Video Retrieval via Synergized Image Embeddings and RAG (abstract)
A Comprehensive Video Event Retrieval System for Vietnamese News: Integrating CLIP ViT, TASK-former, Transcripts, and OCR (abstract)
LameFrames: Optimizing Video Event Retrieval Through Strategic Integration and Individual Strategy Enhancement (abstract)
MMMSVR: An Advanced Video Retrieval and Question Answering System (abstract)
CLIP-Enhanced Lifelog Retrieval System: Robust Multi-Modal Media Search with Real-Time Performance (abstract)
Enhanced Video Event Retrieval through Adaptive Multi-Model Fusion with Large Language Models (abstract)
"MAVEN: Video Retrieval System using A Multi-Agent Visual Exploration Network" (abstract)
Can Image Generative Models be Considered Experts? (abstract)
10:20-12:00 Session 13A: Generative AI
Chairs:
Location: Danang 1
10:20
Improving Vietnamese Legal Document Retrieval using Synthetic Data (abstract)
10:40
A Diffusion Model for Personalized Text-to-Image Generation (abstract)
11:00
Enhancing Neural Machine Translation with Direct Preference Optimization Using Human Feedback (abstract)
11:20
A Stable Diffusion Pipeline for Diverse Procedural Painting via Text Prompts (abstract)
11:40
Enhancing Image Authenticity in the Age of Generative AI: an Autoencoder-Driven Fourier Transform based Approach (abstract)
10:20-12:00 Session 13B: Lifelog and Multimedia Event Retrieval
Location: Danang 2
10:20
Event Retrieval from Large Video Collection in Ho Chi Minh City AI Challenge 2024 (abstract)
10:40
Fustar: Divide and Conquer Query in Video Retrieval System (abstract)
11:00
NewsInsight2.0: An Enhanced Version Integrating Large Language Model-based Query Optimization with Advanced Temporal Mechanisms (abstract)
11:20
AViSearch: A Multimodal Video Event Retrieval System via Query Enhancement and Optimized Keyframes (abstract)
11:40
An Optimized And Interactive Video Event Retrieval System With An Improved Temporal Algorithm (abstract)
10:20-12:00 Session 13D: Applied Operations Research and Optimization
Location: Son Tra
10:20
Constraint Programming-Based Cutting Plane Algorithm for a Combination of Orienteering and Maximum Capture Problem (abstract)
PRESENTER: Hoang Giang Pham
10:40
Cost Optimization in Competitive Facility Location under General Demand Model (abstract)
PRESENTER: Ba Luat Le
11:00
Influence Maximization with Fairness Allocation Constraint (abstract)
11:20
A Reputation Scoring Framework for Lending Protocols using the PageRank Algorithm (abstract)
11:40
A method combining the reference information of the adaptive adjustment method and the decision maker of multi-objective evolutionary algorithms (abstract)
14:10-14:40 Session 15: Poster session IV
Flow Velocity Analysis of Rivers Using Farneback Optical Flow and STIV Techniques with Drone Data (abstract)
Faster, Larger, Stronger: Optimally Solving Employee Scheduling Problems with Graph Neural Networks (abstract)
Advancing Geopolitical Map Analysis: An Intelligent System for Territorial Integrity Verification (abstract)
Improving Human Action Recognition Using Quaternion Discrete Fourier Transform in Transfer Learning (abstract)
Cardio Care: A Vision Transformer Cardiac Classification based on Electrocardiogram Images and Signals (abstract)
A Tool for Preventing Consanguineous Marriages Using Vietnam's National Residents Database (abstract)
Optimizing Smart Grids with Reinforcement Learning for Enhanced Energy Efficiency (abstract)
Benchmarking Real-Time Object Detection: Evaluating YOLO and RT-DETR on Speed, Accuracy, and Efficiency (abstract)
PRESENTER: Cao Vu Bui
Progressive Retention Sampling for Sequence Generation-based Scene Text Spotting (abstract)
Development of an Edge-Computing-Based Intelligent Service Framework for Smart Camera Applications (abstract)
MedCapNet: A Novel Approach to Medical Image Captioning (abstract)
Contrastive Perturbation Enhancement for LLM-Based Machine Translation (abstract)
Traffic Anomaly Detection under Extreme Weather from Aerial Images (abstract)
URAG: Implementing a Unified Hybrid RAG for Precise Answers in University Admission Chatbots - A Case Study at HCMUT (abstract)
EPC-YOLOv7: The Proposed One-stage Detector for Aerial Scenario Detection (abstract)
A Low-Cost EEG-Based System for Measuring and Forecasting Levels of Alertness with Long Short-Term Memory (abstract)
Real-Time Multi-Face Emotion Recognition for Enhancing Student Engagement in Classroom Environments Using Low-Power IoT Devices (abstract)
MEPC: Multi-level Product Category Recognition Image Dataset (abstract)
14:40-16:00 Session 16A: Secured and Intelligent Multimedia Systems
Location: Danang 1
14:40
EPEdit: Redefining Image Editing with Generative AI and User-Centric Design (abstract)
15:00
A Simple Approach towards Frame Filtering for Efficient Gaussian Splatting (abstract)
15:20
CESE: A Clip-based Event Search Engine for AI Challenge HCMC 2024 (abstract)
15:40
GeoSI: An Interesting Interactive System for Retrieving and Mapping News from Multiple Online Sources (abstract)
14:40-16:00 Session 16B: Lifelog and Multimedia Event Retrieval
Location: Danang 2
14:40
Addressing Ambiguous Queries in Video Retrieval with Advanced Temporal Search (abstract)
15:00
SnapSeek: A Multimodal Video Retrieval System with Context Awareness for AI Challenge 2024 (abstract)
15:20
ArtemisSearch: A Multimodal Search Engine for Efficient Video Log-Life Event Retrieval Using Time-Segmented Queries and Vision Transformer-based Feature Extraction (abstract)
15:40
KPI: Knowledge-based Processing for Interactive Video Retrieval (abstract)
14:40-16:00 Session 16C: Human Computer Interaction
Chair:
Location: Danang 3
14:40
MRClassroom: A Mixed-Reality Interface for Improving Remote Students' Presence in Hybrid Classrooms (abstract)
15:00
Multi-Agent Chatbot for Efficient Interaction with Blockchain APIs (abstract)
15:20
Evaluation of AI-Based Assistant Representations on User Interaction in Virtual Explorations (abstract)
15:40
A Novel Simulation-Driven Data Enrichment Approach to Improve Machine Learning Algorithm Performance (abstract)
14:40-16:00 Session 16D: Applied Operations Research and Optimization
Location: Son Tra
14:40
Exemplar-Embed Complex Matrix Factorization with Elastic Net Penalty: An Advanced Approach for Data Representation (abstract)
PRESENTER: Manh Quan Bui
15:00
Modeling Information Diffusion in Bibliographic Networks using Pretopology (abstract)
15:20
Optimizing Credit Scoring Models for Decentralized Financial Applications (abstract)
15:40
A Historical GPS Trajectory-Based Framework for Predicting Bus Travel Time (abstract)
16:00-17:20 Session 17A: Secured and Intelligent Multimedia Systems
Location: Danang 1
16:00
Media Certificate Authority: A System to Ensure Media Content Originality for Daily Lifelog Media Collection (abstract)
16:20
Mouse Paw Inflammation Evaluation with Segment Anything and Lightness Classification (abstract)
16:40
Knowledge Distillation for Lumbar Spine X-ray Classification (abstract)
17:00
Exploring Prompt Injection: Methodologies and Risks with an Interactive Chatbot Demonstration (abstract)
17:20
Motorcycle Helmet Detection Benchmarking (abstract)
16:00-17:20 Session 17B: Lifelog and Multimedia Event Retrieval
Location: Danang 2
16:00
RAPID: Retrieval-Augmented Parallel Inference Drafting for Text-Based Video Event Retrieval (abstract)
16:20
A Hybrid Video Retrieval System Using CLIP and BEiT-3 for Enhanced Object and Contextual Understanding (abstract)
16:40
VizQuest: Enhanced Video Event Retrieval Using Fusion and Temporal Modeling (abstract)
17:00
Unveiling Peripheral Information: A Context-Aware Video Retrieval Approach (abstract)
16:00-17:20 Session 17C: Human Computer Interaction
Location: Danang 3
16:00
Now I Know What I am Eating: Real-time Tracking and Nutritional Insights Using VietFood67 to Enhance User Experience (abstract)
16:20
Towards Enabling Tangible Interaction with Physical Objects in Virtual Reality Desktop Workplaces (abstract)
16:40
Meal Plan App: Personalized meal plans based on personal unique needs. (abstract)
17:00
Budget-Aware Keyboardless Interaction (abstract)