Publications

2025

  1. C&G
    SHREC 2025: Retrieval of Optimal Objects for Multi-modal Enhanced Language and Spatial Assistance (ROOMELSA)
    Trong-Thuan Nguyen, Viet-Tham Huynh, Quang-Thuc Nguyen, Hoang-Phuc Nguyen, Long Le Bao, and 28 more authors
    Computers & Graphics (Special Section on 3DOR 2025), 2025
    (Q2, IF = 2.8 in 2024)
  2. CBM
    DYNAFormer: Enhancing transformer segmentation with dynamic anchor mask for medical imaging
    Tan-Cong Nguyen, Kim Anh Phung, Thao Thi Phuong Dao, Trong-Hieu Nguyen-Mau, Thuc Nguyen-Quang, and 5 more authors
    Computers in Biology and Medicine, 2025
    (Q1, IF = 6.3 in 2024)
  3. IEEE Access
    LookupForensics: A Large-Scale Multi-Task Dataset for Multi-Phase Image-Based Fact Verification
    Shuhan Cui, Huy H. Nguyen, Trung-Nghia Le, Chun-Shien Lu, and Isao Echizen
    IEEE Access, 2025
    (Q1, IF = 3.9 in 2022)
  4. NCA
    GUNNEL: Guided Mixup Augmentation and Multi-Model Fusion for Aquatic Animal Segmentation
    Minh-Quan Le*Trung-Nghia Le*, Tam V. Nguyen, Isao Echizen, and Minh-Triet Tran
    Neural Computing & Applications, 2025
    (Q1, IF = 4.5 in 2023)
  5. SoICT
    Hierarchical Multi-Modal Retrieval for News Image Captioning
    Minh-Loi Nguyen*, Xuan-Vu Le*, Long-Bao Nguyen, Hoang-Bach Ngo, and Trung-Nghia Le
    In International Symposium on Information and Communication Technology (SoICT), 2025
    (Oral)
  6. SoICT
    Vortex: Multi-Modal Fusion System for Intelligent Video Retrieval
    Duc-Tho Nguyen, Hieu-Hoc Tran-Minh, Khanh-Hoa Lam, Hoang-Nhut Ly, Huu-Phuc Huynh, and 2 more authors
    In International Symposium on Information and Communication Technology (SoICT), 2025
    (Oral)
  7. SoICT
    Forged Calamity: Benchmark for Cross-Domain Synthetic Disaster Detection in the Age of Diffusion
    Duc-Manh Phan*, Quoc-Duy Tran*, Duy-Khang Do*, Anh-Tuan Vo, Hai-Dang Nguyen, and 7 more authors
    In International Symposium on Information and Communication Technology (SoICT), 2025
    (Oral)
  8. SoICT
    CIAN: Multi-Stage Framework for Event-Enriched Image Captioning via Retrieval-Augmented Generation
    Thi Thu Hien Trinh, and Trung-Nghia Le
    In International Symposium on Information and Communication Technology (SoICT), 2025
  9. SoICT
    VisionGuard: Synergistic Framework for Helmet Violation Detection
    Thanh-Hai Nguyen*, Thinh-Phuc Nguyen*, Gia-Huy Dinh*, Lam-Huy Nguyen*, Minh-Triet Tran, and 1 more author
    In International Symposium on Information and Communication Technology (SoICT), 2025
  10. SoICT
    Edit3DGS: Unified Framework for Dynamic Head Editing via 2D Instruction-Guided Diffusion and 3D Gaussian Splatting
    Duy-Dat Tran, and Trung-Nghia Le
    In International Symposium on Information and Communication Technology (SoICT), 2025
  11. SoICT
    Visual Retrieval-Augmented Generation for Silhouette-Guided Animal Art
    Quoc-Duy Tran, Anh-Tuan Vo, Minh-Triet Tran, and Trung-Nghia Le
    In International Symposium on Information and Communication Technology (SoICT), 2025
  12. SoICT
    Exploring Multi-Modal Large Language Models and Two-Stage Fine-Tuning for Fashion Image Retrieval
    Nguyen Hoang Cao*, Hoang Bui Le*, Nam Vo Hoang*, and Trung-Nghia Le
    In International Symposium on Information and Communication Technology (SoICT), 2025
  13. SoICT
    DTD-Mamba: Dual Teacher Distillation for Mamba in Head and Neck Abscess Segmentation
    Thao Thi Phuong Dao, Tan-Cong Nguyen, Trong-Le Do, Mai-Khiem Tran, Minh-Khoi Pham, and 3 more authors
    In International Symposium on Information and Communication Technology (SoICT), 2025
    (Oral)
  14. SoICT
    MasHeNe: A Benchmark for Head and Neck CT Mass Segmentation using Window-Enhanced Mamba with Frequency-Domain Integration
    Thao Thi Phuong Dao, Tan-Cong Nguyen, Nguyen Chi Thanh, Truong Hoang Viet, Trong-Le Do, and 5 more authors
    In International Symposium on Information and Communication Technology (SoICT), 2025
    (Oral)
  15. SoICT
    AEye: Avian Monitoring from Streaming Videos
    Kasturi Jamale*, Kunal Agrawal*, Ba-Thinh Tran-Le, Jayanth Merakanapalli, Soham Chousalkar, and 3 more authors
    In International Symposium on Information and Communication Technology (SoICT), 2025
    (Oral)
  16. SoICT
    Research Paper Quality Recognition Through Textual Feature Analysis
    Saikiran Korla*, Sadwik Gummadavelli*Trung-Nghia Le, Minh-Triet Tran, and Tam V. Nguyen
    In International Symposium on Information and Communication Technology (SoICT), 2025
  17. OzCHI
    MultiPointing: Supporting Multiple Users’ Pointing in Hybrid Meetings
    Dinh-Thuan Duong-Le, Duy-Nam Ly, Trung-Nghia Le, Vinh-Tiep Nguyen, and Khanh-Duy Le
    In Australian Conference on Human-Computer Interaction (OzCHI), 2025
    (B Rank) (Late Breaking Work)
  18. ACM MM
    OpenEvents V1: Large-Scale Benchmark Dataset for Multimodal Event Grounding
    Hieu Nguyen, Phuc-Tan Nguyen, Thien-Phuc Tran, Minh-Quang Nguyen, Tam V. Nguyen, and 2 more authors
    In ACM International Conference on Multimedia (ACM MM), 2025
    (A* Rank) (Dataset)
  19. ACM MM
    Event-Enriched Image Analysis Grand Challenge at ACM Multimedia 2025
    Thien-Phuc Tran*, Minh-Quang Nguyen*, Minh-Triet Tran, Tam V. Nguyen, Trong-Le Do, and 5 more authors
    In ACM International Conference on Multimedia (ACM MM), 2025
    (A* Rank) (Challenge)
  20. ACM MM
    Multi-Level CLS Token Fusion for Contrastive Learning in Endoscopy Image Classification
    Y Hop Nguyen, Doan Anh Phan Huu, Trung Thai Tran, Nhat Nam Mai, Van Toi Giap, and 2 more authors
    In ACM International Conference on Multimedia (ACM MM), 2025
    (A* Rank) (Challenge)
  21. ACM MM
    ReCap: Event-Aware Image Captioning with Article Retrieval and Semantic Gaussian Normalization
    Thinh-Phuc Nguyen, Thanh-Hai Nguyen, Gia-Huy Dinh, Lam-Huy Nguyen, Minh-Triet Tran, and 1 more author
    In ACM International Conference on Multimedia (ACM MM), 2025
    (A* Rank) (Challenge)
  22. ACM MM
    EVENT-Retriever: Event-Aware Multimodal Image Retrieval for Realistic Captions
    Dinh-Khoi Vo, Van-Loc Nguyen, Minh-Triet Tran, and Trung-Nghia Le
    In ACM International Conference on Multimedia (ACM MM), 2025
    (A* Rank) (Challenge)
  23. ACM MM
    Streamlining Virtual KOL Generation Through Modular Generative AI Architecture
    Tan-Hiep To, Duy-Khang Nguyen, Minh-Triet Tran, and Trung-Nghia Le
    In ACM International Conference on Multimedia (ACM MM), 2025
    (A* Rank) (Demo)
  24. ACM MM
    Advancing Fashion Design Through Intelligent Sketchpad Studio
    Nhu-Binh Nguyen-Truc*, Nhu-Vinh Hoang*, Tam V. Nguyen, Minh-Triet Tran, and Trung-Nghia Le
    In ACM International Conference on Multimedia (ACM MM), 2025
    (A* Rank) (Demo)
  25. MICCAI
    Learning Disentangled Stain and Structural Representations for Semi-Supervised Histopathology Segmentation
    Ha-Hieu Pham, Nguyen Lan Vi Vu, Thanh-Huy Nguyen, Ulas Bagci, Min Xu, and 2 more authors
    In MICCAI Workshop on Computational Pathology with Multimodal Data (COMPAYL), 2025
  26. MAPR
    SAMURAI: Shape-Aware Multimodal Retrieval for 3D Object Identification
    Dinh-Khoi Vo*, Van-Loc Nguyen*, Minh-Triet Tran, and Trung-Nghia Le
    In International Conference on Multimedia Analysis and Pattern Recognition (MAPR), 2025
  27. CBMI
    GenFlow: Interactive Modular System for Image Generation
    Duc-Hung Nguyen*, Huu-Phuc Huynh*, Minh-Triet Tran, and Trung-Nghia Le
    In International Conference on Content-Based Multimedia Indexing (CBMI), 2025
  28. ICCCI
    Automated Image Recognition Framework
    Quang-Binh Nguyen*, Trong-Vu Hoang*, Do Tran Ngoc, Tam V. Nguyen, Minh-Triet Tran, and 1 more author
    In International Conference on Computational Collective Intelligence (ICCCI), 2025
    (B Rank)
  29. ICCCI
    Chat2Edit: A Prompt-based Image Editor with Live Feedback and Parameter Recommendation
    Tin-Nghia Le, Phuong-Dao Duong Dinh, Quang Huy Che, Duc-Vu Nguyen, Vinh-Tiep Nguyen, and 3 more authors
    In International Conference on Computational Collective Intelligence (ICCCI), 2025
    (B Rank)
  30. ICCCI
    FaR: Enhancing Multi-Concept Text-to-Image Diffusion via Concept Fusion and Localized Refinement
    Gia-Nghia Tran, Quang-Huy Che, Trong-Tai Dam Vu, Bich-Nga Pham, Vinh-Tiep Nguyen, and 2 more authors
    In International Conference on Computational Collective Intelligence (ICCCI), 2025
    (B Rank)
  31. WACV
    CamoFA: A Learnable Fourier-based Augmentation for Camouflage Segmentation
    Minh-Quan Le, Minh-Triet Tran, Trung-Nghia Le, Tam V. Nguyen, and Thanh-Toan Do
    In Winter Conference on Applications of Computer Vision (WACV), 2025
    (A Rank)
  32. FAIR
    Comprehensive Analysis of AI-Synthetic Image Detection Architectures
    Thien-Hoa Hoang-Don, Tien-Dat Nguyen, Nam-Anh Nguyen, and Trung-Nghia Le
    In National Conference on Fundamental and Applied IT Research (FAIR), 2025
  33. GenKOL: Modular Generative AI Framework For Scalable Virtual KOL Generation
    Tan-Hiep To, Duy-Khang Nguyen, Tam V. Nguyen, Minh-Triet Tran, and Trung-Nghia Le
    arXiv preprint arXiv:2509.14927, 2025
  34. KiseKloset: Comprehensive System For Outfit Retrieval, Recommendation, And Try-On
    Thanh-Tung Phan-Nguyen, Khoi-Nguyen Nguyen-Ngoc, Tam V. Nguyen, Minh-Triet Tran, and Trung-Nghia Le
    arXiv preprint arXiv:2506.23471, 2025
  35. Interactive Interface For Semantic Segmentation Dataset Synthesis
    Ngoc-Do Tran, Minh-Tuan Huynh, Tam V. Nguyen, Minh-Triet Tran, and Trung-Nghia Le
    arXiv preprint arXiv:2506.23470, 2025
  36. PrefPaint: Enhancing Image Inpainting through Expert Human Feedback
    Duy-Bao Bui, Hoang-Khang Nguyen, and Trung-Nghia Le
    arXiv preprint arXiv:2506.21834, 2025
  37. TaleForge: Interactive Multimodal System for Personalized Story Creation
    Minh-Loi Nguyen, Quang-Khai Le, Tam V. Nguyen, Minh-Triet Tran, and Trung-Nghia Le
    arXiv preprint arXiv:2506.21832, 2025
  38. VisionGuard: Synergistic Framework for Helmet Violation Detection
    Thinh-Phuc Nguyen*, Thanh-Hai Nguyen*, Gia-Huy Dinh*, Lam-Huy Nguyen*, Minh-Triet Tran, and 1 more author
    arXiv preprint arXiv:2506.21005, 2025
  39. Shape2Animal: Creative Animal Generation from Natural Silhouettes
    Quoc-Duy Tran, Anh-Tuan Vo, Dinh-Khoi Vo, Tam V. Nguyen, Minh-Triet Tran, and 1 more author
    arXiv preprint arXiv:2506.20616, 2025
  40. ShowFlow: From Robust Single Concept to Condition-Free Multi-Concept Generation
    Trong-Vu Hoang, Quang-Binh Nguyen, Thanh-Toan Do, Tam V. Nguyen, Minh-Triet Tran, and 1 more author
    arXiv preprint arXiv:2506.18493, 2025
  41. CPAM: Context-Preserving Adaptive Manipulation for Zero-Shot Real Image Editing
    Dinh-Khoi Vo, Thanh-Toan Do, Tam V. Nguyen, Minh-Triet Tran, and Trung-Nghia Le
    arXiv preprint arXiv:2506.18438, 2025

2024

  1. ML
    Artificial Intelligence for Laryngoscopy in Vocal Fold Diseases: A Review of Dataset, Technology, and Ethics
    Thao Thi Phuong Dao, Tan-Cong Nguyen, Viet-Tham Huynh, Xuan-Hai Bui, Trung-Nghia Le, and 1 more author
    Machine Learning, 2024
    (Q1, IF = 4.3 in 2023) (ACML 2024, Journal track)
  2. IIM
    Improving Laryngoscopy Image Analysis through Integration of Global Information and Local Features in VoFoCD Dataset
    Thao Thi Phuong Dao, Tuan-Luc Huynh, Minh-Khoi Pham, Trung-Nghia Le, Tan-Cong Nguyen, and 5 more authors
    Imaging Informatics in Medicine, 2024
    (Q1, IF = 4.4 in 2022)
  3. IEEE Access
    eKYC-DF: A Large-Scale Deepfake Dataset for Developing and Evaluating eKYC Systems
    Hichem Felouat, Huy H. Nguyen, Trung-Nghia Le, Junichi Yamagishi, and Isao Echizen
    IEEE Access, 2024
    (Q1, IF = 3.9 in 2022)
  4. IEEE Access
    Analysis of Fine-grained Counting Methods for Masked Face Counting: A Comparative Study
    Khanh-Duy Nguyen, Huy H. Nguyen, Trung-Nghia Le, Junichi Yamagishi, and Isao Echizen
    IEEE Access, 2024
    (Q1, IF = 3.9 in 2022)
  5. SoICT
    Language-Guided Video Object Segmentation
    Minh Duy Phan, Minh Huan Le, Minh-Triet Tran, and Trung-Nghia Le
    In International Symposium on Information and Communication Technology (SoICT), 2024
    (Oral)
  6. SoICT
    VisChronos: Revolutionizing Image Captioning Through Real-Life Events
    Phuc-Tan Nguyen*, Hieu Nguyen*, Minh-Triet Tran, and Trung-Nghia Le
    In International Symposium on Information and Communication Technology (SoICT), 2024
    (Oral)
  7. SoICT
    EPEdit: Redefining Image Editing with Generative AI and User-Centric Design
    Hoang-Phuc Nguyen*, Dinh-Khoi Vo*, Trong-Le Do, Hai-Dang Nguyen, Tan-Cong Nguyen, and 5 more authors
    In International Symposium on Information and Communication Technology (SoICT), 2024
    (Oral)
  8. SoICT
    MythraGen: Two-Stage Retrieval Augmented Art Generation Framework
    Quang-Khai Le*, Cong-Long Nguyen*, Minh-Triet Tran, and Trung-Nghia Le
    In International Symposium on Information and Communication Technology (SoICT), 2024
    (Oral)
  9. SoICT
    KidRisk: Benchmark Dataset for Children Dangerous Action Recognition
    Minh-Kha Nguyen*, Trung-Hieu Do*, Kim Anh Phung, Thao Thi Phuong Dao, Minh-Triet Tran, and 1 more author
    In International Symposium on Information and Communication Technology (SoICT), 2024
    (Oral)
  10. SoICT
    DanceDuo: Bridging Human Movement and AI Choreography
    Gia-Cat Bui-Le, Tuong-Vy Truong-Thuy, Hai-Dang Nguyen, and Trung-Nghia Le
    In International Symposium on Information and Communication Technology (SoICT), 2024
    (Oral)
  11. SoICT
    Budget-Aware Keyboardless Interaction
    Quang-Thang Nguyen*, Gia-Phuc Song-Dong*, Minh-Triet Tran, and Trung-Nghia Le
    In International Symposium on Information and Communication Technology (SoICT), 2024
    (Oral)
  12. SoICT
    Decoding Deepfakes: Caption Guided Learning for Robust Deepfake Detection
    Y-Hop Nguyen, and Trung-Nghia Le
    In International Symposium on Information and Communication Technology (SoICT), 2024
  13. SoICT
    Minimalist Preprocessing Approach for Image Synthesis Detection
    Hoai-Danh Vo, and Trung-Nghia Le
    In International Symposium on Information and Communication Technology (SoICT), 2024
  14. SoICT
    Hybrid Compression: Integrating Pruning and Quantization for Optimized Neural Networks
    Minh-Loi Nguyen*, Long-Bao Nguyen*, Van-Hieu Huynh*, Minh-Triet Tran, and Trung-Nghia Le
    In International Symposium on Information and Communication Technology (SoICT), 2024
  15. SoICT
    Motion Analysis in Static Images
    Kunal Agrawal, Vatsa Patel, Reema Tharra, Trung-Nghia Le, Minh-Triet Tran, and 1 more author
    In International Symposium on Information and Communication Technology (SoICT), 2024
  16. SoICT
    AI-Generated Image Recognition via Fusion of CNNs and Vision Transformers
    Xuan-Bach Mai, Hoang-Tung Vu, Hoang-Minh Nguyen-Huu, Quoc-Nghia Nguyen, Minh-Triet Tran, and 1 more author
    In International Symposium on Information and Communication Technology (SoICT), 2024
  17. ACCV
    Rethinking Sampling for Music-Driven Long-Term Dance Generation
    Tuong-Vy Truong-Thuy, Gia-Cat Bui-Le, Hai-Dang Nguyen, and Trung-Nghia Le
    In Asian Conference on Computer Vision (ACCV), 2024
    (B Rank)
  18. ACCV
    CrossPAR: Enhancing Pedestrian Attribute Recognition with Vision-Language Fusion and Human-Centric Pre-training
    Bach-Hoang Ngo, Si-Tri Ngo, Phu-Duc Le, Quang-Minh Phan, Minh-Triet Tran, and 1 more author
    In Asian Conference on Computer Vision (ACCV), 2024
    (B Rank)
  19. ISMAR
    Immersive Spatiotemporal Travel in Virtual Reality
    Thanh Ngoc-Dat Tran, Viet-Tham Huynh, Poojitha Moganti, Trung-Nghia Le, Minh-Triet Tran, and 1 more author
    In International Symposium on Mixed and Augmented Reality (ISMAR), 2024
    (A* Rank, Poster)
  20. ISMAR
    Urban Traffic Planning Simulation with Time and Weather Dynamics
    Tam V. Nguyen, Thanh Ngoc-Dat Tran, Viet-Tham Huynh, Vatsa S Patel, Umang Jai, and 3 more authors
    In International Symposium on Mixed and Augmented Reality (ISMAR), 2024
    (A* Rank, Poster)
  21. CVPRW
    Synthetic Is All You Need For Semantic Segmentation
    Minh-Tuan Huynh*, Ngoc-Do Tran*, Minh-Triet Tran, and Trung-Nghia Le
    In SyntaGen Workshop, CVPR, 2024
    (First Prize)
  22. MAPR
    Rethinking Text-to-Image as Semantic-Aware Data Augmentation for Indoor Scene Recognition
    Trong-Vu Hoang, Quang-Binh Nguyen, Dinh-Khoi Vo, Hoai-Danh Vo, Minh-Triet Tran, and 1 more author
    In International Conference on Multimedia Analysis and Pattern Recognition (MAPR), 2024
  23. MAPR
    Evaluation of Image Matching for Art Skills Assessment
    Asaad Alghamdi, Michael Poor, Trung-Nghia Le, and Tam V. Nguyen
    In International Conference on Multimedia Analysis and Pattern Recognition (MAPR), 2024
  24. MAPR
    Masked Face Recognition on Limited Training Data
    Phuoc-Sang Pham, Minh-Kha Nguyen, Minh-Hien Le, Minh-Triet Tran, and Trung-Nghia Le
    In International Conference on Multimedia Analysis and Pattern Recognition (MAPR), 2024
  25. CHI
    iCONTRA: Toward Thematic Collection Design Via Interactive Concept Transfer
    Dinh-Khoi Vo*, Duy-Nam Ly*, Khanh-Duy Le, Tam V. Nguyen, Minh-Triet Tran, and 1 more author
    In ACM Conference on Human Factors in Computing Systems (CHI), 2024
    (A* Rank, Late Breaking Work)
  26. CHI
    ARtVista: Gateway To Empower Anyone Into Artist
    Trong-Vu Hoang*, Quang-Binh Nguyen*, Duy-Nam Ly, Khanh-Duy Le, Tam V. Nguyen, and 2 more authors
    In ACM Conference on Human Factors in Computing Systems (CHI), 2024
    (A* Rank, Late Breaking Work)
  27. ISBI
    PISeg: Polyp Instance Segmentation with Texture Denoising and Adaptive Region
    Tan-Cong Nguyen, Kim Anh Phung, Tien-Phat Nguyen, Thao Dao, Cong Nhan Pham, and 5 more authors
    In IEEE International Symposium on Biomedical Imaging (ISBI), 2024
    (A Rank)
  28. AAAI
    MaskDiff: Modeling Mask Distribution with Diffusion Probabilistic Model for Few-Shot Instance Segmentation
    Minh-Quan Le, Tam V. Nguyen, Trung-Nghia Le, Thanh-Toan Do, Minh N. Do, and 1 more author
    In AAAI Conference on Artificial Intelligence, 2024
    (A* Rank, Oral)
  29. MediaEval
    Medico Multimedia Task at MediaEval 2023: Transparent Tracking of Spermatozoa
    Vajira Thambawita, Andrea Storås, Tuan-Luc Huynh, Hai-Dang Nguyen, Minh-Triet Tran, and 5 more authors
    In Multimedia Evaluation Workshop (MediaEval), 2024
  30. MMM
    NearbyPatchCL: Leveraging Nearby Patches for Self-Supervised Patch-Level Multi-Class Classification in Whole-Slide Images
    Gia-Bao Le*, Van-Tien Nguyen*Trung-Nghia Le, and Minh-Triet Tran
    In International Conference on Multimedia Modeling (MMM), 2024
    (B Rank, Oral)

2023

  1. C&G
    SketchANIMAR: Sketch-based 3D Animal Fine-Grained Retrieval
    Trung-Nghia Le, Tam V. Nguyen, Minh-Quan Le, Trong-Thuan Nguyen, Viet-Tham Huynh, and 29 more authors
    Computers & Graphics (Special Section on 3DOR 2023), 2023
    (Q2, IF = 2.62 in 2022)
  2. C&G
    TextANIMAR: Text-based 3D Animal Fine-Grained Retrieval
    Trung-Nghia Le, Tam V. Nguyen, Minh-Quan Le, Trong-Thuan Nguyen, Viet-Tham Huynh, and 28 more authors
    Computers & Graphics (Special Section on 3DOR 2023), 2023
    (Q2, IF = 2.62 in 2022)
  3. IEEE OJSP
    Purifying Adversarial Images using Adversarial Autoencoders with Conditional Normalizing Flows
    Yi Ji, Trung-Nghia Le, Huy H. Nguyen, and Isao Echizen
    IEEE Open Journal of Signal Processing, 2023
    (ICIP, Journal Track) (Q2, IF = 2.89 in 2022)
  4. AIR
    Image Synthesis: A Review of Methods, Datasets, Evaluation Metrics, and Future Outlook
    Samah Saeed Baraheem, Trung-Nghia Le, and Tam V. Nguyen
    Artificial Intelligence Review, 2023
    (Q1, IF = 12.0 in 2022)
  5. SoICT
    Multi-Branch Network for Imagery Emotion Prediction
    Quoc-Bao Ninh, Hai-Chan Nguyen, Triet Huynh, and Trung-Nghia Le
    In International Symposium on Information and Communication Technology (SoICT), 2023
  6. RIVF
    Budget-Aware Road Semantic Segmentation in Unseen Foggy Scenes
    Tan-Hiep To, Thanh-Nghi Do, Duc-Nghia Ngo, Minh-Triet Tran, and Trung-Nghia Le
    In International Conference on Computing and Communication Technologies, Research, Innovation, and Vision for the Future (RIVF), 2023
  7. RIVF
    Ensemble Learning for Vietnamese Scene Text Spotting in Urban Environments
    Hieu Nguyen*, Cong-Hoang Ta*, Phuong-Thuy Le-Nguyen*, Minh-Triet Tran, and Trung-Nghia Le
    In International Conference on Computing and Communication Technologies, Research, Innovation, and Vision for the Future (RIVF), 2023
  8. PSIVT
    Efficient 3D Brain Tumor Segmentation with Axial-Coronal-Sagittal Embedding
    Tuan-Luc Huynh, Thanh-Danh Le, Tam V. Nguyen, Trung-Nghia Le, and Minh-Triet Tran
    In Pacific-Rim Symposium on Image and Video Technology (PSIVT), 2023
    (C Rank - Best Paper Award)
  9. PSIVT
    Cluster-based Video Summarization with Temporal Context Awareness
    Hai-Dang Huynh-Lam*, Ngoc-Phuong Ho-Thi*, Minh-Triet Tran, and Trung-Nghia Le
    In Pacific-Rim Symposium on Image and Video Technology (PSIVT), 2023
    (C Rank)
  10. ISMAR
    DM-VTON: Distilled Mobile Real-time Virtual Try-On
    Khoi-Nguyen Nguyen-Ngoc, Thanh-Tung Phan-Nguyen, Khanh-Duy Le, Tam V. Nguyen, Minh-Triet Tran, and 1 more author
    In International Symposium on Mixed and Augmented Reality (ISMAR), 2023
    (A* Rank, Nominated for Best Poster)
  11. ISMAR
    VIDES: Virtual Interior Design via Natural Language and Visual Guidance
    Minh-Hien Le, Chi-Bien Chu, Khanh-Duy Le, Tam V. Nguyen, Minh-Triet Tran, and 1 more author
    In International Symposium on Mixed and Augmented Reality (ISMAR), 2023
    (A* Rank, Poster)
  12. WACV
    Analysis of Master Vein Attacks on Finger Vein Recognition Systems
    Huy H. Nguyen, Trung-Nghia Le, Junichi Yamagishi, and Isao Echizen
    In Winter Conference on Applications of Computer Vision (WACV), 2023
    (A Rank)
  13. WACV
    Closer Look at the Transferability of Adversarial Examples: How They Fool Different Models Differently
    Futa Waseda, Sosuke Nishikawa, Trung-Nghia Le, Huy H. Nguyen, and Isao Echizen
    In Winter Conference on Applications of Computer Vision (WACV), 2023
    (A Rank)

2022

  1. ITE
    Current Status of Deepfake Generation and Detection (Deepfakeの生成と検出の現状)
    Trung-Nghia Le, Huy H. Nguyen, Junichi Yamagishi, and Isao Echizen
    The Journal of The Institute of Image Information and Television Engineers (ITE), Jul 2022
    (In Japanese, ISSN 1342-6907) — Special Feature: AI and Cyber Security in the Infodemic Era
  2. MVA
    Contextual Guided Segmentation Framework for Semi-supervised Video Instance Segmentation
    Trung-Nghia Le, Tam V. Nguyen, and Minh-Triet Tran
    Machine Vision and Applications (MVA), Jul 2022
    (Q2, IF = 3.3 in 2022)
  3. Springer Book Chapter
    Robust Deepfake On Unrestricted Media: Generation And Detection
    Trung-Nghia Le, Huy H. Nguyen, Junichi Yamagishi, and Isao Echizen
    In Frontiers in Fake Media Generation and Detection, Jul 2022
  4. IEEE T-IP
    Camouflaged Instance Segmentation In-The-Wild: Dataset, Method, and Benchmark Suite
    Trung-Nghia Le, Yubo Cao, Tan-Cong Nguyen, Minh-Quan Le, Khanh-Duy Nguyen, and 3 more authors
    IEEE Transactions on Image Processing (T-IP), Jul 2022
    (Q1, IF = 10.6 in 2022)
  5. MediaEval
    Tail-Aware Sperm Analysis for Transparent Tracking of Spermatozoa
    Tuan-Luc Huynh, Huu-Hung Nguyen, Xuan-Nhat Hoang, Thao Thi Phuong Dao, Tien-Phat Nguyen, and 4 more authors
    In Multimedia Evaluation Workshop (MediaEval), Jul 2022
  6. RIVF
    Multilingual Communication System with Deaf Individuals Utilizing Natural and Visual Languages
    Tuan-Luc Huynh*, Khoi-Nguyen Nguyen-Ngoc*, Chi-Bien Chu*, Minh-Triet Tran, and Trung-Nghia Le
    In International Conference on Computing and Communication Technologies, Research, Innovation, and Vision for the Future (RIVF), Jul 2022
  7. WIFS
    Rethinking Adversarial Examples for Location Privacy Protection
    Trung-Nghia Le*, Ta Gu*, Huy H. Nguyen, and Isao Echizen
    In IEEE International Workshop on Information Forensics and Security (WIFS), Jul 2022
  8. ISMAR
    Public Speaking Simulator with Speech and Audience Feedback
    Bao Truong, Trung-Nghia Le, Khanh-Duy Le, Minh-Triet Tran, and Tam V. Nguyen
    In IEEE International Symposium on Mixed and Augmented Reality (ISMAR), Jul 2022
    (A* Rank, Poster)
  9. CVPRW
    GUNNEL: Guided Mixup Augmentation and Multi-View Fusion for Aquatic Animal Segmentation
    Minh-Quan Le*Trung-Nghia Le*, Tam V. Nguyen, Isao Echizen, and Minh-Triet Tran
    In CV4Animal Workshop, CVPR, Jul 2022
    (Invited Poster)

2021

  1. JoI
    Masked Face Analysis via Multi-task Deep Learning
    Vatsa S. Patel, Zhongliang Nie, Trung-Nghia Le, and Tam V. Nguyen
    Journal of Imaging, Jul 2021
    (Q2, IF = 3.2 in 2022)
  2. IEEE Access
    MirrorNet: Bio-Inspired Camouflaged Object Segmentation
    Jinnan Yan, Trung-Nghia Le, Khanh-Duy Nguyen, Minh-Triet Tran, Thanh-Toan Do, and 1 more author
    IEEE Access, Jul 2021
    (Q1, IF = 3.9 in 2022)
  3. FG4COVID19
    Effectiveness of Detection-based and Regression-based Approaches for Estimating Mask-Wearing Ratio
    Khanh-Duy Nguyen, Huy H. Nguyen, Trung-Nghia Le, Junichi Yamagishi, and Isao Echizen
    In FG4COVID19, Jul 2021
  4. ICCV
    OpenForensics: Large-Scale Challenging Dataset For Multi-Face Forgery Detection And Segmentation In-The-Wild
    Trung-Nghia Le, Huy H. Nguyen, Junichi Yamagishi, and Isao Echizen
    In International Conference on Computer Vision (ICCV), Jul 2021
    (A* Rank, Acceptance rate 25.9%)
  5. CVPRW
    Fashion-Guided Adversarial Attack on Person Segmentation
    Marc Treu*Trung-Nghia Le*, Huy H. Nguyen*, Junichi Yamagishi, and Isao Echizen
    In CVPR Workshop on Media Forensics, Jul 2021
    (*Equal Contributions)
  6. AAAI
    Interactive Video Object Mask Annotation
    Trung-Nghia Le, Tam V. Nguyen, Quoc-Cuong Tran, Lam Nguyen, Trung-Hieu Hoang, and 2 more authors
    In AAAI Conference on Artificial Intelligence, Jul 2021
    (A* Rank, Demo)
  7. AAAI
    CamouFinder: Finding Camouflaged Instances in Images
    Trung-Nghia Le, Vuong Nguyen, Cong Le, Tan-Cong Nguyen, Minh-Triet Tran, and 1 more author
    In AAAI Conference on Artificial Intelligence, Jul 2021
    (A* Rank, Demo)

2020

  1. MM
    Text-to-Image Synthesis via Aesthetic Layout
    Samah Saeed Baraheem, Trung-Nghia Le, and Tam V. Nguyen
    In International Conference on Multimedia, Jul 2020
    (A* Rank, Demo)
  2. CVPRW
    Multi-Referenced Guided Instance Segmentation Framework for Semi-supervised Video Instance Segmentation
    Minh-Triet Tran, Trung-Hieu Hoang, Tam V. Nguyen, Trung-Nghia Le, E-Ro Nguyen, and 4 more authors
    In CVPR Workshop on DAVIS Challenge on Video Object Segmentation, Jul 2020
    (4th place)
  3. CVPRW
    iTASK: Intelligent Traffic Analysis Software Kit
    Minh-Triet Tran, Tam V. Nguyen, Trung-Hieu Hoang, Trung-Nghia Le, Khac-Tuan Nguyen, and 22 more authors
    In CVPR Workshop on AI City Challenge, Jul 2020
    (10th place on Track 1, 26th on Track 2, 5th on Track 4)
  4. IV
    Attention R-CNN for Accident Detection
    Trung-Nghia Le, Akihiro Sugimoto, Shintaro Ono, and Hiroshi Kawasaki
    In Intelligent Vehicles Symposium (IV), Jul 2020
    (B Rank)
  5. WACV
    Toward Interactive Self-Annotation For Video Object Bounding Box: Recurrent Self-Learning And Hierarchical Annotation Based Framework
    Trung-Nghia Le, Akihiro Sugimoto, Shintaro Ono, and Hiroshi Kawasaki
    In Winter Conference on Applications of Computer Vision (WACV), Jul 2020
    (A Rank)
  6. ITS Japan
    Learning-Based Semi-Automatic Annotation and Accident Detection from Driving Video (in Japanese)
    Trung-Nghia Le, Shintaro Ono, Akihiro Sugimoto, and Hiroshi Kawasaki
    In 18th ITS Symposium, Japan, Jul 2020

2019

  1. CVIU
    Anabranch Network for Camouflaged Object Segmentation
    Trung-Nghia Le, Tam V. Nguyen, Zhongliang Nie, Minh-Triet Tran, and Akihiro Sugimoto
    Computer Vision and Image Understanding (CVIU), Jul 2019
    (Q1, IF = 4.5 in 2024)
  2. CVPRW
    Guided Instance Segmentation Framework for Semi-Supervised Video Instance Segmentation
    Minh-Triet Tran, Trung-Nghia Le, Tam V. Nguyen, Vinh Ton-That, Trung-Hieu Hoang, and 6 more authors
    In CVPR Workshop on DAVIS Challenge on Video Object Segmentation, Jul 2019
    (3rd place)
  3. CVPRW
    Vehicle Re-identification with Learned Representation and Spatial Verification and Abnormality Detection with Multi-Adaptive Vehicle Detectors for Traffic Video Analysis
    Khac-Tuan Nguyen, Trung-Hieu Hoang, Minh-Triet Tran, Trung-Nghia Le, Ngoc-Minh Bui, and 8 more authors
    In CVPR Workshop on AI City Challenge, Jul 2019
    (8th place on Track 3 and 25th place on Track 2)
  4. WACV
    Semantic Instance Meets Salient Object: Study on Video Semantic Salient Instance Segmentation
    Trung-Nghia Le, and Akihiro Sugimoto
    In Winter Conference on Applications of Computer Vision (WACV), Jul 2019
    (A Rank)

2018

  1. IEEE T-IP
    Video Salient Object Detection Using Spatiotemporal Deep Features
    Trung-Nghia Le, and Akihiro Sugimoto
    IEEE Transactions on Image Processing (T-IP), Jul 2018
    (Q1, IF = 10.6 in 2022)
  2. CVPRW
    Context-based Instance Segmentation in Video Sequence
    Minh-Triet Tran, Vinh Ton-That, Trung-Nghia Le, Khac-Tuan Nguyen, Tu V. Ninh, and 4 more authors
    In CVPR Workshop on DAVIS Challenge on Video Object Segmentation, Jul 2018
    (6th place)
  3. WACV
    Balancing Content and Style with Two-Stream FCNs for Style Transfer
    Duc Minh Vo, Trung-Nghia Le, and Akihiro Sugimoto
    In Winter Conference on Applications of Computer Vision (WACV), Jul 2018
    (A Rank)
  4. HCMUS
    Instance Segmentation in Video with Human-Pose Guidance and Data Augmentation (in Vietnamese)
    Minh-Triet Tran, Tu V. Ninh, Tu-Khiem Le, Vinh Ton-That, Khac-Tuan Nguyen, and 2 more authors
    In Scientific Conference of University of Science, VNU-HCM, Vietnam, Jul 2018

2017

  1. BMVC
    Deeply Supervised 3D Recurrent FCN for Salient Object Detection in Videos
    Trung-Nghia Le, and Akihiro Sugimoto
    In British Machine Vision Conference (BMVC), Jul 2017
    (A Rank)
  2. CVPRW
    Instance Re-Identification Flow for Video Object Segmentation
    Trung-Nghia Le, Khac-Tuan Nguyen, Manh-Hung Nguyen-Phan, That-Vinh Ton, Toan-Anh Nguyen, and 7 more authors
    In CVPR Workshop on DAVIS Challenge on Video Object Segmentation, Jul 2017
    (3rd place)
  3. ICMEW
    Spatiotemporal Utilization of Deep Features for Video Saliency Detection
    Trung-Nghia Le, and Akihiro Sugimoto
    In ICME Workshop on Deep Learning for Intelligent Multimedia Analytics (DeLIMMA), Jul 2017
    (Oral presentation)
  4. Region-Based Multiscale Spatiotemporal Saliency for Video
    Trung-Nghia Le, and Akihiro Sugimoto
    arXiv preprint arXiv:1708.01589, Jul 2017

2015

  1. PSIVT
    Contrast Based Hierarchical Spatial-Temporal Saliency for Video
    Trung-Nghia Le, and Akihiro Sugimoto
    In Pacific-Rim Symposium on Image and Video Technology (PSIVT), Jul 2015
    (B Rank) (Oral presentation)

2014

  1. ICARCV
    Essential Keypoints to Enhance Visual Object Recognition with Saliency-based Metrics
    Trung-Nghia Le, Yen-Thanh Le, Minh-Triet Tran, and Anh-Duc Duong
    In International Conference on Control, Automation, Robotics and Vision (ICARCV), Jul 2014
    (A Rank) (Oral presentation)
  2. HCII
    Applying Saliency-based Region of Interest Detection in Developing a Collaborative Active Learning System with Augmented Reality
    Trung-Nghia Le, Yen-Thanh Le, and Minh-Triet Tran
    In International Conference on Human-Computer Interaction (HCII), Jul 2014

2012

  1. IHMSC
    Applying Fast Planar Object Detection in Multimedia Augmentation for Products with Mobile Devices
    Quoc-Minh Bui, Trung-Nghia Le, Vinh-Tiep Nguyen, Minh-Triet Tran, and Anh-Duc Duong
    In International Conference on Intelligent Human-Machine Systems and Cybernetics (IHMSC), Jul 2012
  2. SoICT
    Augmented Media for Traditional Magazines
    Vinh-Tiep Nguyen, Trung-Nghia Le, Quoc-Minh Bui, Minh-Triet Tran, and Anh-Duc Duong
    In International Symposium on Information and Communication Technology (SoICT), Jul 2012
  3. PACIS
    Smart Shopping Assistant: A Multimedia and Social Media Augmented System With Mobile Devices to Enhance Customers’ Experience and Interaction
    Vinh-Tiep Nguyen, Trung-Nghia Le, Quoc-Minh Bui, Minh-Triet Tran, and Anh-Duc Duong
    In Pacific Asia Conference on Information Systems (PACIS), Jul 2012
    (A Rank) (Oral presentation)
  4. RIVF
    Applying Virtual Reality for In-Door Jogging
    Trung-Nghia Le, Quoc-Minh Bui, Vinh-Tiep Nguyen, Minh-Triet Tran, and Anh-Duc Duong
    In International Conference on Computing and Communication Technologies, Research, Innovation, and Vision for the Future (RIVF), Jul 2012