Selected Publications

Nikou, C., & Giannakopoulos, T. Contrastive and Transfer Learning for Effective Audio Fingerprinting through a Real-World Evaluation Protocol IJMSTA, 7(1), 68-82 (2025)
Sgouropoulos, C., Nikou, C., Vlachos, S., Theiou, V., Foukanelis, C., & Giannakopoulos, T. Prototypical Contrastive Learning For Improved Few-Shot Audio IEEE MLSP (2025)
Nikou, C., Theiou, V., Vlachos, S., Sgouropoulos, C., Sgouropoulos, D., & Giannakopoulos, T. On the Robustness of State-of-the-Art Transformers for Sound Event Classification Against Black Box Adversarial Attacks 2025 EUSIPCO. IEEE (2025)
Koromilas, P., Bouritsas, G., Giannakopoulos, T., Nicolaou, M., & Panagakis, Y. Bridging Mini-Batch and Asymptotic Analysis in Contrastive Learning: From InfoNCE to Kernel-Based Losses Forty-first International Conference on Machine Learning (2024)
Mitsou, A., Petrogianni, A., Vakalaki, E. A., Nikou, C., Psallidas, T., & Giannakopoulos, T. A multimodal dataset for electric guitar playing technique recognition Data in Brief, Vol 52 (2024)
Kaliosis, P., Eleftheriou, S., Nikou, C., & Giannakopoulos, T. A self-supervised learning approach for detecting non-psychotic relapses using wearable-based digital phenotyping 2024 IEEE ICASSPW. IEEE (2024)
Petrogianni, A., Kapelonis, L., Antoniou, N., Eleftheriou, S., Mitseas, P., Sgouropoulos, D., Katsamanis, N., Giannakopoulos, T., & Narayanan, S. RobuSER: A robustness Benchmark for Speech Emotion Recognition 2024 12th International Conference on Affective Computing and Intelligent Interaction (ACII) (pp. 1-7). IEEE (2024)
Eleftheriadis, K., Gini, M., Manousakas, M., Diapouli, E., Vratolis, S., Papagiannis, S., Zografou, O., Giannakopoulos, T., Konstantopoulos, S., Mocnik, G., & Drinovec, L. MItigating Transport-Related Air Pollution in Europe: The MI-TRAP project The European Aerosol Conference (2024)
Sgouropoulos, D., Mitseas, P., Eleftheriou, S., Giannakopoulos, T., Petrogianni, A., Kapelonis, L., Antoniou, N., Katsamanis, A., & Narayanan, S. Emotion-aware speech popularity prediction: a use-case on TED talks 2024 12th International Conference on Affective Computing and Intelligent Interaction (ACII). IEEE (2024)
Gkritzali, E., Kaliosis, P., Galanaki, S., Palogiannidi, E., & Giannakopoulos, T. Greek2MathTex: A Greek Speech-to-Text Framework for LaTeX Equations Generation 13th Hellenic Conference on AI, SETN 2024, Piraeus, Greece, September 11-13, 2024. Proceedings 4 (2024)
Melistas, T., Kapelonis, L., Antoniou, N., Mitseas, P., Sgouropoulos, D., Giannakopoulos, T., Katsamanis, A., & Narayanan, S. Cross-lingual features for Alzheimer's dementia detection from speech Proc. INTERSPEECH (pp. 3008-3012) (2023)
Christopoulos, D., Chatzi, E., Sofianopoulos, G., Patiniotaki, E., Eleftheriou, S., Koromilas, P., Kaliosis, P., Gkouti, N., Giannakopoulos, T., Petridis, K., & Sismanidou, E. Smart Subs Subtitling App for Watching Live Virtual Dome Performances 2023 SMAP. IEEE (2023)
Bochalis, C., Vargas, C. D., Jarvis, E. D., & Giannakopoulos, T. Unsupervised Temporal Analysis of Mouse Vocalizations 2023 IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology (CIBCB) (pp. 1-8). IEEE (2023)
Petrogianni, A., Vassilakis, D., Klampanos, I. A., Giannakopoulos, T., & Andreopoulou, A. Jazz Mapping: An Advanced Framework for Solo Analysis and Discovery in Jazz Music Audio Engineering Society Convention 155 (2023)
Koromilas, P., Nicolaou, M. A., Giannakopoulos, T., & Panagakis, Y. MMATR: A Lightweight Approach for Multimodal Sentiment Analysis Based on Tensor Methods ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 1-5). IEEE (2023)
Antoniou, N., Katsamanis, A., Giannakopoulos, T., & Narayanan, S. Designing and Evaluating Speech Emotion Recognition Systems: A reality check case study with IEMOCAP ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 1-5). IEEE (2023)
Alygizakis, N., Giannakopoulos, T., Thomaidis, N. S., & Slobodnik, J. Detecting the sources of chemicals in the Black Sea using non-target screening and deep learning convolutional neural networks Science of The Total Environment, 157554 (2022)
Petrogianni, A., Koromilas, P., & Giannakopoulos, T. Film Shot Type Classification Based on Camera Movement Styles Iberian Conference on Pattern Recognition and Image Analysis (pp. 602-615) (2022)
Moutti, M., Eleftheriou, S., Koromilas, P., & Giannakopoulos, T. A Dataset for Speech Emotion Recognition in Greek Theatrical Plays 13th Conference on Language Resources and Evaluation (LREC 2022) (pp. 1040-1046) (2022)
Tsitos, A. C., Dagioglou, M., & Giannakopoulos, T. Real-time feasibility of a human intention method evaluated through a competitive human-robot reaching game 2022 ACM/IEEE International Conference on Human-Robot Interaction (pp. 1080-1084)
Dagioglou, M., Soulounias, N., & Giannakopoulos, T. Object Size Prediction from Hand Movement Using a Single RGB Sensor International Conference on Human-Computer Interaction (pp. 369-386) (2022)
Touros, G., & Giannakopoulos, T. Video soundtrack evaluation with machine learning: Data availability, feature extraction, and classification Advances in Speech and Music Technology (2022)
Moutti, M., Eleftheriou, S., Koromilas, P., & Giannakopoulos, T. Cross linguistic speech emotion recognition using CNNs: a use-case in Greek Theatrical Data Proceedings of the 15th International Conference on PErvasive Technologies Related to Assistive Environments (pp. 662-667) (2022)
Zouros, M., & Giannakopoulos, T. Photography Style Analysis using Convolutional Neural Networks 2022 SITIS. IEEE (2022)
Stoumpou, V., Vargas, C. D., Schade, P. F., Boyd, J. L., Giannakopoulos, T., & Jarvis, E. D. Analysis of Mouse Vocal Communication (AMVOC): a deep, unsupervised method for rapid detection, analysis and classification of ultrasonic vocalisations Bioacoustics, 1-31 (2022)
Geroulanos, A., & Giannakopoulos, T. Emotion Recognition in Music Using Deep Neural Networks Advances in Speech and Music Technology: Computational Aspects and Applications. Cham: Springer International Publishing, 193-213 (2022)
Papaioannou, C., Valiantzas, I., Giannakopoulos, T., Kaliakatsos-Papakostas, M., & Potamianos, A. A Dataset for Greek Traditional and Folk Music: Lyra 23rd International Society for Music Information Retrieval Conference (ISMIR 2022) (2022)
Chatziagapi, A., Sgouropoulos, D., Karouzos, C., Melistas, T., Giannakopoulos, T., Katsamanis, A., & Narayanan, S. Audio and ASR-based Filled Pause Detection 2022 10th International Conference on Affective Computing and Intelligent Interaction (ACII) (pp. 1-7). IEEE (2022)
Koromilas, P., & Giannakopoulos, T. Deep multimodal emotion recognition on human speech: A review Applied Sciences, 11(17), 7962 (2021)
Paraskevoudis, K., & Giannakopoulos, T. Instrument Playing Technique Recognition: A Greek Music Use Case In Worldwide Music Conference (pp. 124-136). Springer (2021)
Psallidas, T., Koromilas, P., Giannakopoulos, T., & Spyrou, E. Multimodal Summarization of User-Generated Videos Applied Sciences, 11(11), 5260 (2021)
Eleftheriou, S., Koromilas, P., & Giannakopoulos, T. Automatic Assessment of Speaking Skills Using Aural and Textual Information Proceedings of The Fourth International Conference on Natural Language and Speech Processing (ICNLSP 2021) (pp. 166-1) (2021)
Mitsou, A., Spyrou, E., & Giannakopoulos, T. Multimodal Workplace Monitoring for Human Activity Recognition 25th Pan-Hellenic Conference on Informatics (pp. 206-211) (2021)
Melistas, T., & Giannakopoulos, T. Lyrics and Vocal Melody Generation conditioned on Accompaniment Proceedings of the 2nd Workshop on NLP for Music and Spoken Audio (NLP4MusA) (2021)
Psallidas, T., Mitsou, A., Pikramenos, G., Spyrou, E., & Giannakopoulos, T. ARCHEO: A Dataset for Sound Event Detection in Areas of Touristic Interest 2020 15th International Workshop on Semantic and Social Media Adaptation and Personalization (SMAP) (pp. 1-6). IEEE (2020)
Giannakopoulos, T., Orfanidi, M., & Perantonis, S. Athens Urban Soundscape (ATHUS): A Dataset for Urban Soundscape Quality Recognition. In International Conference on Multimedia Modeling (pp. 338-348). Springer, Cham (2019)
Pikramenos, G., Smyrnis, G., Vernikos, I., Konidaris, T., Spyrou, E., & Perantonis, S. Sentiment Analysis from Sound Spectrograms via Soft BoVW and Temporal Structure Modelling. In ICPRAM (pp. 361-369) (2019)
Giannakopoulos, T., Dimopoulos, S., Pantazopoulos, G., Chatziagapi, A., Sgouropoulos, D., Katsamanis, A., Potamianos, A., & Narayanan, S. Using Oliver API for emotion-aware movie content characterization 2019 CBMI. IEEE (2019)
Giannakopoulos, T., & Perantonis, S. Recognizing the quality of urban sound recordings using hand-crafted and deep audio features Proceedings of the 12th ACM International Conference on PErvasive Technologies Related to Assistive Environments (2019)
Paraskevopoulos, G., Spyrou, E., Sgouropoulos, D., Giannakopoulos, T., & Mylonas, P. Real-Time Arm Gesture Recognition Using 3D Skeleton Joint Data Algorithms, 12(5), 108 (2019)
Giannakopoulos, T., Konstantopoulos, S., Siantikos, G., & Karkaletsis, V. A System of Recognition Services for Clinical Assessment RADIO–Robots in Assisted Living (pp. 7-18). Springer (2019)
Chatziagapi, A., Paraskevopoulos, G., Sgouropoulos, D., Pantazopoulos, G., Nikandrou, M., Giannakopoulos, T., Katsamanis, A., Potamianos, A., & Narayanan, S. Data Augmentation Using GANs for Speech Emotion Recognition Proc. Interspeech 2019, 171-175 (2019)
Paraskevopoulos, G., Tzinis, E., Ellinas, N., Giannakopoulos, T., & Potamianos, A. Unsupervised Low-Rank Representations for Speech Emotion Recognition Proc. Interspeech 2019, 939-943 (2019)
Papakostas, M., & Giannakopoulos, T. Speech-music discrimination using deep visual feature extractors Expert Systems with Applications, 114, 334-344 (2018)
Sarafianos, N., Giannakopoulos, T., Nikou, C., & Kakadiaris, I. A. Curriculum learning of visual attribute clusters for multi-task classification Pattern Recognition, 80, 94-108 (2018)
Bougiatiotis, K., & Giannakopoulos, T. Enhanced movie content similarity based on textual, auditory and visual information. Expert Systems with Applications, 96, 86-102 (2018)
Papakostas, M., Tsiakas, K., Giannakopoulos, T., & Makedon, F. Towards predicting task performance from EEG signals 2017 IEEE International Conference on Big Data (Big Data) (pp. 4423-4425)
Sarafianos, N., Giannakopoulos, T., Nikou, C., & Kakadiaris, I. A. Curriculum learning for multi-task classification of visual attributes In Proceedings of the IEEE International Conference on Computer Vision Workshops (pp. 2608-2615) (2017)
Korakakis, M., Spyrou, E., Mylonas, P., & Perantonis, S. Exploiting social media information toward a context-aware recommendation system Social Network Analysis and Mining, 7(1), 42 (2017)
Giannakopoulos, T., & Konstantopoulos, S. Daily Activity Recognition based on Meta-classification of Low-level Audio Events Proceedings of ICT4AWE2017, ISBN: 978-989-758-251-6 (2017)
Giannakopoulos, T., Konstantopoulos, S., Siantikos, G., & Karkaletsis, V. Design for a System of Multimodal Interconnected ADL Recognition Services Components and Services for IoT Platforms, pp. 323-333. Springer International Publishing (2017)
Papakostas, M., Siantikos, G., Giannakopoulos, T., Spyrou, E., & Sgouropoulos, D. Recognizing Emotional States Using Speech Information GeNeDis 2016 (pp. 155-164). Springer, Cham (2017)
Giannakopoulos, T., & Siantikos, G. A ROS Framework for Audio-Based Activity Recognition Proceedings of the 9th ACM International Conference on PErvasive Technologies Related to Assistive Environments (PETRA 2016) (2016)
Nivolianitou, Z. S., Koromila, I. A., & Giannakopoulos, T. Bayesian network to predict environmental risk of a possible ship accident International Journal of Risk Assessment and Management, 19(3), 228-239 (2016)
Papakostas, M., Giannakopoulos, T., & Makedon, F. Short-term Recognition of Human Activities using Convolutional Neural Networks 2016 International Conference on Signal-Image Technology & Internet Based Systems (SITIS). IEEE (2016)
Smailis, C., Sarafianos, N., Giannakopoulos, T., & Perantonis, S. Fusing Active Orientation Models and Mid-term Audio Features for Automatic Depression Estimation Proceedings of the 9th ACM International Conference on PErvasive Technologies Related to Assistive Environments (PETRA 2016) (2016)
Bougiatiotis, K., & Giannakopoulos, T. Content Representation and Similarity of Movies based on Topic Extraction from Subtitles Proceedings of the 9th Hellenic Conference on Artificial Intelligence. ACM (2016)
Sarafianos, N., Giannakopoulos, T., & Petridis, S. Audio-visual speaker diarization using Fisher linear semi-discriminant analysis. Multimedia Tools and Applications, 75(1), 115-130 (2016)
Spyrou, E., & Mylonas, P. Analyzing Flickr metadata to extract location-based information and semantically organize its photo content. Neurocomputing, 172, 114-133 (2016)
Giannakopoulos, T. pyAudioAnalysis: An open-source Python library for audio signal analysis. PLoS ONE, 10(12), e0144610 (2015)
Giannakopoulos, T., Siantikos, G., Perantonis, S., Votsi, N. E., & Pantis, J. Automatic soundscape quality estimation using audio analysis Proceedings of the 8th ACM International Conference on PErvasive Technologies Related to Assistive Environments (p. 19) (2015)
Giannakopoulos, T., Gyftakis, S., Charou, E., Perantonis, S., Nivolianitou, Z., Koromila, I., & Makrygiorgos, A. Long-term marine traffic monitoring for environmental safety in the Aegean Sea International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences (2015)
Petridis, S., Giannakopoulos, T., & Spyropoulos, C. D. A Low Cost Pupillometry Approach International Journal of E-Health and Medical Communications (IJEHMC), 6(4), 49-61 (2015)
Sgouropoulos, D., Giannakopoulos, T., Siantikos, G., Spyrou, E., & Perantonis, S. Detection of Clothes Change Fusing Color, Texture, Edge and Depth Information E-Business and Telecommunications, vol. 554 of Communications in Computer and Information Science, pp. 383-392, Springer (2015)
Petridis, S., Giannakopoulos, T., & Perantonis, S. Unobtrusive Low-Cost Physiological Monitoring Using Visual Information Handbook of Research on Innovations in the Diagnosis and Treatment of Dementia, 306 (2015)
Sgouropoulos, D., Spyrou, E., Siantikos, G., & Giannakopoulos, T. Counting and tracking people in a smart room: An IoT approach Semantic and Social Media Adaptation and Personalization (SMAP), 2015 10th International Workshop on (pp. 1-5). IEEE (2015)
Siantikos, G., Sgouropoulos, D., Giannakopoulos, T., & Spyrou, E. Fusing multiple audio sensors for acoustic event detection Image and Signal Processing and Analysis (ISPA), 2015 9th International Symposium on (pp. 265-269). IEEE (2015)
Koromila, I., Nivolianitou, Z., Perantonis, S., Giannakopoulos, T., Charou, E., Gyftakis, S., & Spyrou, K. Environmental Risk Assessment for the Aegean Sea 11th International Conference on Marine Navigation and Safety of Sea Transportation (TransNav 2015) (2015)
Giannakopoulos, T., & Pikrakis, A. Introduction to audio analysis: a MATLAB® approach Academic Press (2014)
Giannakopoulos, T., Smailis, C., Perantonis, S. J., & Spyropoulos, C. D. Realtime depression estimation using mid-term audio features AI-AM/NetMed@ECAI (2014)
Giannakopoulos, T., & Petridis, S. Fisher linear semi-discriminant analysis for speaker diarization IEEE TASLP 20.7 (2012): 1913-1922
Giannakopoulos, T., & Petridis, S. Detection and clustering of musical audio parts using Fisher linear semi-discriminant analysis 2012 EUSIPCO. IEEE (2012)
Giannakopoulos, T., Makris, A., Kosmopoulos, D., Perantonis, S., & Theodoridis, S. Audio-visual fusion for detecting violent scenes in videos 2010