| Nikou, C., & Giannakopoulos, T. |
Contrastive and Transfer Learning for Effective Audio Fingerprinting through a
Real-World Evaluation Protocol |
IJMSTA, 7(1), 68-82 (2025) |
|
| Sgouropoulos, C., Nikou, C., Vlachos, S., Theiou, V., Foukanelis, C., &
Giannakopoulos, T. |
Prototypical Contrastive Learning For Improved Few-Shot Audio |
IEEE MLSP (2025) |
|
| Nikou, C., Theiou, V., Vlachos, S., Sgouropoulos, C., Sgouropoulos, D., &
Giannakopoulos, T. |
On the Robustness of State-of-the-Art Transformers for Sound Event
Classification Against Black Box Adversarial Attacks |
2025 EUSIPCO. IEEE (2025) |
|
| Koromilas, P., Bouritsas, G., Giannakopoulos, T., Nicolaou, M., & Panagakis, Y.
|
Bridging Mini-Batch and Asymptotic Analysis in Contrastive Learning: From
InfoNCE to Kernel-Based Losses |
Forty-first International Conference on Machine Learning (2024) |
|
| Mitsou, A., Petrogianni, A., Vakalaki, E. A., Nikou, C., Psallidas, T., &
Giannakopoulos, T. |
A multimodal dataset for electric guitar playing technique recognition |
Data in Brief, Vol 52 (2024) |
|
| Kaliosis, P., Eleftheriou, S., Nikou, C., & Giannakopoulos, T. |
A self-supervised learning approach for detecting non-psychotic relapses using
wearable-based digital phenotyping |
2024 IEEE ICASSPW. IEEE (2024) |
|
| Petrogianni, A., Kapelonis, L., Antoniou, N., Eleftheriou, S., Mitseas, P.,
Sgouropoulos, D., Katsamanis, N., Giannakopoulos, T., & Narayanan, S. |
RobuSER: A robustness Benchmark for Speech Emotion Recognition |
2024 12th International Conference on Affective Computing and Intelligent
Interaction (ACII) (pp. 1-7). IEEE (2024) |
|
| Eleftheriadis, K., Gini, M., Manousakas, M., Diapouli, E., Vratolis, S.,
Papagiannis, S., Zografou, O., Giannakopoulos, T., Konstantopoulos, S., Mocnik,
G., & Drinovec, L. |
MItigating Transport-Related Air Pollution in Europe: The MI-TRAP project |
The European Aerosol Conference (2024) |
|
| Sgouropoulos, D., Mitseas, P., Eleftheriou, S., Giannakopoulos, T., Petrogianni,
A., Kapelonis, L., Antoniou, N., Katsamanis, A., & Narayanan, S. |
Emotion-aware speech popularity prediction: a use-case on TED talks |
2024 12th International Conference on Affective Computing and Intelligent
Interaction (ACII). IEEE (2024) |
|
| Gkritzali, E., Kaliosis, P., Galanaki, S., Palogiannidi, E., & Giannakopoulos,
T. |
Greek2MathTex: A Greek Speech-to-Text Framework for LaTeX Equations Generation
|
13th Helenic Conference on AI, SETN 2024, Piraeus, Greece, September 11-13,
2024. Proceedings 4 (2024) |
|
| Melistas, T., Kapelonis, L., Antoniou, N., Mitseas, P., Sgouropoulos, D.,
Giannakopoulos, T., Katsamanis, A., Narayanan, S., & Demokritos, N.C.S.R. |
Cross-lingual features for alzheimer's dementia detection from speech |
Proc. INTERSPEECH (pp. 3008-3012) (2023) |
|
| Christopoulos, D., Chatzi, E., Sofianopoulos, G., Patiniotaki, E., Eleftheriou,
S., Koromilas, P., Kaliosis, P., Gkouti, N., Giannakopoulos, T., Petridis, K., &
Sismanidou, E. |
Smart Subs Subtitling App for Watching Live Virtual Dome Performances |
2023 SMAP. IEEE (2023) |
|
| Bochalis, C., Vargas, C. D., Jarvis, E. D., & Giannakopoulos, T. |
Unsupervised Temporal Analysis of Mouse Vocalizations |
2023 IEEE Conference on Computational Intelligence in Bioinformatics and
Computational Biology (CIBCB) (pp. 1-8). IEEE (2023) |
|
| Petrogianni, A., Vassilakis, D., Klampanos, I. A., Giannakopoulos, T., &
Andreopoulou, A. |
Jazz Mapping: An Advanced Framework for Solo Analysis and Discovery in Jazz
Music |
Audio Engineering Society Convention 155 (2023) |
|
| Koromilas, P., Nicolaou, M. A., Giannakopoulos, T., & Panagakis, Y. |
MMATR: A Lightweight Approach for Multimodal Sentiment Analysis Based on Tensor
Methods |
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and Signal
Processing (ICASSP) (pp. 1-5). IEEE (2023) |
|
| Antoniou, N., Katsamanis, A., Giannakopoulos, T., & Narayanan, S. |
Designing and Evaluating Speech Emotion Recognition Systems: A reality check
case study with IEMOCAP |
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and Signal
Processing (ICASSP) (pp. 1-5). IEEE (2023) |
|
| Alygizakis, N., Giannakopoulos, T., Τhomaidis, N. S., & Slobodnik, J. |
Detecting the sources of chemicals in the Black Sea using non-target screening
and deep learning convolutional neural networks |
Science of The Total Environment, 157554 (2022), |
html |
| Petrogianni, A., Koromilas, P., & Giannakopoulos, T. |
Film Shot Type Classification Based on Camera Movement Styles |
Iberian Conference on Pattern Recognition and Image Analysis (pp. 602-615), 2022
|
html |
| M. Moutti, S. Eleftheriou, P. Koromilas, T. Giannakopoulos |
A Dataset for Speech Emotion Recognition in Greek Theatrical Plays |
13th Conference on Language Resources and Evaluation (LREC 2022), pages
1040–1046 |
pdf |
| Tsitos, A. C., Dagioglou, M., & Giannakopoulos, T. |
Real-time feasibility of a human intention method evaluated through a
competitive human-robot reaching game |
2022 ACM/IEEE International Conference on Human-Robot Interaction (pp.
1080-1084) |
html |
| Dagioglou, M., Soulounias, N., & Giannakopoulos, T. |
Object Size Prediction from Hand Movement Using a Single RGB Sensor |
International Conference on Human-Computer Interaction (pp. 369-386), 2022 |
html |
| Touros, G., & Giannakopoulos, T. |
Video soundtrack evaluation with machine learning: Data availability, feature
extraction, and classification |
Advances in Speech and Music Technology (2022) |
|
| Moutti, M., Eleftheriou, S., Koromilas, P., & Giannakopoulos, T. |
Cross linguistic speech emotion recognition using CNNs: a use-case in Greek
Theatrical Data |
Proceedings of the 15th International Conference on PErvasive Technologies
Related to Assistive Environments (pp. 662-667) (2022) |
|
| Zouros, M., & Giannakopoulos, T. |
Photography Style Analysis using Convolutional Neural Networks |
2022 SITIS. IEEE (2022) |
|
| Stoumpou, V., Vargas, C. D., Schade, P. F., Boyd, J. L., Giannakopoulos, T., &
Jarvis, E. D. |
Analysis of Mouse Vocal Communication (AMVOC): a deep, unsupervised method for
rapid detection, analysis and classification of ultrasonic vocalisations |
Bioacoustics, 1-31 (2022) |
|
| Geroulanos, A., & Giannakopoulos, T. |
Emotion Recognition in Music Using Deep Neural Networks |
Advances in Speech and Music Technology: Computational Aspects and Applications.
Cham: Springer International Publishing, 193-213 (2022) |
|
| Papaioannou, C., Valiantzas, I., Giannakopoulos, T., Kaliakatsos Papakostas, M.,
& Potamianos, A. |
A Dataset for Greek Traditional and Folk Music: Lyra |
23rd International Society for Music Information Retrieval Conference (ISMIR
2022) (2022) |
|
| Chatziagapi, A., Sgouropoulos, D., Karouzos, C., Melistas, T., Giannakopoulos,
T., Katsamanis, A., & Narayanan, S. |
Audio and ASR-based Filled Pause Detection |
2022 10th International Conference on Affective Computing and Intelligent
Interaction (ACII) (pp. 1-7). IEEE (2022) |
|
| Koromilas, P., & Giannakopoulos, T. |
Deep multimodal emotion recognition on human speech: A review |
Applied Sciences, 11(17), 7962 (2021). |
pdf
|
| Paraskevoudis, K., & Giannakopoulos, T. |
Instrument Playing Technique Recognition: A Greek Music Use Case |
In Worldwide Music Conference (pp. 124-136). Springer (2021) |
html |
| Psallidas, T., Koromilas, P., Giannakopoulos, T., & Spyrou, E. |
Multimodal Summarization of User-Generated Videos |
Applied Sciences, 11(11), 5260. (2021) |
pdf
|
| Eleftheriou, S., Koromilas, P., & Giannakopoulos, T. |
Automatic Assessment of Speaking Skills Using Aural and Textual Information |
Proceedings of The Fourth International Conference on Natural Language and
Speech Processing (ICNLSP 2021) (pp. 166-1) (2021) |
|
| Mitsou, A., Spyrou, E., & Giannakopoulos, T. |
Multimodal Workplace Monitoring for Human Activity Recognition |
25th Pan-Hellenic Conference on Informatics (pp. 206-211) (2021) |
|
| Melistas, T., & Giannakopoulos, T. |
Lyrics and Vocal Melody Generation conditioned on Accompaniment |
Proceedings of the 2nd Workshop on NLP for Music and Spoken Audio (NLP4MusA)
(2021) |
|
| Psallidas, T., Mitsou, A., Pikramenos, G., Spyrou, E., & Giannakopoulos, T. |
ARCHEO: A Dataset for Sound Event Detection in Areas of Touristic Interest |
2020 15th International Workshop on Semantic and Social Media Adaptation and
Personalization (SMA (pp. 1-6). IEEE. |
html |
| Giannakopoulos, T., Orfanidi, M., & Perantonis, S. |
Athens Urban Soundscape (ATHUS): A Dataset for Urban Soundscape Quality
Recognition. |
In International Conference on Multimedia Modeling (pp. 338-348). Springer,
Cham., 2019 |
pdf |
| Pikramenos, G., Smyrnis, G., Vernikos, I., Konidaris, T., Spyrou, E., &
Perantonis, S |
Sentiment Analysis from Sound Spectrograms via Soft BoVW and Temporal Structure
Modelling. |
In ICPRAM (pp. 361-369) 2019 |
pdf |
| Giannakopoulos, T., Dimopoulos, S., Pantazopoulos, G., Chatziagapi, A.,
Sgouropoulos, D., Katsamanis, A., Potamianos, A., & Narayanan, S. |
Using Oliver API for emotion-aware movie content characterization |
2019 CBMI. IEEE (2019) |
|
| Giannakopoulos, T., & Perantonis, S. |
Recognizing the quality of urban sound recordings using hand-crafted and deep
audio features |
Proceedings of the 12th ACM International Conference on PErvasive Technologies
Related to Assistive Environments (2019) |
|
| Paraskevopoulos, G., Spyrou, E., Sgouropoulos, D., Giannakopoulos, T., &
Mylonas, P. |
Real-Time Arm Gesture Recognition Using 3D Skeleton Joint Data |
Algorithms, 12(5), 108 (2019) |
|
| Giannakopoulos, T., Konstantopoulos, S., Siantikos, G., & Karkaletsis, V. |
A System of Recognition Services for Clinical Assessment |
RADIO–Robots in Assisted Living (pp. 7-18). Springer (2019) |
|
| Chatziagapi, A., Paraskevopoulos, G., Sgouropoulos, D., Pantazopoulos, G.,
Nikandrou, M., Giannakopoulos, T., Katsamanis, A., Potamianos, A., & Narayanan,
S. |
Data Augmentation Using GANs for Speech Emotion Recognition |
Proc. Interspeech 2019, 171-175 (2019) |
|
| Paraskevopoulos, G., Tzinis, E., Ellinas, N., Giannakopoulos, T., & Potamianos,
A. |
Unsupervised Low-Rank Representations for Speech Emotion Recognition |
Proc. Interspeech 2019, 939-943 (2019) |
|
| Papakostas, Michalis, and Theodoros Giannakopoulos |
Speech-music discrimination using deep visual feature extractors |
Expert Systems with Applications 114 (2018): 334-344. |
pdf |
| Sarafianos, N., Giannakopoulos, T., Nikou, C., & Kakadiaris, I. A. |
Curriculum learning of visual attribute clusters for multi-task classification
|
Pattern Recognition, 80, 94-108., 2018 |
pdf |
| Bougiatiotis, Konstantinos, and Theodoros Giannakopoulos |
Enhanced movie content similarity based on textual, auditory and visual
information. |
Expert Systems with Applications 96 (2018): 86-102. |
pdf |
| Papakostas, M., Tsiakas, K., Giannakopoulos, T., & Makedon, F. |
Towards predicting task performance from EEG signals |
2017 IEEE International Conference on Big Data (Big Data) (pp. 4423-4425) |
pdf |
| Sarafianos, N., Giannakopoulos, T., Nikou, C., & Kakadiaris, I. A. |
Curriculum learning for multi-task classification of visual attributes |
2017 In Proceedings of the IEEE International Conference on Computer Vision
Workshops (pp. 2608-2615) |
pdf |
| Korakakis, M., Spyrou, E., Mylonas, P., & Perantonis, S. |
Exploiting social media information toward a context-aware recommendation system
|
Social Network Analysis and Mining, 7(1), 42., 2017 |
html |
| Giannakopoulos, T., & Konstantopoulos, S. |
Daily Activity Recognition based on Meta-classification of Low-level Audio
Events |
Proceedings of ICT4AWE2017, ISBN: 978-989-758-251-6 (2017) |
|
| Giannakopoulos, T., Konstantopoulos, S., Siantikos, G., & Karkaletsis, V. |
Design for a System of Multimodal Interconnected ADL Recognition Services |
Components and Services for IoT Platforms, pp. 323-333. Springer International
Publishing (2017) |
|
| Papakostas, M., Siantikos, G., Giannakopoulos, T., Spyrou, E., & Sgouropoulos,
D. |
Recognizing Emotional States Using Speech Information |
GeNeDis 2016 (pp. 155-164). Springer, Cham (2017) |
|
| Giannakopoulos, T., & Siantikos, G. |
A ROS Framework for Audio-Based Activity Recognition |
Proceedings of the 9th ACM International Conference on PErvasive Technologies
Related to Assistive Environments (PETRA 2016) (2016) |
|
| Nivolianitou, Z. S., Koromila, I. A., & Giannakopoulos, T. |
Bayesian network to predict environmental risk of a possible ship accident |
International Journal of Risk Assessment and Management, 19(3), 228-239 (2016)
|
|
| Papakostas, M., Giannakopoulos, T., & Makedon, F. |
Short-term Recognition of Human Activities using Convolutional Neural Networks
|
2016 International Conference on Signal-Image Technology & Internet Based
Systems (SITIS). IEEE (2016) |
|
| Smailis, C., Sarafianos, N., Giannakopoulos, T., & Perantonis, S. |
Fusing Active Orientation Models and Mid-term Audio Features for Automatic
Depression Estimation |
Proceedings of the 9th ACM International Conference on PErvasive Technologies
Related to Assistive Environments (PETRA 2016) (2016) |
|
| Bougiatiotis, K., & Giannakopoulos, T. |
Content Representation and Similarity of Movies based on Topic Extraction from
Subtitles |
Proceedings of the 9th Hellenic Conference on Artificial Intelligence. ACM
(2016) |
|
| Sarafianos, N., Giannakopoulos, T., & Petridis, S. |
Audio-visual speaker diarization using fisher linear semi-discriminant analysis.
|
Multimedia Tools and Applications, 75(1), 115-130, 2016 |
pdf |
| Spyrou, E., & Mylonas, P. |
Analyzing Flickr metadata to extract location-based information and semantically
organize its photo content. |
Neurocomputing, 172, 114-133., 2016 |
html
|
| Giannakopoulos Theodoros |
pyaudioanalysis: An open-source python library for audio signal analysis. |
PloS one 10.12 (2015): e0144610. |
html |
| Giannakopoulos, T., Siantikos, G., Perantonis, S., Votsi, N. E., & Pantis, J.
|
Automatic soundscape quality estimation using audio analysis |
Proceedings of the 8th ACM International Conference on PErvasive Technologies
Related to Assistive Environments (p. 19) (2015) |
|
| Giannakopoulos, T., Gyftakis, S., Charou, E., Perantonis, S., Nivolianitou, Z.,
Koromila, I., & Makrygiorgos, A. |
Long-term marine traffic monitoring for environmental safety in the aegean sea
|
International Archives of the Photogrammetry, Remote Sensing Spatial Information
Sciences (2015) |
|
| Petridis, S., Giannakopoulos, T., & Spyropoulos, C. D. |
A Low Cost Pupillometry Approach |
International Journal of E-Health and Medical Communications (IJEHMC), 6(4),
49-61 (2015) |
|
| Sgouropoulos, D., Giannakopoulos, T., Siantikos, G., Spyrou, E., & Perantonis,
S. |
Detection of Clothes Change Fusing Color, Texture, Edge and Depth Information
|
E-Business and Telecommunications, vol. 554 of Communications in Computer and
Information Science, pp. 383-392, Springer (2015) |
|
| Petridis, S., Giannakopoulos, T., & Perantonis, S. |
Unobtrusive Low-Cost Physiological Monitoring Using Visual Information |
Handbook of Research on Innovations in the Diagnosis and Treatment of Dementia,
306 (2015) |
|
| Sgouropoulos, D., Spyrou, E., Siantikos, G., & Giannakopoulos, T. |
Counting and tracking people in a smart room: An IoT approach |
Semantic and Social Media Adaptation and Personalization (SMAP), 2015 10th
International Workshop on (pp. 1-5). IEEE (2015) |
|
| Siantikos, G., Sgouropoulos, D., Giannakopoulos, T., & Spyrou, E. |
Fusing multiple audio sensors for acoustic event detection |
Image and Signal Processing and Analysis (ISPA), 2015 9th International
Symposium on (pp. 265-269). IEEE (2015) |
|
| Koromila, I., Nivolianitou, Z., Perantonis, S., Giannakopoulos, T., Charou, E.,
Gyftakis, S., & Spyrou, K. |
Environmental Risk Assessment for the Aegean Sea |
11th International Conference on Marine Navigation and Safety of Sea
Transportation (TransNav 2015) (2015) |
|
| Giannakopoulos, T., & Pikrakis, A. |
Introduction to audio analysis: a MATLAB® approach |
Academic Press (2014) |
|
| Giannakopoulos, T., Smailis, C., Perantonis, S. J., & Spyropoulos, C. D. |
Realtime depression estimation using mid-term audio features |
AI-AM/NetMed@ECAI (2014) |
|
| Giannakopoulos, T., & Petridis, S. |
Fisher linear semi-discriminant analysis for speaker diarization |
IEEE TASLP 20.7 (2012): 1913-1922 |
|
| Giannakopoulos, T., & Petridis, S. |
Detection and clustering of musical audio parts using Fisher linear
semi-discriminant analysis |
2012 EUSIPCO. IEEE (2012) |
|
| Giannakopoulos, T., Makris, A., Kosmopoulos, D., Perantonis, S., & Theodoridis,
S. |
Audio-visual fusion for detecting violent scenes in videos |
2010 |
|