TREC Video Retrieval Evaluation
Partial bibliography of peer-reviewed journal and conference papers based on TRECVID resources (comprising mainly work publicly accessible via the ACM Digital Library and IEEE Xplore)
2022 (59)
------------------------------------------------------------------
Karbalaie, Abdolamir, Farhad Abtahi, and Mårten Sjöström. "Event detection in surveillance videos: a review." Multimedia tools and applications 81.24 (2022): 35463-35501. Chavate, Shrikant, and Ravi Mishra. "Efficient detection of abrupt transitions using statistical methods." ECS Transactions 107.1 (2022): 6541. Roomi, Mohamed Mansoor, and Saurav Gupta. "Pyramidal-Relative Entropy Based Temporal Signature for Video Transition Detection using LSTM." (2022). Dave, Ishan, et al. "Gabriellav2: Towards better generalization in surveillance videos for action detection." Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision. 2022. Hu, Fan, et al. "Lightweight attentional feature fusion: A new baseline for text-to-video retrieval." European Conference on Computer Vision. Cham: Springer Nature Switzerland, 2022. Yu, Lijun, et al. "Argus++: Robust real-time activity detection for unconstrained video streams with overlapping cube proposals." Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision. 2022. Du, Yunhao, et al. "Pami-ad: An activity detector exploiting part-attention and motion information in surveillance videos." 2022 IEEE International Conference on Multimedia and Expo Workshops (ICMEW). IEEE, 2022. Chakraborty, Saptarshi, et al. "ALO-SBD: a hybrid shot boundary detection technique for video surveillance system." Edge Analytics: Select Proceedings of 26th International Conference—ADCOM 2020. Singapore: Springer Singapore, 2022. Lin, Qiubin, Wenming Cao, and Zhiquan He. "Level-wise aligned dual networks for text–video retrieval." EURASIP Journal on Advances in Signal Processing 2022.1 (2022): 58. Nandini, H. M., H. K.
Chethan, and B. S. Rashmi. "Shot based keyframe extraction using edge-LBP approach." Journal of King Saud University-Computer and Information Sciences 34.7 (2022): 4537-4545. Benoughidene, Abdelhalim, and Faiza Titouna. "A novel method for video shot boundary detection using CNN-LSTM approach." International Journal of Multimedia Information Retrieval 11.4 (2022): 653-667. Chakraborty, Saptarshi, Alok Singh, and Dalton Meitei Thounaojam. "A novel bifold-stage shot boundary detection algorithm: invariant to motion and illumination." The Visual Computer 38.2 (2022): 445-456. Lebron, Luis, et al. "Evaluation of Automatically Generated Video Captions Using Vision and Language Models." 2022 IEEE International Conference on Image Processing (ICIP). IEEE, 2022. Jose, Jasmin T., et al. "Efficient shot boundary detection with multiple visual representations." Mobile Information Systems 2022 (2022). Mishra, Ravi. "Hybrid feature extraction and optimized deep convolutional neural network based video shot boundary detection." Concurrency and Computation: Practice and Experience 34.25 (2022): e7256. Naveen Kumar, G. S., and V. S. K. Reddy. "High performance algorithm for content-based video retrieval using multiple features." Intelligent Systems and Sustainable Computing: Proceedings of ICISSC 2021. Singapore: Springer Nature Singapore, 2022. 637-646. Kalaivani, A., and S. Anusuya. "The Detection of Video Shot Transitions Based on Primary Segments Using the Adaptive Threshold of Colour-Based Histogram Differences and Candidate Segments Using the SURF Feature Descriptor." Symmetry 14.10 (2022): 2041. Singh, Alok, Thoudam Doren Singh, and Sivaji Bandyopadhyay. "V2t: video to text framework using a novel automatic shot boundary detection algorithm." Multimedia Tools and Applications 81.13 (2022): 17989-18009. Deotale, Disha, et al. "Optimized hybrid RNN model for human activity recognition in untrimmed video." Journal of Electronic Imaging 31.5 (2022): 051409-051409. 
Zhang, Binyu, et al. "Multi-actor activity detection by modeling object relationships in extended videos based on deep learning." Engineering Applications of Artificial Intelligence 114 (2022): 105055. Hamroun, Mohamed, Karim Tamine, and Benoît Crespin. "Multimodal video indexing (mvi): A new method based on machine learning and semi-automatic annotation on large video collections." International Journal of Image and Graphics 22.02 (2022): 2250022. Lokoč, Jakub, et al. "A task category space for user-centric comparative multimedia search evaluations." International conference on multimedia modeling. Cham: Springer International Publishing, 2022. Zhang, Yue, Chao Liang, and Longxiang Jiang. "Confidence-Aware Active Feedback for Interactive Instance Search." IEEE Transactions on Multimedia (2022). Sandeep, R., and Bora K. Prabin. "Application of Perceptual Video Hashing for Near-duplicate Video Retrieval." Evolutionary Computing and Mobile Sustainable Networks: Proceedings of ICECMSN 2021. Singapore: Springer Singapore, 2022. 253-275. Zare, Samin, and Mehran Yazdi. "A Survey on Semi-Automated and Automated Approaches for Video Annotation." 2022 12th International Conference on Computer and Knowledge Engineering (ICCKE). IEEE, 2022. Kaur, Lakhwinder, and Pankaj Kumar Mishra. "Estimation of concise video summaries from long sequence videos using deep learning via LSTM." Khan, Omar Shahbaz, Jan Zahálka, and Björn Þór Jónsson. "Influence of Late Fusion of High-Level Features on User Relevance Feedback for Videos." Proceedings of the 2nd International Workshop on Interactive Multimedia Retrieval. 2022. Ma, Zhixin, et al. "Reinforcement learning-based interactive video search." International Conference on Multimedia Modeling. Cham: Springer International Publishing, 2022. Wu, Weifei. "Multi-source selection transfer learning with privacy-preserving." Neural Processing Letters 54.6 (2022): 4921-4950. Prabavathy, A. Kethsy, M. Mythily, and J. A. M. Rexie. 
"Object based Video Retrieval with multiple features matching approach." 2022 8th International Conference on Advanced Computing and Communication Systems (ICACCS). Vol. 1. IEEE, 2022. Nijhawan, Rahul, et al. "Gun identification from gunshot audios for secure public places using transformer learning." Scientific reports 12.1 (2022): 13300. Balaji, Avantika, et al. "Shot Boundary Detection and Video Captioning Using Neural Networks." Disruptive Technologies for Big Data and Cloud Applications: Proceedings of ICBDCC 2021. Singapore: Springer Nature Singapore, 2022. 277-285. Heller, Silvan, et al. "Interactive video retrieval evaluation at a distance: comparing sixteen interactive video search systems in a remote setting at the 10th Video Browser Showdown." International Journal of Multimedia Information Retrieval 11.1 (2022): 1-18. Mai, Tien-Dung, Tien Do, and Duy-Dinh Le. "A Framework for Evaluating Video Summary Approaches." 2022 International Conference on Multimedia Analysis and Pattern Recognition (MAPR). IEEE, 2022. Harrando, Ismail. Representation, information extraction, and summarization for automatic multimedia understanding. Diss. Sorbonne Université, 2022. Perez-Martin, Jesus, et al. "A comprehensive review of the video-to-text problem." Artificial Intelligence Review (2022): 1-75. Allouche, Mohamed, and Mihai Mitrea. "Video fingerprinting: Past, present, and future." Frontiers in Signal Processing 2 (2022): 984169. Liang, Guoqiang, et al. "Video summarization with a dual-path attentive network." Neurocomputing 467 (2022): 1-9. Khan, Shakir, and Lulwah AlSuwaidan. "Agricultural monitoring system in video surveillance object detection using feature extraction and classification by deep learning techniques." Computers and Electrical Engineering 102 (2022): 108201. Behera, Nayan Kumar Subhashis, et al. "Person re-identification: A taxonomic survey and the path ahead." Image and Vision Computing 122 (2022): 104432. Yu, Qinghao, et al. 
"SUM-GAN-GEA: Video Summarization Using GAN with Gaussian Distribution and External Attention." Electronics 11.21 (2022): 3523. Harzig, Philipp. "Automatic generation of natural language descriptions of visual data: describing images and videos using recurrent and self-attentive models." (2022). Apostolidis, Evlampios, et al. "Summarizing videos using concentrated attention and considering the uniqueness and diversity of the video frames." Proceedings of the 2022 International Conference on Multimedia Retrieval. 2022. Khan, Omar Shahbaz, et al. "Exquisitor at the Video Browser Showdown 2022." International Conference on Multimedia Modeling. Cham: Springer International Publishing, 2022. Presa-Reyes, Maria, et al. "Multi-Source Weak Supervision Fusion for Disaster Scene Recognition in Videos." 2022 IEEE 5th International Conference on Multimedia Information Processing and Retrieval (MIPR). IEEE, 2022 Wang, Xu, et al. "A Video Summarization Model Based on Deep Reinforcement Learning with Long-Term Dependency." Sensors 22.19 (2022): 7689. Lughofer, Edwin. "Evolving multi-label fuzzy classifier." Information Sciences 597 (2022): 1-23. Ramesh, Raksha, et al. "Leveraging Text Representation and Face-head Tracking for Long-form Multimodal Semantic Relation Understanding." Proceedings of the 30th ACM International Conference on Multimedia. 2022. Li, Haopeng, et al. "Video joint modelling based on hierarchical transformer for co-summarization." IEEE Transactions on Pattern Analysis and Machine Intelligence 45.3 (2022): 3904-3917. Mavroudi, Effrosyni, Prashast Bindal, and René Vidal. "Actor-Centric Tubelets for Real-Time Activity Detection in Extended Videos." Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision. 2022. Tao, Yudong. Data Analytics and Deep Learning for Multimodal Data. Diss. University of Miami, 2022. Li, Ding, and Scott Dick. "Semi-supervised multi-label classification using an extended graph-based manifold regularization." 
Complex & Intelligent Systems 8.2 (2022): 1561-1577. Li, Lihuan, Maurice Pagnucco, and Yang Song. "Graph-based spatial transformer with memory replay for multi-future pedestrian trajectory prediction." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2022. Li, Changwei. -based video summarization using attention networks. Diss. University of Cincinnati, 2022. Loc, Erika, et al. "Development of a MultiModal Annotation Framework and Dataset for Deep Video Understanding." Proceedings of the 2nd Workshop on People in Vision, Language, and the Mind. 2022. Shi, Huizhong, Yana Zhang, and Yanfang Li. "Decision Fusion Based Multi-type Shot Boundary Detection in Real Time." 2022 5th International Conference on Information Communication and Signal Processing (ICICSP). IEEE, 2022. Chen, Yaosen, et al. "Video summarization with u-shaped transformer." Applied Intelligence 52.15 (2022): 17864-17880. Yan, Xue, et al. "Multimodal based attention-pyramid for predicting pedestrian trajectory." Journal of Electronic Imaging 31.5 (2022): 053008-053008. Reboud, A. (2022). Towards automatic understanding of narrative audiovisual content (Doctoral dissertation, Sorbonne université).
------------------------------------------------------------------
2021 (62)
------------------------------------------------------------------
Chen, Aozhu, et al. "What matters for ad-hoc video search? A large-scale evaluation on TRECVID." Proceedings of the IEEE/CVF International Conference on Computer Vision. 2021. Dong, Jianfeng, et al. "Dual encoding for video retrieval by text." IEEE Transactions on Pattern Analysis and Machine Intelligence (2021). Rashmi, B. S., and H. S. Nagendraswamy. "Video shot boundary detection using block based cumulative approach." Multimedia Tools and Applications 80.1 (2021): 641-664. Hao, Yanbin, Chong-Wah Ngo, and Bin Zhu. "Learning to match anchor-target video pairs with dual attentional holographic networks."
IEEE Transactions on Image Processing 30 (2021): 8130-8143. Gkountakos, Konstantinos, et al. "Visual Recognition of Abnormal Activities in Video Streams." Technology Development for Security Practitioners. Springer, Cham, 2021. 151-165. Reboud, Alison, et al. "Exploring multimodality, perplexity and explainability for memorability prediction." Multimedia Benchmark Workshop. 2021. Luo, Minnan, Xiaojun Chang, and Chen Gong. "Reliable shot identification for complex event detection via visual-semantic embedding." Computer Vision and Image Understanding 213 (2021): 103300. Yang, Wenhao, et al. "Instance Search via Fusing Hierarchical Multi-level Retrieval and Human-object Interaction Detection." Proceedings of the IEEE/CVF International Conference on Computer Vision. 2021. Dzabraev, Maksim, et al. "Mdmmt: Multidomain multimodal transformer for video retrieval." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2021. Chakraborty, Saptarshi, Dalton Meitei Thounaojam, and Nidul Sinha. "A shot boundary detection technique based on visual colour information." Multimedia Tools and Applications 80.3 (2021): 4007-4022. Nguyen, E-Ro, et al. "HCMUS at MediaEval2021: Attention-based Hierarchical Fusion Network for Predicting Media Memorability." (2021). Kleinlein, Ricardo, Cristina Luna-Jiménez, and Fernando Fernández-Martínez. "THAU-UPM at MediaEval 2021: From Video Semantics To Memorability Using Pretrained Transformers." (2021). Savran Kiziltepe, Rukiye, et al. "Overview of The MediaEval 2021 Predicting Media Memorability Task." (2021). Presa-Reyes, Maria, et al. "Deep Learning With Weak Supervision for Disaster Scene Description in Low-Altitude Imagery." IEEE transactions on geoscience and remote sensing 60 (2021): 1-10. Li, Changsheng, et al. "Deep Unsupervised Active Learning via Matrix Sketching." IEEE Transactions on Image Processing 30 (2021): 9280-9293. Chakraborty, Saptarshi, and Dalton Meitei Thounaojam. 
"SBD-Duo: a dual stage shot boundary detection technique robust to motion and illumination effect." Multimedia Tools and Applications 80.2 (2021): 3071-3087. Apostolidis, Evlampios, et al. "Video summarization using deep neural networks: A survey." Proceedings of the IEEE 109.11 (2021): 1838-1863. Lokoč, Jakub, et al. "Is the reign of interactive search eternal? Findings from the video browser showdown 2020." ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM) 17.3 (2021): 1-26. Raja, T. Naga, V. V. Ramana, and A. Damodaram. "A Novel Framework for Video Retrieval Algorithm Evaluations and Methods for Effective Context-Aware Video Content Retrial Method on Cloud." Proceedings of International Conference on Advances in Computer Engineering and Communication Systems. Springer, Singapore, 2021. Jin, Yang, et al. "Zero-shot video event detection with high-order semantic concept discovery and matching." IEEE Transactions on Multimedia 24 (2021): 1896-1908. Chavate, Shrikant, and Ravi Mishra. "A comparison of different procedures for hardware-based video shot boundary detection." Advances in Image and Data Processing using VLSI Design, Volume 1: Smart vision systems. IOP Publishing, 2021. Nishimoto, Koki, and Kimiaki Shirahama. "Acquisition of Human's Memory Mechanism for Video Frames." ITE Technical Report; ITE Tech. Rep. 45.31 (2021): 17-20. Dilawari, Aniqa, et al. "Natural language description of videos for smart surveillance." Applied Sciences 11.9 (2021): 3730. Gowri, S., et al. "Human Action Detection Using Deep Learning." Machine Learning for Predictive Analysis. Springer, Singapore, 2021. 229-235. Constantin, Mihai Gabriel, and Bogdan Ionescu. "Using Vision Transformers and Memorable Moments for the Prediction of Video Memorability." (2021). Nandini, H. M., H. K. Chethan, and B. S. Rashmi. "An efficient method for video shot transition detection using probability binary weight Approach." 
International Journal of Computer Vision and Image Processing (IJCVIP) 11.3 (2021): 1-20. Mishra, Ravi. "Video shot boundary detection using hybrid dual tree complex wavelet transform with Walsh Hadamard transform." Multimedia Tools and Applications 80.18 (2021): 28109-28135. Xinwei, Li, Xu Lianghao, and Yang Yi. "Compact video fingerprinting via an improved capsule net." Systems Science & Control Engineering 9.sup1 (2021): 122-130. Chavate, Shrikant, Ravi Mishra, and Pranay Yadav. "A Comparative Analysis of Video Shot Boundary Detection using Different Approaches." 2021 10th International Conference on System Modeling & Advancement in Research Trends (SMART). IEEE, 2021. Dong, Jianfeng, et al. "Multi-level alignment network for domain adaptive cross-modal retrieval." Neurocomputing 440 (2021): 207-219. Nguyen, Phuong-Anh, and Chong-Wah Ngo. "Interactive search vs. automatic search: an extensive study on video retrieval." ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM) 17.2 (2021): 1-24. Jose, Jasmin T., and S. Rajkumar. "Multiple Grey-scale Feature Based Shot Boundary Detection." 2021 Asian Conference on Innovation in Technology (ASIANCON). IEEE, 2021. Han, Tingting, Yuankai Qi, and Suguo Zhu. "A Continuous Semantic Embedding Method for Video Compact Representation." Electronics 10.24 (2021): 3106. Kiziltepe, Rukiye Savran, et al. "An annotated video dataset for computing video memorability." Data in Brief 39 (2021): 107671. Wu, Jiaxin, et al. "SQL-like interpretable interactive video search." International Conference on Multimedia Modeling. Springer, Cham, 2021. Huang, Qing, Hongcai Feng, and Li Liu. "A Video Scene Segmentation Optimization Algorithm Based on Convolutional Neural Network." 2021 2nd International Conference on Intelligent Computing and Human-Computer Interaction (ICHCI). IEEE, 2021. Iinuma, Yuko, and Shin'ichi Satoh. "Video Action Retrieval Using Action Recognition Model." 
Proceedings of the 2021 International Conference on Multimedia Retrieval. 2021. Lu, Youwei, and Xiaoyu Wu. "Cross-modal Interaction for Video Memorability Prediction." (2021). Galanopoulos, Damianos, and Vasileios Mezaris. "Hard-negatives or Non-negatives? A hard-negative selection strategy for cross-modal retrieval using the improved marginal ranking loss." Proceedings of the IEEE/CVF International Conference on Computer Vision. 2021. Galanopoulos, Damianos, et al. "Automatic and Semi-automatic Augmentation of Migration Related Semantic Concepts for Visual Media Retrieval." Proceedings of the 2021 Workshop on Open Challenges in Online Social Networks. 2021. Gkountakos, Konstantinos, et al. "Spatio-temporal activity detection and recognition in untrimmed surveillance videos." Proceedings of the 2021 International Conference on Multimedia Retrieval. 2021. Kumar, Neetish. "Shot Boundary Detection Framework For Video Editing Via Adaptive Thresholds And Gradual Curve Point." Turkish Journal of Computer and Mathematics Education (TURCOMAT) 12.11 (2021): 3820-3828. Godil, Afzal, et al. "2020 Sequestered Data Evaluation for Known Activities in Extended Video: Summary and Results." Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision. 2021. Manjunath Aradhya, V. N., H. T. Basavaraju, and Devanur S. Guru. "Decade research on text detection in images/videos: a review." Evolutionary Intelligence 14.2 (2021): 405-431. Hezel, Nico, et al. "Video search with sub-image keyword transfer using existing image archives." International Conference on Multimedia Modeling. Springer, Cham, 2021. Rizve, Mamshad Nayeem, et al. "Gabriella: An online system for real-time activity detection in untrimmed security videos." 2020 25th International Conference on Pattern Recognition (ICPR). IEEE, 2021. Pavithra, N., and Y. H. Sharath Kumar. "FSBRS: Framework for Sketch-Based Retrieval System of the Color Images." Soft Computing and Signal Processing. 
Springer, Singapore, 2021. 1-15. Bouyahi, Mohamed, and Yassine Ben Ayed. "Multimodal features for shots boundary detection." Thirteenth International Conference on Machine Vision. Vol. 11605. SPIE, 2021. Jiang, Xuekun, et al. "Jointly Learning the Attributes and Composition of Shots for Boundary Detection in Videos." IEEE Transactions on Multimedia (2021). Tao, Jianwen, Yufang Dan, and Di Zhou. "Robust multi-source co-adaptation with adaptive loss minimization." Signal Processing: Image Communication 99 (2021): 116455. Valand, Joakim O., et al. "Automated Clipping of Soccer Events using Machine Learning." 2021 IEEE International Symposium on Multimedia (ISM). IEEE, 2021. Thallinger, Georg, and Werner Bailer. "Automatic Analysis of Amateur Film and Video Collections." 2021 International Conference on Content-Based Multimedia Indexing (CBMI). IEEE, 2021. Yao, Wei, et al. "Early and Late Fusion of Multiple Modalities in Sentinel Imagery and Social Media Retrieval." International Conference on Pattern Recognition. Springer, Cham, 2021. Valand, Joakim Olav, et al. "AI-Based Video Clipping of Soccer Events." Machine Learning and Knowledge Extraction 3.4 (2021): 990-1008. Lan, Libin, and Chunxiao Ye. "Recurrent generative adversarial networks for unsupervised WCE video summarization." Knowledge-Based Systems 222 (2021): 106971. Zhu, Yunzhang, et al. "Collaborative multilabel classification." Journal of the American Statistical Association (2021): 1-12. Li, Yuke, Pin Wang, and Ching-Yao Chan. "RESTEP into the future: relational spatio-temporal learning for multi-person action forecasting." IEEE Transactions on Multimedia (2021). Li, Yu-Ke, et al. "Imitative Learning for Multi-Person Action Forecasting." Proceedings of the 29th ACM International Conference on Multimedia. 2021. Apostolidis, Evlampios, et al. "Combining global and local attention with positional encoding for video summarization." 2021 IEEE International Symposium on Multimedia (ISM). IEEE, 2021. 
Chen, Bo, Decai Li, and Yuqing He. "Simultaneous Prediction of Pedestrian Trajectory and Actions based on Context Information Iterative Reasoning." 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, 2021. Chen, Bo, et al. "SCR-graph: Spatial-causal relationships based graph reasoning network for human action prediction." The 2nd International Conference on Computing and Data Science. 2021. Kavitha, R., and D. Chitra. "An improved hybridized deep structured model for accurate video event recognition." Journal of Ambient Intelligence and Humanized Computing 12.6 (2021): 6019-6028.
------------------------------------------------------------------
2020 (55)
------------------------------------------------------------------
Mejzlík, F. (2020). Evaluation of Keyword-Based Search Models for Known-Item Search. Wang, Ying, Yongchen Wang, Cong Shi, Long Cheng, Huawei Li, and Xiaowei Li. "An Edge 3D CNN Accelerator for Low-Power Activity Recognition." IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems 40, no. 5 (2020): 918-930. Zafirova, Deana. "Shot Boundary Detection: A fundamental base for automatic video analysis." PhD diss., Wien, 2020. Atencio Ortiz, Pedro Sandino. "Query-based Video summarization using machine learning and coordinated representations." Bátoryová, Jana. "Searching Image Collections Using Deep Representations of Local Regions." (2020). Wang, Han, Hao Song, Xinxiao Wu, and Yunde Jia. "Incremental transfer learning for video annotation via grouped heterogeneous sources." IET Computer Vision 14, no. 1 (2020): 26-35. Čech, Přemysl, Jakub Lokoč, and Yasin N. Silva. "Pivot-based approximate k-NN similarity joins for big high-dimensional data." Information Systems 87 (2020): 101410. Fan, S., Shen, Z., Koenig, B.L., Ng, T.T. and Kankanhalli, M.S., 2020. When and Why Static Images Are More Effective Than Videos. IEEE Transactions on Affective Computing, (01), pp.1-1.
Lin, Sung-Chiang, Chih-Jou Chen, and Tsung-Ju Lee. "A multi-label classification with hybrid label-based meta-learning method in internet of things." IEEE Access 8 (2020): 42261-42269. Harsha, B. K., and G. Indumathi. "Skin Detection in Images based on Pattern Matching Algorithms-A Review." In 2020 International Conference on Inventive Computation Technologies (ICICT), pp. 359-363. IEEE, 2020. Qian, Yijun, Lijun Yu, Wenhe Liu, Guoliang Kang, and Alexander G. Hauptmann. "Adaptive feature aggregation for video object detection." In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision Workshops, pp. 143-147. 2020. Sandeep, R., and Prabin Kumar Bora. "Detection of Malicious Video Modifications using Perceptual Video Hashing." 2020 5th International Conference on Computing, Communication and Security (ICCCS). IEEE, 2020. Apostolidis, E., Adamantidou, E., Metsai, A.I., Mezaris, V. and Patras, I., Unsupervised Video Summarization via Attention-Driven Adversarial Learning. Wang, Liyuan, et al. "Multilevel fusion of multimodal deep features for porn streamer recognition in live video." Pattern Recognition Letters 140 (2020): 150-157. Liu, Yanbing, Sanjev Dhakal, and Binyao Hao. "Multimedia image and video retrieval based on an improved HMM." Multimedia Systems (2020): 1-11. Bekhet, Saddam, and Amr Ahmed. "Evaluation of similarity measures for video retrieval." Multimedia Tools and Applications 79.9 (2020): 6265-6278. Subudhi, Badri Narayan, et al. "Automatic lecture video skimming using shot categorization and contrast based features." Expert Systems with Applications 149 (2020): 113341. Gornishka, Iva, Stevan Rudinac, and Marcel Worring. "Interactive Search and Exploration in Discussion Forums Using Multimodal Embeddings." International Conference on Multimedia Modeling. Springer, Cham, 2020. Qi, Haifeng, et al. "Hash length: a neglected element." Multimedia Tools and Applications 79.7 (2020): 4763-4782. Jónsson, Björn Þór, et al. 
"Exquisitor at the video browser showdown 2020." International Conference on Multimedia Modeling. Springer, Cham, 2020. Kim, Byoungjun, et al. "Deep Learning-Based Video Retrieval Using Object Relationships and Associated Audio Classes." International Conference on Multimedia Modeling. Springer, Cham, 2020. Mantsis, Damianos Florin, et al. "Multimodal Fusion of Sentinel 1 Images and Social Media Data for Snow Depth Estimation." IEEE Geoscience and Remote Sensing Letters (2020). Lee, Yooyoung, et al. "Summary of the 2019 Activity Detection in Extended Videos Prize Challenge." Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision Workshops. 2020. Shen, Ling, Richang Hong, and Yanbin Hao. "Advance on large scale near-duplicate video retrieval." Frontiers of Computer Science 14.5 (2020): 1-24. Bhaumik, Hrishikesh, Siddhartha Bhattacharyya, and Susanta Chakraborty. "Real-time video segmentation using a vague adaptive threshold." Hybrid Computational Intelligence. Academic Press, 2020. 191-220. Jadhav, Dattatraya, Yogesh Kumar Sharma, and Dr Arora. "Profound Learning Approach for Shot Boundary Location." Available at SSRN 3645409 (2020). Nandini, H. M., H. K. Chethan, and B. S. Rashmi. "Shot based keyframe extraction using edge-LBP approach." Journal of King Saud University-Computer and Information Sciences (2020). Xinwei, Li, Xu Lianghao, and Yang Yi. "Compact video fingerprinting via an improved capsule net." Systems Science & Control Engineering (2020): 1-9. GogiReddy, Hema Sundara Srinivasula Reddy, and Neelam Sinha. "Video Key Frame Detection Using Block Sparse Coding." Proceedings of 3rd International Conference on Computer Vision and Image Processing. Springer, Singapore, 2020. Janwe, Nitin, and Kishor Bhoyar. "Semantic concept based video retrieval using convolutional neural network." SN Applied Sciences 2.1 (2020): 1-8. Kar, T., and P. Kanungo. "Abrupt Scene Change Detection Using Block Based Local Directional Pattern." 
Data Management, Analytics and Innovation. Springer, Singapore, 2020. 191-203. Soboroff, Ian, et al. "Evaluating Multimedia and Language Tasks." Frontiers in Artificial Intelligence 3 (2020). Andreadis, Stelios, et al. "Verge in vbs 2020." International Conference on Multimedia Modeling. Springer, Cham, 2020. Sasithradevi, A., and S. Mohamed Mansoor Roomi. "A new pyramidal opponent color-shape model based video shot boundary detection." Journal of Visual Communication and Image Representation 67 (2020): 102754. Sasithradevi, A., and S. Mohamed Mansoor Roomi. "Video classification and retrieval through spatio-temporal Radon features." Pattern Recognition 99 (2020): 107099. Jakub Lokoč, Tomáš Souček, Patrik Veselý, František Mejzlík, Jiaqi Ji, Chaoxi Xu, and Xirong Li. 2020. A W2VV++ Case Study with Automated and Interactive Text-to-Video Retrieval. In Proceedings of the 28th ACM International Conference on Multimedia (MM '20). Association for Computing Machinery, New York, NY, USA, 2553–2561. DOI:https://doi.org/10.1145/3394171.3414002 Jiaxin Wu and Chong-Wah Ngo. 2020. Interpretable Embedding for Ad-Hoc Video Search. In Proceedings of the 28th ACM International Conference on Multimedia (MM '20). Association for Computing Machinery, New York, NY, USA, 3357–3366. DOI:https://doi.org/10.1145/3394171.3413916 Zhihui Li, Xiaojun Chang, Lina Yao, Shirui Pan, Ge Zongyuan, and Huaxiang Zhang. 2020. Grounding Visual Concepts for Zero-Shot Event Detection and Event Captioning. In Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (KDD '20). Association for Computing Machinery, New York, NY, USA, 297–305. DOI:https://doi.org/10.1145/3394486.3403072 Shuo Chen, Pascal Mettes, Tao Hu, and Cees G.M. Snoek. 2020. Interactivity Proposals for Surveillance Videos. In Proceedings of the 2020 International Conference on Multimedia Retrieval (ICMR '20). Association for Computing Machinery, New York, NY, USA, 108–116.
DOI:https://doi.org/10.1145/3372278.3390680 Damianos Galanopoulos and Vasileios Mezaris. 2020. Attention Mechanisms, Signal Encodings and Fusion Strategies for Improved Ad-hoc Video Search with Dual Encoding Networks. In Proceedings of the 2020 International Conference on Multimedia Retrieval (ICMR '20). Association for Computing Machinery, New York, NY, USA, 336–340. DOI:https://doi.org/10.1145/3372278.3390737 Pascal Mettes, Dennis C. Koelma, and Cees G. M. Snoek. 2020. Shuffled ImageNet Banks for Video Event Detection and Search. ACM Trans. Multimedia Comput. Commun. Appl. 16, 2, Article 44 (June 2020), 21 pages. DOI:https://doi.org/10.1145/3377875 Kazuya Ueki and Takayuki Hori. 2020. Comparison and Evaluation of Video Retrieval Approaches Using Query Sentences. In Proceedings of the 2020 2nd International Conference on Intelligent Medicine and Image Processing (IMIP 2020). Association for Computing Machinery, New York, NY, USA, 103–107. DOI:https://doi.org/10.1145/3399637.3399657 X. Li, H. Li and Y. Dong, "Meta Learning for Task-Driven Video Summarization," in IEEE Transactions on Industrial Electronics, vol. 67, no. 7, pp. 5778-5786, July 2020, doi: 10.1109/TIE.2019.2931283. S. H. Abdulhussain, S. A. R. Al-Haddad, M. I. Saripan, B. M. Mahmmod and A. Hussien, "Fast Temporal Video Segmentation Based on Krawtchouk-Tchebichef Moments," in IEEE Access, vol. 8, pp. 72347-72359, 2020, doi: 10.1109/ACCESS.2020.2987870. X. Wang, Q. Wang and H. Wang, "Active Video Hashing via Structure Information Learning for Activity Analysis," in IEEE Access, vol. 8, pp. 96428-96437, 2020, doi: 10.1109/ACCESS.2020.2994783. J. Gleason, S. Schwarcz, R. Ranjan, C. D. Castillo, J. Chen and R. Chellappa, "Activity Detection in Untrimmed Videos Using Chunk-based Classifiers," 2020 IEEE Winter Applications of Computer Vision Workshops (WACVW), Snowmass, CO, USA, 2020, pp. 107-116, doi: 10.1109/WACVW50321.2020.9096912. J. Gleason, C. D. Castillo and R. 
Chellappa, "Real-time Detection of Activities in Untrimmed Videos," 2020 IEEE Winter Applications of Computer Vision Workshops (WACVW), Snowmass, CO, USA, 2020, pp. 117-125, doi: 10.1109/WACVW50321.2020.9096937. F.-F. Duan and F. Meng, "Video Shot Boundary Detection Based on Feature Fusion and Clustering Technique," in IEEE Access, vol. 8, pp. 214633-214645, 2020, doi: 10.1109/ACCESS.2020.3040861. C. Wang, L. Pang, X. Jiang and L. Jin, "SVD of Shot Boundary Detection Based on Accumulative Difference," 2020 International Conference on Culture-oriented Science & Technology (ICCST), Beijing, China, 2020, pp. 367-372, doi: 10.1109/ICCST50977.2020.00077. X. Li, F. Zhou, C. Xu, J. Ji and G. Yang, "SEA: Sentence Encoder Assembly for Video Retrieval by Textual Queries," in IEEE Transactions on Multimedia, doi: 10.1109/TMM.2020.3042067. F. Hertlein, D. Münch and M. Arens, "Context Sensitivity of Spatio-Temporal Activity Detection using Hierarchical Deep Neural Networks in Extended Videos," 2020 IEEE Winter Applications of Computer Vision Workshops (WACVW), Snowmass, CO, USA, 2020, pp. 134-142, doi: 10.1109/WACVW50321.2020.9096934. H. M. Nandini, H. K. Chethan and B. S. Rashmi, "Abrupt Shot Change Detection using Midhinge Local Binary Pattern," 2020 IEEE-HYDCON, Hyderabad, India, 2020, pp. 1-5, doi: 10.1109/HYDCON48903.2020.9242841. H.-Q. Vo et al., "Searching For Desired Person Doing Desired Action based on Visual and Audio Feature in Large Scale Video Database," 2020 International Conference on Multimedia Analysis and Pattern Recognition (MAPR), Ha Noi, Vietnam, 2020, pp. 1-6, doi: 10.1109/MAPR49794.2020.9237781. Y. Hao, C. Ngo and B. Huet, "Neighbourhood Structure Preserving Cross-Modal Embedding for Video Hyperlinking," in IEEE Transactions on Multimedia, vol. 22, no. 1, pp. 188-200, Jan. 2020, doi: 10.1109/TMM.2019.2923121. W.
Liu et al., "Argus: Efficient Activity Detection System for Extended Video Analysis," 2020 IEEE Winter Applications of Computer Vision Workshops (WACVW), Snowmass, CO, USA, 2020, pp. 126-133, doi: 10.1109/WACVW50321.2020.9096929. ------------------------------------------------------------------ 2019 (90) ------------------------------------------------------------------ L. Yao and Y. Qian, "Novel Activities Detection Algorithm in Extended Videos," 2019 IEEE Winter Applications of Computer Vision Workshops (WACVW), Waikoloa Village, HI, USA, 2019, pp. 9-15, doi: 10.1109/WACVW.2019.00009. J. Dong et al., "Dual Encoding for Zero-Example Video Retrieval," 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA, 2019, pp. 9338-9347, doi: 10.1109/CVPR.2019.00957. S. Aakur, D. Sawyer and S. Sarkar, "Fine-grained Action Detection in Untrimmed Surveillance Videos," 2019 IEEE Winter Applications of Computer Vision Workshops (WACVW), Waikoloa Village, HI, USA, 2019, pp. 38-40, doi: 10.1109/WACVW.2019.00014. D. Francis, P. A. Nguyen, B. Huet and C. Ngo, "Fusion of Multimodal Embeddings for Ad-Hoc Video Search," 2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW), Seoul, Korea (South), 2019, pp. 1868-1872, doi: 10.1109/ICCVW.2019.00233. S. H. Abdulhussain et al., "A Fast Feature Extraction Algorithm for Image and Video Processing," 2019 International Joint Conference on Neural Networks (IJCNN), Budapest, Hungary, 2019, pp. 1-8, doi: 10.1109/IJCNN.2019.8851750. Z. Zhou, J. Chen, C. Yang and X. Sun, "Video Copy Detection Using Spatio-Temporal CNN Features," in IEEE Access, vol. 7, pp. 100658-100665, 2019, doi: 10.1109/ACCESS.2019.2930173. Y. Gao, Y. Lai and Y. Liu, "Fast Video Shot Boundary Detection Based on Visual Perception," 2019 IEEE International Conference on Consumer Electronics (ICCE), Las Vegas, NV, USA, 2019, pp. 1-4, doi: 10.1109/ICCE.2019.8662083. Y. Wang, Y. Wang, H. Li, C. Shi and X. 
Li, "Systolic Cube: A Spatial 3D CNN Accelerator Architecture for Low Power Video Analysis," 2019 56th ACM/IEEE Design Automation Conference (DAC), Las Vegas, NV, USA, 2019, pp. 1-6. R. Thomanek et al., "A Scalable System Architecture for Activity Detection with Simple Heuristics," 2019 IEEE Winter Applications of Computer Vision Workshops (WACVW), Waikoloa Village, HI, USA, 2019, pp. 27-34, doi: 10.1109/WACVW.2019.00012. J. Chen et al., "Minding the Gaps in a Video Action Analysis Pipeline," 2019 IEEE Winter Applications of Computer Vision Workshops (WACVW), Waikoloa Village, HI, USA, 2019, pp. 41-46, doi: 10.1109/WACVW.2019.00015. L. Yu, P. Chen, W. Liu, G. Kang and A. G. Hauptmann, "Training-free Monocular 3D Event Detection System for Traffic Surveillance," 2019 IEEE International Conference on Big Data (Big Data), Los Angeles, CA, USA, 2019, pp. 3838-3843, doi: 10.1109/BigData47090.2019.9006063. X. Peng, R. Li, J. Wang and H. Shang, "User-Guided Clustering for Video Segmentation on Coarse-Grained Feature Extraction," in IEEE Access, vol. 7, pp. 149820-149832, 2019, doi: 10.1109/ACCESS.2019.2946889. J. Gleason, R. Ranjan, S. Schwarcz, C. Castillo, J. Chen and R. Chellappa, "A Proposal-Based Solution to Spatio-Temporal Action Detection in Untrimmed Videos," 2019 IEEE Winter Conference on Applications of Computer Vision (WACV), Waikoloa Village, HI, USA, 2019, pp. 141-150, doi: 10.1109/WACV.2019.00021. A. Yazici, M. Koyuncu, S. A. Sert and T. Yilmaz, "A Fusion-Based Framework for Wireless Multimedia Sensor Networks in Surveillance Applications," in IEEE Access, vol. 7, pp. 88418-88434, 2019, doi: 10.1109/ACCESS.2019.2926206. S. Lal, S. Duggal and I. Sreedevi, "Online Video Summarization: Predicting Future to Better Summarize Present," 2019 IEEE Winter Conference on Applications of Computer Vision (WACV), Waikoloa Village, HI, USA, 2019, pp. 471-480, doi: 10.1109/WACV.2019.00056. H. Zhang and C. 
Ngo, "A Fine Granularity Object-Level Representation for Event Detection and Recounting," in IEEE Transactions on Multimedia, vol. 21, no. 6, pp. 1450-1463, June 2019, doi: 10.1109/TMM.2018.2884478. F. Markatopoulou, V. Mezaris and I. Patras, "Implicit and Explicit Concept Relations in Deep Neural Networks for Multi-Label Video/Image Annotation," in IEEE Transactions on Circuits and Systems for Video Technology, vol. 29, no. 6, pp. 1631-1644, June 2019, doi: 10.1109/TCSVT.2018.2848458. Z. Gao, L. Wang, N. Jojic, Z. Niu, N. Zheng and G. Hua, "Video Imprint," in IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 41, no. 12, pp. 3086-3099, 1 Dec. 2019, doi: 10.1109/TPAMI.2018.2866114. S. S. Thomas, S. Gupta and V. K. Subramanian, "Context Driven Optimized Perceptual Video Summarization and Retrieval," in IEEE Transactions on Circuits and Systems for Video Technology, vol. 29, no. 10, pp. 3132-3145, Oct. 2019, doi: 10.1109/TCSVT.2018.2873185. M. Elfeki and A. Borji, "Video Summarization Via Actionness Ranking," 2019 IEEE Winter Conference on Applications of Computer Vision (WACV), Waikoloa Village, HI, USA, 2019, pp. 754-763, doi: 10.1109/WACV.2019.00085. W. Xie, H. Yao, X. Sun, T. Han, S. Zhao and T. Chua, "Discovering Latent Discriminative Patterns for Multi-Mode Event Representation," in IEEE Transactions on Multimedia, vol. 21, no. 6, pp. 1425-1436, June 2019, doi: 10.1109/TMM.2018.2879749. A. Dilawari and M. U. G. Khan, "ASoVS: Abstractive Summarization of Video Sequences," in IEEE Access, vol. 7, pp. 29253-29263, 2019, doi: 10.1109/ACCESS.2019.2902507. Z. Lu, L. Wu, M. Jian, S. Zhang, D. Wang and X. Wang, "Shot Boundary Detection with Key Motion Estimation and Appearance Differentiation," 2019 IEEE International Conference on Signal, Information and Data Processing (ICSIDP), Chongqing, China, 2019, pp. 1-7, doi: 10.1109/ICSIDP47821.2019.9173023. K. 
Liao et al., "IR Feature Embedded BOF Indexing Method for Near-Duplicate Video Retrieval," in IEEE Transactions on Circuits and Systems for Video Technology, vol. 29, no. 12, pp. 3743-3753, Dec. 2019, doi: 10.1109/TCSVT.2018.2884941. M. Ma, S. Mei, S. Wan, Z. Wang and D. Feng, "Video Summarization via Nonlinear Sparse Dictionary Selection," in IEEE Access, vol. 7, pp. 11763-11774, 2019, doi: 10.1109/ACCESS.2019.2891834. Y. Yuan, T. Mei, P. Cui and W. Zhu, "Video Summarization by Learning Deep Side Semantic Embedding," in IEEE Transactions on Circuits and Systems for Video Technology, vol. 29, no. 1, pp. 226-237, Jan. 2019, doi: 10.1109/TCSVT.2017.2771247. L. Wu, S. Zhang, M. Jian, Z. Lu and D. Wang, "Two Stage Shot Boundary Detection via Feature Fusion and Spatial-Temporal Convolutional Neural Networks," in IEEE Access, vol. 7, pp. 77268-77276, 2019, doi: 10.1109/ACCESS.2019.2922038. H. Tao, C. Hou, D. Yi, J. Zhu and D. Hu, "Joint Embedding Learning and Low-Rank Approximation: A Framework for Incomplete Multiview Learning," in IEEE Transactions on Cybernetics, doi: 10.1109/TCYB.2019.2953564. P. Gunawardena et al., "Interest-Oriented Video Summarization with Keyframe Extraction," 2019 19th International Conference on Advances in ICT for Emerging Regions (ICTer), Colombo, Sri Lanka, 2019, pp. 1-8, doi: 10.1109/ICTer48817.2019.9023769. M. Gong, H. Li, D. Meng, Q. Miao and J. Liu, "Decomposition-Based Evolutionary Multiobjective Optimization to Self-Paced Learning," in IEEE Transactions on Evolutionary Computation, vol. 23, no. 2, pp. 288-302, April 2019, doi: 10.1109/TEVC.2018.2850769. H. Li, M. Gong, C. Wang and Q. Miao, "Pareto Self-Paced Learning Based on Differential Evolution," in IEEE Transactions on Cybernetics, doi: 10.1109/TCYB.2019.2935762. F. Yang and S. 
Satoh, "Burst-survive Temporal Matching Kernel with Fibonacci Periods," ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brighton, United Kingdom, 2019, pp. 2062-2066, doi: 10.1109/ICASSP.2019.8682971. Schoeffmann, Klaus. "Video browser showdown 2012-2019: A review." 2019 International Conference on Content-Based Multimedia Indexing (CBMI). IEEE, 2019. Xirong Li, Chaoxi Xu, Gang Yang, Zhineng Chen, and Jianfeng Dong. 2019. W2VV++: Fully Deep Learning for Ad-hoc Video Search. In Proceedings of the 27th ACM International Conference on Multimedia. Association for Computing Machinery, New York, NY, USA, 1786-1794. DOI:https://doi.org/10.1145/3343031.3350906 Zheng Wang, Fan Yang, and Shin'ichi Satoh. 2019. Salient Time Slice Pruning and Boosting for Person-Scene Instance Search in TV Series. In Proceedings of the ACM Multimedia Asia (MMAsia '19). Association for Computing Machinery, New York, NY, USA, Article 27, 1-6. DOI:https://doi.org/10.1145/3338533.3366594 Kashif Ahmad and Nicola Conci. 2019. How Deep Features Have Improved Event Recognition in Multimedia: A Survey. ACM Trans. Multimedia Comput. Commun. Appl. 15, 2, Article 39 (June 2019), 27 pages. DOI:https://doi.org/10.1145/3306240 Fabian Berns, Luca Rossetto, Klaus Schoeffmann, Christian Beecks, and George Awad. 2019. V3C1 Dataset: An Evaluation of Content Characteristics. In Proceedings of the 2019 on International Conference on Multimedia Retrieval (ICMR '19). Association for Computing Machinery, New York, NY, USA, 334-338. DOI:https://doi.org/10.1145/3323873.3325051 Hung-Quoc Vo, Vu-Minh-Hieu Dang, Vinh-Tiep Nguyen, and Duy-Dinh Le. 2019. Noise Removal Based Query Pre-processing to Improve Face Search Performance in Large Scale Video Databases. In Proceedings of the Tenth International Symposium on Information and Communication Technology (SoICT 2019). Association for Computing Machinery, New York, NY, USA, 357-361. 
DOI:https://doi.org/10.1145/3368926.3369727 Xirong Li. 2019. Deep Learning for Video Retrieval by Natural Language. In Proceedings of the 1st International Workshop on Fairness, Accountability, and Transparency in MultiMedia (FAT/MM '19). Association for Computing Machinery, New York, NY, USA, 2-3. DOI:https://doi.org/10.1145/3347447.3350565 Mohamed Hamroun, Sonia Lajmi, Henri Nicolas, and Ikram Amous. 2019. Large-Scale Semantic Concept Detection Based On Visual Contents. In Proceedings of the 17th International Conference on Advances in Mobile Computing & Multimedia (MoMM2019). Association for Computing Machinery, New York, NY, USA, 165-174. DOI:https://doi.org/10.1145/3365921.3365925 Jakub Lokoc, Gregor Kovalcik, Bernd Münzer, Klaus Schöffmann, Werner Bailer, Ralph Gasser, Stefanos Vrochidis, Phuong Anh Nguyen, Sitapa Rujikietgumjorn, and Kai Uwe Barthel. 2019. Interactive Search or Sequential Browsing? A Detailed Analysis of the Video Browser Showdown 2018. ACM Trans. Multimedia Comput. Commun. Appl. 15, 1, Article 29 (February 2019), 18 pages. DOI:https://doi.org/10.1145/3295663 Madhushree Basavarajaiah and Priyanka Sharma. 2019. Survey of Compressed Domain Video Summarization Techniques. ACM Comput. Surv. 52, 6, Article 116 (January 2020), 29 pages. DOI:https://doi.org/10.1145/3355398 Mohamed Hamroun, Sonia Lajmi, Henri Nicolas, and Ikram Amous. 2019. VISEN: a video interactive retrieval engine based on semantic network in large video collections. In Proceedings of the 23rd International Database Applications & Engineering Symposium (IDEAS '19). Association for Computing Machinery, New York, NY, USA, Article 25, 1-10. DOI:https://doi.org/10.1145/3331076.3331094 Yujia Zhang, Michael Kampffmeyer, Xiaoguang Zhao, and Min Tan. 2019. DTR-GAN: dilated temporal relational adversarial network for video summarization. In Proceedings of the ACM Turing Celebration Conference - China (ACM TURC '19). Association for Computing Machinery, New York, NY, USA, Article 89, 1-6. 
DOI:https://doi.org/10.1145/3321408.3322622 Yongchen Wang, Ying Wang, Huawei Li, Cong Shi, and Xiaowei Li. 2019. Systolic Cube: A Spatial 3D CNN Accelerator Architecture for Low Power Video Analysis. In Proceedings of the 56th Annual Design Automation Conference 2019 (DAC '19). Association for Computing Machinery, New York, NY, USA, Article 210, 1-6. DOI:https://doi.org/10.1145/3316781.3317919 Junbo Wang, Wei Wang, Zhiyong Wang, Liang Wang, Dagan Feng, and Tieniu Tan. 2019. Stacked Memory Network for Video Summarization. In Proceedings of the 27th ACM International Conference on Multimedia (MM '19). Association for Computing Machinery, New York, NY, USA, 836-844. DOI:https://doi.org/10.1145/3343031.3350992 Xinyu Weng, Yongzhi Li, Lu Chi, and Yadong Mu. 2019. High-Capacity Convolutional Video Steganography with Temporal Residual Modeling. In Proceedings of the 2019 on International Conference on Multimedia Retrieval (ICMR '19). Association for Computing Machinery, New York, NY, USA, 87-95. DOI:https://doi.org/10.1145/3323873.3325011 Evlampios Apostolidis, Alexandros I. Metsai, Eleni Adamantidou, Vasileios Mezaris, and Ioannis Patras. 2019. A Stepwise, Label-based Approach for Improving the Adversarial Training in Unsupervised Video Summarization. In Proceedings of the 1st International Workshop on AI for Smart TV Content Production, Access and Delivery (AI4TV '19). Association for Computing Machinery, New York, NY, USA, 17-25. DOI:https://doi.org/10.1145/3347449.3357482 Jakub Lokoč, Gregor Kovalčík, Tomáš Souček, Jaroslav Moravec, and Přemysl Čech. 2019. A Framework for Effective Known-item Search in Video. In Proceedings of the 27th ACM International Conference on Multimedia (MM '19). Association for Computing Machinery, New York, NY, USA, 1777-1785. DOI:https://doi.org/10.1145/3343031.3351046 Singh, Alok, Dalton Meitei Thounaojam, and Saptarshi Chakraborty. "A novel automatic shot boundary detection algorithm: robust to illumination and motion effect."
Signal, Image and Video Processing (2019): 1-9. Zhu, Yandong, et al. "A comprehensive solution for detecting events in complex surveillance videos." Multimedia Tools and Applications 78.1 (2019): 817-838. Rossetto, Luca, et al. "V3c-a research video collection." International Conference on Multimedia Modeling. Springer, Cham, 2019. Kavoosifar, Mohammad Reza, et al. "Effective video hyperlinking by means of enriched feature sets and monomodal query combinations." International Journal of Multimedia Information Retrieval (2019): 1-13. Patil, Nita, and Sudhir Sawarkar. "Semantic Concept Detection for Multilabel Unbalanced Dataset Using Global Features." Intelligent Communication Technologies and Virtual Mobile Networks. Springer, Cham, 2019. Saleem, Summra, et al. "Stateful human-centered visual captioning system to aid video surveillance." Computers & Electrical Engineering 78 (2019): 108-119. Smeaton, Alan F., et al. "Exploring the Impact of Training Data Bias on Automatic Generation of Video Captions." International Conference on Multimedia Modeling. Springer, Cham, 2019. Li, Zhihui, et al. "Zero-shot event detection via event-adaptive concept relevance mining." Pattern Recognition 88 (2019): 595-603. Asha, D., and Y. Madhavee Latha. "Content-Based Video Shot Boundary Detection Using Multiple Haar Transform Features." Soft Computing and Signal Processing. Springer, Singapore, 2019. 703-713. Chakraborty, Saptarshi, and Dalton Meitei Thounaojam. "A novel shot boundary detection system using hybrid optimization technique." Applied Intelligence 49.9 (2019): 3207-3220. Nguyen, Vinh-Tiep, et al. "Video instance search via spatial fusion of visual words and object proposals." International Journal of Multimedia Information Retrieval 8.3 (2019): 181-192. Yarmohammadi, Hadi, Hossein Marvi, and Hamid Hassanpour. "Application of 2-D fractal dimension in content based video summarization." International Journal of Nonlinear Analysis and Applications 10.2 (2019): 131-140. 
Roschke, Christian, et al. "Adaptation of Machine Learning Frameworks for Use in a Management Environment." International Conference on Human-Computer Interaction. Springer, Cham, 2019. Thomanek, Rico, et al. "Use of Multiple Distributed Process Instances for Activity Analysis in Videos." International Conference on Human-Computer Interaction. Springer, Cham, 2019. Ji, Hyesung, et al. "A semantic-based video scene segmentation using a deep neural network." Journal of Information Science 45.6 (2019): 833-844. Patil, Nita S., and Sudhir D. Sawarkar. "Semantic Concept Detection in Video Using Hybrid Model of CNN and SVM Classifiers." International Journal of Image Processing (IJIP) 13.2 (2019): 13-28. Helm, Daniel, and Martin Kampel. "Shot boundary detection for automatic video analysis of historical films." International Conference on Image Analysis and Processing. Springer, Cham, 2019. Hamroun, Mohamed, et al. "Descriptor Optimization for Semantic Concept Detection Using Visual Content." International Journal of Strategic Information Technology and Applications (IJSITA) 10.1 (2019): 40-59. Zlitni, Tarek, and Walid Mahdi. "Extraction and Annotation of News Topics From TV Streams for Web Video Sharing: A Contribution to Produce Reliable Online Video News Content." Knowledge-Intensive Economies and Opportunities for Social, Organizational, and Technological Growth. IGI Global, 2019. 272-294. Abdulhussain, Sadiq H., et al. "Shot boundary detection based on orthogonal polynomial." Multimedia Tools and Applications 78.14 (2019): 20361-20382. Prabavathy, A. Kethsy, and J. Devi Shree. "Histogram difference with Fuzzy rule base modeling for gradual shot boundary detection in video cloud applications." Cluster Computing 22.1 (2019): 1211-1218. Daudpota, Sher Muhammad, Atta Muhammad, and Junaid Baber. "Video genre identification using clustering-based shot detection algorithm." Signal, Image and Video Processing 13.7 (2019): 1413-1420. 
Aote, Shailendra S., and Archana Potnurwar. "An automatic video annotation framework based on two level keyframe extraction mechanism." Multimedia Tools and Applications 78.11 (2019): 14465-14484. Zhang, Dacheng, et al. "Shot boundary detection based on block-wise principal component analysis." Journal of Electronic Imaging 28.2 (2019): 023029. Liu, Mengyang, et al. "Video copy detection by conducting fast searching of inverted files." Multimedia Tools and Applications 78.8 (2019): 10601-10624. Benuwa, Ben-Bright, et al. "Group sparse based locality-sensitive dictionary learning for video semantic analysis." Multimedia Tools and Applications 78.6 (2019): 6721-6744. Benuwa, Ben-Bright, et al. "Video semantic analysis based kernel locality-sensitive discriminative sparse representation." Expert Systems with Applications 119 (2019): 429-440. Bhattacharya, Paheli, et al. "Overview of the FIRE 2019 AILA Track: Artificial Intelligence for Legal Assistance." FIRE (Working Notes). 2019. Nguyen, Phuong Anh, et al. "VIREO@ video browser showdown 2019." International Conference on Multimedia Modeling. Springer, Cham, 2019. Markatopoulou, Foteini, et al. "Finding Semantically Related Videos in Closed Collections." Video Verification in the Fake News Era. Springer, Cham, 2019. 127-159. Bhaumik, Hrishikesh, Siddhartha Bhattacharyya, and Susanta Chakraborty. "A vague set approach for identifying shot transition in videos using multiple feature amalgamation." Applied Soft Computing 75 (2019): 633-651. Kim, Tae Soo, et al. "Safer: Fine-grained activity detection by compositional hypothesis testing." Fu, Jianjing, and Jianwen Tao. "Robust multi-model adaptation regression with local feature space representation." Knowledge-Based Systems 174 (2019): 160-176. Tao, Jianwen, and Wei Dai. "Discriminative multi-source adaptation multi-feature co-regression for visual classification." Neural Networks 114 (2019): 96-118. Tao, Jianwen, et al.
"Latent multi-feature co-regression for visual recognition by discriminatively leveraging multi-source models." Pattern Recognition 87 (2019): 296-316. Bae, Gyujin, et al. "Dual-dissimilarity measure-based statistical video cut detection." Journal of Real-Time Image Processing 16.6 (2019): 1987-1997. Ma, Mingyang, et al. "Robust video summarization using collaborative representation of adjacent frames." Multimedia Tools and Applications 78.20 (2019): 28985-29005. Li, Yanping, et al. "Intraframe interpolation based on edge detection." Eleventh International Conference on Digital Image Processing (ICDIP 2019). Vol. 11179. International Society for Optics and Photonics, 2019. Ji, Zhong, et al. "Query-aware sparse coding for web multi-video summarization." Information Sciences 478 (2019): 152-166. Zhang, Yujia, et al. "Dilated temporal relational adversarial network for generic video summarization." Multimedia Tools and Applications 78.24 (2019): 35237-35261. Bekhet, Saddam, and Amr Ahmed. "Video similarity detection using fixed-length statistical dominant colour profile (SDCP) signatures." Journal of Real-Time Image Processing (2019): 1-16. ------------------------------------------------------------------- 2018 (67) ------------------------------------------------------------------- Anastasia Moumtzidou, Stelios Andreadis, Ilias Gialampoukidis, Anastasios Karakostas, Stefanos Vrochidis, and Ioannis Kompatsiaris. 2018. Flood Relevance Estimation from Visual and Textual Content in Social Media Streams. In Companion Proceedings of the The Web Conference 2018 (WWW ’18). International World Wide Web Conferences Steering Committee, Republic and Canton of Geneva, CHE, 1621–1627. DOI:https://doi.org/10.1145/3184558.3191620 Andrea Ceroni, Chenyang Ma, and Ralph Ewerth. 2018. Mining Exoticism from Visual Content with Fusion-based Deep Neural Networks. In Proceedings of the 2018 ACM on International Conference on Multimedia Retrieval (ICMR ’18). 
Association for Computing Machinery, New York, NY, USA, 37–45. DOI:https://doi.org/10.1145/3206025.3206044 Marcelino, Gonçalo Barreto Ferreira. A computational approach to the art of visual storytelling. Diss. 2018. Liu, Long, Lechao Yang, and Bin Zhu. "Sparse feature space representation: A unified framework for semi-supervised and domain adaptation learning." Knowledge-Based Systems 156 (2018): 43-61. Gornishka, Iva. "Interactive Search and Exploration in Social Multimedia Networks." (2018). Singh, Raahat Devender, and Naveen Aggarwal. "Video content authentication techniques: a comprehensive survey." Multimedia Systems 24.2 (2018): 211-240. Tao, Jianwen, Di Zhou, and Bin Zhu. "Multi-source adaptation embedding with feature selection by exploiting correlation information." Knowledge-Based Systems 143 (2018): 208-224. Abdulhussain, Sadiq H., et al. "Methods and challenges in shot boundary detection: a review." Entropy 20.4 (2018): 214. Ye, Guangnan. "Large-Scale Video Event Detection Using Deep Neural Networks." Applied Cloud Deep Semantic Recognition. Auerbach Publications, 2018. 1-23. Gong, Maoguo, et al. "Decomposition-Based Evolutionary Multiobjective Optimization to Self-Paced Learning." IEEE Transactions on Evolutionary Computation 23.2 (2018): 288-302. Mahapatra, Debabrata, Ragunathan Mariappan, and Vaibhav Rajan. "Automatic Hierarchical Table of Contents Generation for Educational Videos." Companion Proceedings of the The Web Conference 2018. International World Wide Web Conferences Steering Committee, 2018. Cirne, Marcos Vinicius Mussel, and Helio Pedrini. "VISCOM: A robust video summarization approach using color co-occurrence matrices." Multimedia Tools and Applications 77.1 (2018): 857-875. Zhao, Zhicheng, et al. "A unified framework with a benchmark dataset for surveillance event detection." Neurocomputing 278 (2018): 62-74. Bekhet, Saddam, and Amr Ahmed.
"An integrated signature-based framework for efficient visual similarity detection and measurement in video shots." ACM Transactions on Information Systems (TOIS) 36.4 (2018): 37. Ji, Hyesung, et al. "A semantic-based video scene segmentation using a deep neural network." Journal of Information Science (2018): 0165551518819964. Yao, Li, and Ying Qian. "Dt-3dresnet-lstm: An architecture for temporal activity recognition in videos." Pacific Rim Conference on Multimedia. Springer, Cham, 2018. Liu, Junqi, et al. "Discriminative self-adapted locality-sensitive sparse representation for video semantic analysis." Multimedia Tools and Applications 77.21 (2018): 29143-29162. Rouhi, Amir H., and James A. Thom. "Encoder settings impact on intra-prediction-based descriptors for video retrieval." Journal of Visual Communication and Image Representation 50 (2018): 263-269. Lokoč, Jakub, Tomáš Souček, and Gregor Kovalčík. "Using an interactive video retrieval tool for lifelog data." Proceedings of the 2018 ACM Workshop on The Lifelog Search Challenge. ACM, 2018. Benuwa, Ben-Bright, et al. "Sparsity Based Locality-Sensitive Discriminative Dictionary Learning for Video Semantic Analysis." Mathematical Problems in Engineering 2018 (2018). Wu, Lifang, et al. "Shot Boundary Detection with Spatial-Temporal Convolutional Neural Networks." Chinese Conference on Pattern Recognition and Computer Vision (PRCV). Springer, Cham, 2018. Xie, Wenlong, et al. "Event patches: Mining effective parts for event detection and understanding." Signal Processing 149 (2018): 82-87. Tang, Shitao, et al. "Fast Video Shot Transition Localization with Deep Structured Models." Asian Conference on Computer Vision. Springer, Cham, 2018. Kletz, Sabrina, Andreas Leibetseder, and Klaus Schoeffmann. "Evaluation of Visual Content Descriptors for Supporting Ad-Hoc Video Search Tasks at the Video Browser Showdown." International Conference on Multimedia Modeling. Springer, Cham, 2018. Rouhi, A.
"Near-duplicate video similarity detection in H.264/AVC compressed domain." (2018). Hirakawa, Koji, et al. "Ad-hoc Video Search Improved by the Word Sense Filtering of Query Terms." Asia Information Retrieval Symposium. Springer, Cham, 2018. Sa, Qila, and Zhihui Wang. "Automatic video shot boundary detection using k-means clustering and improved adaptive dual threshold comparison." MIPPR 2017: Remote Sensing Image Processing, Geographic Information Systems, and Other Applications. Vol. 10611. International Society for Optics and Photonics, 2018. Graham, Y., Awad, G., & Smeaton, A. (2018). Evaluation of automatic video captioning using direct assessment. PloS one, 13(9), e0202789. Dong, Jianfeng, et al. "Dual dense encoding for zero-example video retrieval." arXiv preprint arXiv:1809.06181 (2018). Jiang, D., & Kim, J. (2018). Video Searching and Fingerprint Detection by Using the Image Query and PlaceNet-Based Shot Boundary Detection Method. Applied Sciences, 8(10), 1735. Leibetseder, Andreas, Sabrina Kletz, and Klaus Schoeffmann. "Sketch-based similarity search for collaborative feature maps." International Conference on Multimedia Modeling. Springer, Cham, 2018. Girbau, A., Hinami, R., & Satoh, S. I. (2018, April). Tracked Instance Search. In 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 1663-1667). IEEE. Liao, Kaiyang, et al. "IR Feature Embedded BOF Indexing Method for Near-Duplicate Video Retrieval." IEEE Transactions on Circuits and Systems for Video Technology (2018). Vagliano, Iacopo, et al. "Open Innovation in the Big Data Era With the MOVING Platform." IEEE MultiMedia 25.3 (2018): 8-21. Xie, Wenlong, et al. "Discovering Latent Discriminative Patterns for Multi-Mode Event Representation." IEEE Transactions on Multimedia (2018). Thomas, Sinnu Susan, Sumana Gupta, and Venkatesh K. Subramanian. "Context Driven Optimized Perceptual Video Summarization and Retrieval."
IEEE Transactions on Circuits and Systems for Video Technology (2018). Gao, Zhanning, et al. "Video Imprint." IEEE transactions on pattern analysis and machine intelligence (2018). Huang, Shao, et al. "Egocentric Temporal Action Proposals." IEEE Transactions on Image Processing 27.2 (2018): 764-777. Lei, Jie, et al. "Action Parsing Driven Video Summarization Based on Reinforcement Learning." IEEE Transactions on Circuits and Systems for Video Technology (2018). Schoeffmann, Klaus, et al. "How Experts Search Different than Novices–An Evaluation of the Divexplore Video Retrieval System at Video Browser Showdown 2018." 2018 IEEE International Conference on Multimedia & Expo Workshops (ICMEW). IEEE, 2018. Zhang, Hao, and Chong-Wah Ngo. "A Fine Granularity Object-level Representation for Event Detection and Recounting." IEEE Transactions on Multimedia (2018). Chen, Zhixiang, et al. "Nonlinear structural hashing for scalable video search." IEEE Transactions on Circuits and Systems for Video Technology 28.6 (2018): 1421-1433. Lan, Shuyue, et al. "FFNet: Video Fast-Forwarding via Reinforcement Learning." Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2018. Gialampoukidis, Ilias, et al. "Fusion of Compound Queries with Multiple Modalities for Known Item Video Search." 2018 IEEE 13th Image, Video, and Multidimensional Signal Processing Workshop (IVMSP). IEEE, 2018. Gao, Wenhui, et al. "MMH: Multi-Modal Hash for Instant Mobile Video Search." 2018 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR). IEEE, 2018. Dilawari, Aniqa, et al. "Natural language description of video streams using task-specific feature encoding." IEEE Access 6 (2018): 16639-16645. Zhicheng Zhao, Rui Xiang, and Fei Su. 2018. Complex event detection via attention-based video representation and classification. Multimedia Tools Appl. 77, 3 (February 2018), 3209-3227. DOI: https://doi.org/10.1007/s11042-017-5058-2 Yassine Himeur and Karima Ait Sadi. 2018. 
Robust video copy detection based on ring decomposition based binarized statistical image features and invariant color descriptor (RBSIF-ICD). Multimedia Tools Appl. 77, 13 (July 2018), 17309-17331. DOI: https://doi.org/10.1007/s11042-017-5307-4 Huan Liu, Qinghua Zheng, Zhihui Li, Tao Qin, and Lei Zhu. 2018. An efficient multi-feature SVM solver for complex event detection. Multimedia Tools Appl. 77, 3 (February 2018), 3509-3532. DOI: https://doi.org/10.1007/s11042-017-5166-z Rashmi B S and Nagendraswamy H S. 2018. Effective Video Shot Boundary Detection and Keyframe Selection using Soft Computing Techniques. Int. J. Comput. Vis. Image Process. 8, 2 (April 2018), 27-48. DOI: https://doi.org/10.4018/IJCVIP.2018040102 Jaydeb Mondal, Malay Kumar Kundu, Sudeb Das, and Manish Chowdhury. 2018. Video shot boundary detection using multiscale geometric analysis of nsct and least squares support vector machine. Multimedia Tools Appl. 77, 7 (April 2018), 8139-8161. DOI: https://doi.org/10.1007/s11042-017-4707-9 Nitin J. Janwe and Kishor K. Bhoyar. 2018. Multi-label semantic concept detection in videos using fusion of asymmetrically trained deep convolutional neural networks and foreground driven concept co-occurrence matrix. Applied Intelligence 48, 8 (August 2018), 2047-2066. DOI: https://doi.org/10.1007/s10489-017-1033-x Maaike H.T. de Boer. 2018. Semantic Mapping in Video Retrieval. SIGIR Forum 51, 3 (February 2018), 161-162. DOI: https://doi.org/10.1145/3190580.3190606 Tomokazu Murakami. 2018. Industrial Applications of Image Recognition and Retrieval Technologies for Public Safety and IT Services. In Proceedings of the 2018 ACM on International Conference on Multimedia Retrieval (ICMR '18). ACM, New York, NY, USA, 4-4. DOI: https://doi.org/10.1145/3206025.3210492 Klaus Schoeffmann, Werner Bailer, Cathal Gurrin, George Awad, and Jakub Lokoč. 2018. Interactive Video Search: Where is the User in the Age of Deep Learning?. 
In Proceedings of the 26th ACM international conference on Multimedia (MM '18). ACM, New York, NY, USA, 2101-2103. DOI: https://doi.org/10.1145/3240508.3241473 Nakamasa Inoue and Koichi Shinoda. 2018. Few-Shot Adaptation for Multimedia Semantic Indexing. In Proceedings of the 26th ACM international conference on Multimedia (MM '18). ACM, New York, NY, USA, 1110-1118. DOI: https://doi.org/10.1145/3240508.3240592 Ueki, Kazuya. "Latent Concept Extraction for Zero-Shot Video Retrieval." 2018 International Conference on Image and Vision Computing New Zealand (IVCNZ). IEEE, 2018. Xu, Zijun, et al. "S2L: Single-Streamline For Complex Video Event Detection." 2018 IEEE International Conference on Multimedia & Expo Workshops (ICMEW). IEEE, 2018. Ueki, Kazuya, et al. "Fine-grained Video Retrieval using Query Phrases—Waseda_Meisei TRECVID 2017 AVS System—." 2018 24th International Conference on Pattern Recognition (ICPR). IEEE, 2018. Budnik, Mateusz, Mikail Demirdelen, and Guillaume Gravier. "A study on multimodal video hyperlinking with visual aggregation." 2018 IEEE International Conference on Multimedia and Expo (ICME). IEEE, 2018. Kar, Tejaswini, and Priyadarshi Kanungo. "Motion and illumination defiant cut detection based on Weber features." IET Image Processing (2018). V. Vukotić, C. Raymond and G. Gravier, "A Crossmodal Approach to Multimodal Fusion in Video Hyperlinking," in IEEE MultiMedia, vol. 25, no. 2, pp. 11-23, Apr.-Jun. 2018. doi: 10.1109/MMUL.2018.023121161 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8424826&isnumber=8424760 H. Li, J. Zhu, C. Ma, J. Zhang and C. Zong, "Read, Watch, Listen and Summarize: Multi-modal Summarization for Asynchronous Text, Image, Audio and Video," in IEEE Transactions on Knowledge and Data Engineering. doi: 10.1109/TKDE.2018.2848260 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8387512&isnumber=4358933 F. Markatopoulou, V. Mezaris and I. 
Patras, "Implicit and Explicit Concept Relations in Deep Neural Networks for Multi-Label Video/Image Annotation," in IEEE Transactions on Circuits and Systems for Video Technology. doi: 10.1109/TCSVT.2018.2848458 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8387768&isnumber=4358651 J. Lokoc, W. Bailer, K. Schoeffmann, B. Muenzer and G. Awad, "On influential trends in interactive video retrieval: Video Browser Showdown 2015-2017," in IEEE Transactions on Multimedia. doi: 10.1109/TMM.2018.2830110 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8352047&isnumber=4456689 J. Dong, X. Li and C. G. M. Snoek, "Predicting Visual Features from Text for Image and Video Caption Retrieval," in IEEE Transactions on Multimedia. doi: 10.1109/TMM.2018.2832602 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8353472&isnumber=4456689 R. Panda, S. K. Kuanar and A. S. Chowdhury, "Nyström Approximated Temporally Constrained Multisimilarity Spectral Clustering Approach for Movie Scene Detection," in IEEE Transactions on Cybernetics, vol. 48, no. 3, pp. 836-847, March 2018. doi: 10.1109/TCYB.2017.2657692 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7845652&isnumber=8283862 -------------------------------------------------------------------- 2017 (97) -------------------------------------------------------------------- X. Luan, Y. Xie, Y. Guo, J. He, L. Zhang and X. Zhang, "A fast near-duplicate keyframe detection method based on local features," 2017 IEEE 17th International Conference on Communication Technology (ICCT), Chengdu, China, 2017, pp. 1544-1547. doi: 10.1109/ICCT.2017.8359890 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8359890&isnumber=8359466 M. Hmayda, R. Ejbali and M. Zaied, "Program Classification in a Stream TV Using Deep Learning," 2017 18th International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT), Taipei, Taiwan, 2017, pp. 123-126. 
doi: 10.1109/PDCAT.2017.00029 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8327077&isnumber=8326788 T. Kar and P. Kanungo, "Video shot boundary detection based on Hilbert and wavelet transform," 2017 2nd International Conference on Man and Machine Interfacing (MAMI), Bhubaneswar, India, 2017, pp. 1-6. doi: 10.1109/MAMI.2017.8307865 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8307865&isnumber=8307855 K. Zhou, Y. Zhu and Y. Zhao, "A spatio-temporal deep architecture for surveillance event detection based on ConvLSTM," 2017 IEEE Visual Communications and Image Processing (VCIP), St. Petersburg, FL, USA, 2017, pp. 1-4. doi: 10.1109/VCIP.2017.8305063 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8305063&isnumber=8305018 S. Keshavarz, I. Saleemi and G. Atia, "Exploiting probabilistic relationships between action concepts for complex event classification," 2017 IEEE International Conference on Image Processing (ICIP), Beijing, China, 2017, pp. 1572-1576. doi: 10.1109/ICIP.2017.8296546 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8296546&isnumber=8296222 Ke Wang, Jiayong Liu, and Daniel González. 2017. Domain transfer multi-instance dictionary learning. Neural Comput. Appl. 28, 1 (January 2017), 983-992. DOI: https://doi.org/10.1007/s00521-016-2406-5 Yilin Yan, Min Chen, Saad Sadiq, and Mei-Ling Shyu. 2017. Efficient Imbalanced Multimedia Concept Retrieval by Deep Learning on Spark Clusters. Int. J. Multimed. Data Eng. Manag. 8, 1 (January 2017), 1-20. DOI: https://doi.org/10.4018/IJMDEM.2017010101 Jinlai Lv and Huiru Bai. 2017. Research on Shot Detection Algorithm of Self-adaptive Dual Thresholds Based on Multi-feature Fusion. In LNCS on Transactions on Edutainment XIII - Volume 10092, Zhigeng Pan, Adrian David Cheok, Wolfgang Müller, and Mingmin Zhang (Eds.), Vol. 10092. Springer-Verlag New York, Inc., New York, NY, USA, 247-261. 
DOI: https://doi.org/10.1007/978-3-662-54395-5_21 Stefanos Vrochidis, Ioannis Patras, and Ioannis Kompatsiaris. 2017. Gaze movement-driven random forests for query clustering in automatic video annotation. Multimedia Tools Appl. 76, 2 (January 2017), 2861-2889. DOI: https://doi.org/10.1007/s11042-015-3221-1 Hao Song, Xinxiao Wu, Wei Liang, and Yunde Jia. 2017. Recognizing key segments of videos for video annotation by learning from web image sets. Multimedia Tools Appl. 76, 5 (March 2017), 6111-6126. DOI: https://doi.org/10.1007/s11042-016-3253-1 Wei-Xin Li and Nuno Vasconcelos. 2017. Complex Activity Recognition Via Attribute Dynamics. Int. J. Comput. Vision 122, 2 (April 2017), 334-370. DOI: https://doi.org/10.1007/s11263-016-0918-1 Jiyun Fan, Shangbo Zhou, and Muhammad Abubakar Siddique. 2017. Fuzzy color distribution chart -based shot boundary detection. Multimedia Tools Appl. 76, 7 (April 2017), 10169-10190. DOI: https://doi.org/10.1007/s11042-016-3604-y Muhammad Usman Khan and Yoshihiko Gotoh. 2017. Generating natural language tags for video information management. Mach. Vision Appl. 28, 3-4 (May 2017), 243-265. DOI: https://doi.org/10.1007/s00138-017-0825-7 Mateusz Budnik, Efrain-Leonardo Gutierrez-Gomez, Bahjat Safadi, Denis Pellerin, and Georges Quénot. 2017. Learned features versus engineered features for multimedia indexing. Multimedia Tools Appl. 76, 9 (May 2017), 11941-11958. DOI: https://doi.org/10.1007/s11042-016-4240-2 Peng Wang, Lifeng Sun, Shiqiang Yang, and Alan F. Smeaton. 2017. Training-free indexing refinement for visual media via multi-semantics. Neurocomput. 236, C (May 2017), 39-47. DOI: https://doi.org/10.1016/j.neucom.2016.08.107 Zhenxing Zhang, Rami Albatal, Cathal Gurrin, and Alan F. Smeaton. 2017. Enhancing instance search with weak geometric correlation consistency. Neurocomput. 236, C (May 2017), 164-172. 
DOI: https://doi.org/10.1016/j.neucom.2016.09.104 Petra Galuščáková, Michal Batko, Jan Čech, Jiří Matas, David Novák, and Pavel Pecina. 2017. Visual Descriptors in Methods for Video Hyperlinking. In Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval (ICMR '17). ACM, New York, NY, USA, 294-300. DOI: https://doi.org/10.1145/3078971.3079026 Damianos Galanopoulos, Foteini Markatopoulou, Vasileios Mezaris, and Ioannis Patras. 2017. Concept Language Models and Event-based Concept Number Selection for Zero-example Event Detection. In Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval (ICMR '17). ACM, New York, NY, USA, 397-401. DOI: https://doi.org/10.1145/3078971.3079043 Chrysa Collyda, Evlampios Apostolidis, Alexandros Pournaras, Foteini Markatopoulou, Vasileios Mezaris, and Ioannis Patras. 2017. VideoAnalysis4ALL: An On-line Tool for the Automatic Fragmentation and Concept-based Annotation, and the Interactive Exploration of Videos. In Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval (ICMR '17). ACM, New York, NY, USA, 470-474. DOI: https://doi.org/10.1145/3078971.3079015 Junwei Liang, Lu Jiang, Deyu Meng, and Alexander Hauptmann. 2017. Leveraging Multi-modal Prior Knowledge for Large-scale Concept Learning in Noisy Web Data. In Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval (ICMR '17). ACM, New York, NY, USA, 32-40. DOI: https://doi.org/10.1145/3078971.3079003 Foteini Markatopoulou, Damianos Galanopoulos, Vasileios Mezaris, and Ioannis Patras. 2017. Query and Keyframe Representations for Ad-hoc Video Search. In Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval (ICMR '17). ACM, New York, NY, USA, 407-411. DOI: https://doi.org/10.1145/3078971.3079041 Omar Seddati, Stéphane Dupont, and Saïd Mahmoudi. 2017. Quadruplet Networks for Sketch-Based Image Retrieval.
In Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval (ICMR '17). ACM, New York, NY, USA, 184-191. DOI: https://doi.org/10.1145/3078971.3078985 Zhi-Qi Cheng, Hao Zhang, Xiao Wu, and Chong-Wah Ngo. 2017. On the Selection of Anchors and Targets for Video Hyperlinking. In Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval (ICMR '17). ACM, New York, NY, USA, 287-293. DOI: https://doi.org/10.1145/3078971.3079025 Luca Rossetto, Ivan Giangreco, Claudiu Tănase, and Heiko Schuldt. 2017. Multimodal Video Retrieval with the 2017 IMOTION System. In Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval (ICMR '17). ACM, New York, NY, USA, 457-460. DOI: https://doi.org/10.1145/3078971.3079012 Werner Bailer. 2017. Efficient Approximate Medoids of Temporal Sequences. In Proceedings of the 15th International Workshop on Content-Based Multimedia Indexing (CBMI '17). ACM, New York, NY, USA, Article 3, 6 pages. DOI: https://doi.org/10.1145/3095713.3095717 Xun Xu, Timothy Hospedales, and Shaogang Gong. 2017. Transductive Zero-Shot Action Recognition by Word-Vector Embedding. Int. J. Comput. Vision 123, 3 (July 2017), 309-333. DOI: https://doi.org/10.1007/s11263-016-0983-5 Tiziano Portenier, Qiyang Hu, Paolo Favaro, and Matthias Zwicker. 2017. SmartSketcher: sketch-based image retrieval with dynamic semantic re-ranking. In Proceedings of the Symposium on Sketch-Based Interfaces and Modeling (SBIM '17), Stephen N. Spencer (Ed.). ACM, New York, NY, USA, Article 1, 12 pages. DOI: https://doi.org/10.1145/3092907.3092910 Bendraou Youssef, Essannouni Fedwa, Aboutajdine Driss, and Salam Ahmed. 2017. Shot boundary detection via adaptive low rank and svd-updating. Comput. Vis. Image Underst. 161, C (August 2017), 20-28. DOI: https://doi.org/10.1016/j.cviu.2017.06.003 Huan Liu, Qinghua Zheng, Minnan Luo, Dingwen Zhang, Xiaojun Chang, and Cheng Deng. 2017. How unlabeled web videos help complex event detection?. 
In Proceedings of the 26th International Joint Conference on Artificial Intelligence (IJCAI'17), Carles Sierra (Ed.). AAAI Press 4040-4046 Jia He, Changying Du, Changde Du, Fuzhen Zhuang, Qing He, and Guoping Long. 2017. Nonlinear maximum margin multi-view learning with adaptive kernel. In Proceedings of the 26th International Joint Conference on Artificial Intelligence (IJCAI'17), Carles Sierra (Ed.). AAAI Press 1830-1836 Jingya Wang, Xiatian Zhu, and Shaogang Gong. 2017. Discovering visual concept structure with sparse and incomplete tags. Artif. Intell. 250, C (September 2017), 16-36. DOI: https://doi.org/10.1016/j.artint.2017.05.002 Linchao Zhu, Zhongwen Xu, Yi Yang, and Alexander G. Hauptmann. 2017. Uncovering the Temporal Context for Video Question Answering. Int. J. Comput. Vision 124, 3 (September 2017), 409-421. DOI: https://doi.org/10.1007/s11263-017-1033-7 Jeonghwan Gwak. 2017. Multi-object tracking through learning relational appearance features and motion patterns. Comput. Vis. Image Underst. 162, C (September 2017), 103-115. DOI: https://doi.org/10.1016/j.cviu.2017.05.010 Maaike H. T. De Boer, Yi-Jie Lu, Hao Zhang, Klamer Schutte, Chong-Wah Ngo, and Wessel Kraaij. 2017. Semantic Reasoning in Zero Example Video Event Retrieval. ACM Trans. Multimedia Comput. Commun. Appl. 13, 4, Article 60 (October 2017), 17 pages. DOI: https://doi.org/10.1145/3131288 Ke Xia, Yuqing Ma, Xianglong Liu, Yadong Mu, and Li Liu. 2017. Temporal Binary Coding for Large-Scale Video Search. In Proceedings of the 2017 ACM on Multimedia Conference (MM '17). ACM, New York, NY, USA, 333-341. DOI: https://doi.org/10.1145/3123266.3123273 Jianfeng Dong. 2017. Cross-media Relevance Computation for Multimedia Retrieval. In Proceedings of the 2017 ACM on Multimedia Conference (MM '17). ACM, New York, NY, USA, 831-835. DOI: https://doi.org/10.1145/3123266.3123963 Spencer Cappallo and Cees G.M. Snoek. 2017. Future-Supervised Retrieval of Unseen Queries for Live Video. 
In Proceedings of the 2017 ACM on Multimedia Conference (MM '17). ACM, New York, NY, USA, 28-36. DOI: https://doi.org/10.1145/3123266.3123437 Jiamei Lan, Jun Chen, Zheng Wang, Chao Liang, and Shin'ichi Satoh. 2017. P-S Instance Retrieval via Early Elimination and Late Expansion. In Proceedings of the Workshop on Visual Analysis in Smart and Connected Communities (VSCC '17). ACM, New York, NY, USA, 41-49. DOI: https://doi.org/10.1145/3132734.3136609 Qin Jin, Shizhe Chen, Jia Chen, and Alexander Hauptmann. 2017. Knowing Yourself: Improving Video Caption via In-depth Recap. In Proceedings of the 2017 ACM on Multimedia Conference (MM '17). ACM, New York, NY, USA, 1906-1911. DOI: https://doi.org/10.1145/3123266.3127901 Nikolaos Gkalelis and Vasileios Mezaris. 2017. Incremental Accelerated Kernel Discriminant Analysis. In Proceedings of the 2017 ACM on Multimedia Conference (MM '17). ACM, New York, NY, USA, 1575-1583. DOI: https://doi.org/10.1145/3123266.3123401 Stevan Rudinac, Iva Gornishka, and Marcel Worring. 2017. Multimodal Classification of Violent Online Political Extremism Content with Graph Convolutional Networks. In Proceedings of the on Thematic Workshops of ACM Multimedia 2017 (Thematic Workshops '17). ACM, New York, NY, USA, 245-252. DOI: https://doi.org/10.1145/3126686.3126776 Zobeida Jezabel Guzman-Zavaleta, Claudia Feregrino-Uribe, Miguel Morales-Sandoval, and Alejandra Menendez-Ortiz. 2017. A robust and low-cost video fingerprint extraction method for copy detection. Multimedia Tools Appl. 76, 22 (November 2017), 24143-24163. DOI: https://doi.org/10.1007/s11042-016-4168-6 Ilias Gialampoukidis, Anastasia Moumtzidou, Dimitris Liparas, Theodora Tsikrika, Stefanos Vrochidis, and Ioannis Kompatsiaris. 2017. Multimedia retrieval based on non-linear graph-based fusion and partial least squares regression. Multimedia Tools Appl. 76, 21 (November 2017), 22383-22403. 
DOI: https://doi.org/10.1007/s11042-017-4797-4 Maaike Boer, Geert Pingen, Douwe Knook, Klamer Schutte, and Wessel Kraaij. 2017. Improving video event retrieval by user feedback. Multimedia Tools Appl. 76, 21 (November 2017), 22361-22381. DOI: https://doi.org/10.1007/s11042-017-4798-3 Hao Liu, Qingjie Zhao, Hao Wang, Peng Lv, and Yanming Chen. 2017. An image-based near-duplicate video retrieval and localization using improved Edit distance. Multimedia Tools Appl. 76, 22 (November 2017), 24435-24456. DOI: https://doi.org/10.1007/s11042-016-4176-6 A. Kar, P. Mavin, Y. Ghaturle and V. M., "What Makes a Video Memorable?," 2017 IEEE International Conference on Data Science and Advanced Analytics (DSAA), Tokyo, Japan, 2017, pp. 373-381. doi: 10.1109/DSAA.2017.37 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8259797&isnumber=8259747 S. Shekhar, D. Singal, H. Singh, M. Kedia and A. Shetty, "Show and Recall: Learning What Makes Videos Memorable," 2017 IEEE International Conference on Computer Vision Workshops (ICCVW), Venice, Italy, 2017, pp. 2730-2739. doi: 10.1109/ICCVW.2017.321 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8265533&isnumber=8265191 W. Liu and H. Ma, "Hybrid Semantic Concept Temporal Pooling for Large-Scale Video Event Analysis," in Chinese Journal of Electronics, vol. 26, no. 6, pp. 1125-1131, 11 2017. doi: 10.1049/cje.2017.09.010 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8128889&isnumber=8128699 P. Goyal, Z. Hu, X. Liang, C. Wang, E. P. Xing and C. Mellon, "Nonparametric Variational Auto-Encoders for Hierarchical Representation Learning," 2017 IEEE International Conference on Computer Vision (ICCV), Venice, Italy, 2017, pp. 5104-5112. doi: 10.1109/ICCV.2017.545 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8237807&isnumber=8237262 A. Sasithradevi, S. M. M. Roomi and G. Maragatham, "Content based video retrieval via object based approach," TENCON 2017 - 2017 IEEE Region 10 Conference, Penang, Malaysia, 2017, pp. 
781-787. doi: 10.1109/TENCON.2017.8227965 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8227965&isnumber=8227816 H. T. Shen, C. Li, J. Cao, Z. Huang and L. Zhu, "Leveraging Weak Semantic Relevance for Complex Video Event Classification," 2017 IEEE International Conference on Computer Vision (ICCV), Venice, Italy, 2017, pp. 3667-3676. doi: 10.1109/ICCV.2017.394 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8237656&isnumber=8237262 R. Panda, A. Das, Z. Wu, J. Ernst and A. K. Roy-Chowdhury, "Weakly Supervised Summarization of Web Videos," 2017 IEEE International Conference on Computer Vision (ICCV), Venice, Italy, 2017, pp. 3677-3686. doi: 10.1109/ICCV.2017.395 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8237657&isnumber=8237262 J. C. SanMiguel and A. Cavallaro, "Energy Consumption Models for Smart Camera Networks," in IEEE Transactions on Circuits and Systems for Video Technology, vol. 27, no. 12, pp. 2661-2674, Dec. 2017. doi: 10.1109/TCSVT.2016.2593598 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7517353&isnumber=8186326 M. Liu, C. Xu, Y. Luo, C. Xu, Y. Wen and D. Tao, "Cost-Sensitive Feature Selection by Optimizing F-measures," in IEEE Transactions on Image Processing, vol. PP, no. 99, pp. 1-1. doi: 10.1109/TIP.2017.2781298 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8170306&isnumber=4358840 A. C. S. e Santos and H. Pedrini, "Shot boundary detection for video temporal segmentation based on the weber local descriptor," 2017 IEEE International Conference on Systems, Man, and Cybernetics (SMC), Banff, AB, Canada, 2017, pp. 1310-1315. doi: 10.1109/SMC.2017.8122794 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8122794&isnumber=8122565 Z. Gao et al., "ER3: A Unified Framework for Event Retrieval, Recognition and Recounting," 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 2017, pp. 2107-2116. 
doi: 10.1109/CVPR.2017.227 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8099710&isnumber=8099483 N. Hussein, E. Gavves and A. W. M. Smeulders, "Unified Embedding and Metric Learning for Zero-Exemplar Event Detection," 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 2017, pp. 2087-2096. doi: 10.1109/CVPR.2017.225 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8099708&isnumber=8099483 S. Huang, W. Wang, S. He and R. W. H. Lau, "Egocentric Temporal Action Proposals," in IEEE Transactions on Image Processing, vol. 27, no. 2, pp. 764-777, Feb. 2018. doi: 10.1109/TIP.2017.2772904 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8105826&isnumber=8103362 L. Zhu, Z. Xu and Y. Yang, "Bidirectional Multirate Reconstruction for Temporal Modeling in Videos," 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 2017, pp. 1339-1348. doi: 10.1109/CVPR.2017.147 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8099630&isnumber=8099483 J. Hou, X. Wu, Y. Sun and Y. Jia, "Content-Attention Representation by Factorized Action-Scene Network for Action Recognition," in IEEE Transactions on Multimedia, vol. PP, no. 99, pp. 1-1. doi: 10.1109/TMM.2017.2771462 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8101020&isnumber=4456689 C. Tzelepis, V. Mezaris and I. Patras, "Linear Maximum Margin Classifier for Learning from Uncertain Data," in IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. PP, no. 99, pp. 1-1. doi: 10.1109/TPAMI.2017.2772235 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8103808&isnumber=4359286 N. Chesneau, K. Alahari and C. Schmid, "Learning from Web Videos for Event Classification," in IEEE Transactions on Circuits and Systems for Video Technology, vol. PP, no. 99, pp. 1-1. doi: 10.1109/TCSVT.2017.2764624 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8076905&isnumber=4358651 B. Selbes and M. 
Sert, "Multimodal vehicle type classification using convolutional neural network and statistical representations of MFCC," 2017 14th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS), Lecce, 2017, pp. 1-6. doi: 10.1109/AVSS.2017.8078514 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8078514&isnumber=8078458 S. S. Thomas, S. Gupta and V. K. Subramanian, "Smart surveillance based on video summarization," 2017 IEEE Region 10 Symposium (TENSYMP), Cochin, 2017, pp. 1-5. doi: 10.1109/TENCONSpring.2017.8070003 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8070003&isnumber=8069969 H. Song, X. Wu, W. Yu and Y. Jia, "Extracting Key Segments of Videos for Event Detection by Learning from Web Sources," in IEEE Transactions on Multimedia, vol. PP, no. 99, pp. 1-1. doi: 10.1109/TMM.2017.2763322 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8068288&isnumber=4456689 X. Nie, Weizhen Jing, Lin Yuan Ma, Chaoran Cui and Y. Yin, "Two-layer video fingerprinting strategy for near-duplicate video detection," 2017 IEEE International Conference on Multimedia & Expo Workshops (ICMEW), Hong Kong, 2017, pp. 555-560. doi: 10.1109/ICMEW.2017.8026322 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8026322&isnumber=8026209 A. Habibian, T. Mensink and C. G. M. Snoek, "Video2vec Embeddings Recognize Events When Examples Are Scarce," in IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 39, no. 10, pp. 2089-2103, Oct. 1 2017. doi: 10.1109/TPAMI.2016.2627563 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7740886&isnumber=8024097 N. Putpuek, N. Cooharojananone and S. Satoh, "A modification of retake detection using simple signature and LCS algorithm," 2017 18th IEEE/ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD), Kanazawa, 2017, pp. 257-261. 
doi: 10.1109/SNPD.2017.8022730 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8022730&isnumber=8022642 E. Boyaci and M. Sert, "Video classification based on ConvNet collaboration and feature selection," 2017 25th Signal Processing and Communications Applications Conference (SIU), Antalya, 2017, pp. 1-4. doi: 10.1109/SIU.2017.7960515 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7960515&isnumber=7960135 L. Yu; Z. Huang; F. Shen; J. Song; H. T. Shen; X. Zhou, "Bilinear Optimized Product Quantization for Scalable Visual Content Analysis," in IEEE Transactions on Image Processing , vol.PP, no.99, pp.1-1 doi: 10.1109/TIP.2017.2722224 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7964737&isnumber=4358840 B. C. Chen et al., "Scalable Face Track Retrieval in Video Archives Using Bag-of-Faces Sparse Representation," in IEEE Transactions on Circuits and Systems for Video Technology, vol. 27, no. 7, pp. 1595-1603, July 2017. doi: 10.1109/TCSVT.2016.2538520 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7426412&isnumber=7963904 B. Selbes and M. Sert, "Multimodal video concept classification based on convolutional neural network and audio feature combination," 2017 25th Signal Processing and Communications Applications Conference (SIU), Antalya, 2017, pp. 1-4. doi: 10.1109/SIU.2017.7960723 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7960723&isnumber=7960135 O. Khalid, J. C. SanMiguel and A. Cavallaro, "Multi-Tracker Partition Fusion," in IEEE Transactions on Circuits and Systems for Video Technology, vol. 27, no. 7, pp. 1527-1539, July 2017. doi: 10.1109/TCSVT.2016.2542699 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7434028&isnumber=7963904 D. Francis, P. Pidou, B. Merialdo and B. Huet, "Natural Language Access to Video Databases," 2017 IEEE Third International Conference on Multimedia Big Data (BigMM), Laguna Hills, CA, USA, 2017, pp. 78-81. 
doi: 10.1109/BigMM.2017.34 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7966721&isnumber=7966694 W. Lu et al., "Unsupervised Sequential Outlier Detection With Deep Architectures," in IEEE Transactions on Image Processing, vol. 26, no. 9, pp. 4321-4330, Sept. 2017. doi: 10.1109/TIP.2017.2713048 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7942034&isnumber=7956620 S. Tippaya, S. Sitjongsataporn, T. Tan, M. M. Khan and K. Chamnongthai, "Multi-Modal Visual Features-Based Video Shot Boundary Detection," in IEEE Access, vol. 5, no. , pp. 12563-12575, 2017. doi: 10.1109/ACCESS.2017.2717998 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7954599&isnumber=7859429 H. Li; Y. Huang; Z. Zhang, "An Improved Faster R-CNN for Same Object Retrieval," in IEEE Access , vol.PP, no.99, pp.1-1 doi: 10.1109/ACCESS.2017.2729943 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7986979&isnumber=6514899 Z. Ma; X. Chang; Z. Xu; N. Sebe; A. G. Hauptmann, "Joint Attributes and Event Analysis for Multimedia Event Detection," in IEEE Transactions on Neural Networks and Learning Systems , vol.PP, no.99, pp.1-10 doi: 10.1109/TNNLS.2017.2709308 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7949100&isnumber=6104215 J. Liang, L. Jiang and A. Hauptmann, "Temporal localization of audio events for conflict monitoring in social media," 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), New Orleans, LA, USA, 2017, pp. 1597-1601. doi: 10.1109/ICASSP.2017.7952426 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7952426&isnumber=7951776 S. Tippaya; S. Sitjongsataporn; T. Tan; M. M. Khan; K. Chamnongthai, "Multi-modal Visual Features Based Video Shot Boundary Detection," in IEEE Access , vol.PP, no.99, pp.1-1. doi: 10.1109/ACCESS.2017.2717998 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7954599&isnumber=6514899 C. Ouali, P. Dumouchel and V. 
Gupta, "Robust video fingerprints using positions of salient regions," 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), New Orleans, LA, USA, 2017, pp. 3041-3045. doi: 10.1109/ICASSP.2017.7952715 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7952715&isnumber=7951776 X. Han, B. Singh, V. I. Morariu and L. S. Davis, "VRFP: On-the-Fly Video Retrieval Using Web Images and Fast Fisher Vector Products," in IEEE Transactions on Multimedia, vol. 19, no. 7, pp. 1583-1595, July 2017. doi: 10.1109/TMM.2017.2671414 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7858779&isnumber=7949123 Z. Ma, X. Chang, Y. Yang, N. Sebe and A. G. Hauptmann, "The Many Shades of Negativity," in IEEE Transactions on Multimedia, vol. 19, no. 7, pp. 1558-1568, July 2017. doi: 10.1109/TMM.2017.2659221 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7835107&isnumber=7949123 Y. N. Li and X. P. Chen, "Robust and compact video descriptor learned by deep neural network," 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), New Orleans, LA, USA, 2017, pp. 2162-2166. doi: 10.1109/ICASSP.2017.7952539 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7952539&isnumber=7951776 W. Lu; Y. Cheng; C. Xiao; S. Chang; S. Huang; B. Liang; T. Huang, "Unsupervised Sequential Outlier Detection with Deep Architectures," in IEEE Transactions on Image Processing , vol.PP, no.99, pp.1-1 doi: 10.1109/TIP.2017.2713048 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7942034&isnumber=4358840 Y. Wang, W. Zhang, L. Wu, X. Lin and X. Zhao, "Unsupervised Metric Fusion Over Multiview Data by Graph Random Walk-Based Cross-View Diffusion," in IEEE Transactions on Neural Networks and Learning Systems, vol. 28, no. 1, pp. 57-70, Jan. 2017. doi: 10.1109/TNNLS.2015.2498149 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7348699&isnumber=7797565 X. Chang, Z. Ma, Y. Yang, Z. Zeng and A. G. 
Hauptmann, "Bi-Level Semantic Representation Analysis for Multimedia Event Detection," in IEEE Transactions on Cybernetics, vol. 47, no. 5, pp. 1180-1197, May 2017. doi: 10.1109/TCYB.2016.2539546 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7442559&isnumber=7898877 X. S. Wei, J. Wu and Z. H. Zhou, "Scalable Algorithms for Multi-Instance Learning," in IEEE Transactions on Neural Networks and Learning Systems, vol. 28, no. 4, pp. 975-987, April 2017. doi: 10.1109/TNNLS.2016.2519102 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7398097&isnumber=7879455 X. Nie, Y. Yin, J. Sun, J. Liu and C. Cui, "Comprehensive Feature-Based Robust Video Fingerprinting Using Tensor Model," in IEEE Transactions on Multimedia, vol. 19, no. 4, pp. 785-796, April 2017. doi: 10.1109/TMM.2016.2629758 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7745950&isnumber=7879458 K. Li; S. Li; S. Oh; Y. Fu, "Videography based Unconstrained Video Analysis," in IEEE Transactions on Image Processing , vol.PP, no.99, pp.1-1 doi: 10.1109/TIP.2017.2678800 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7872416&isnumber=4358840 Y. Xian, X. Rong, X. Yang and Y. Tian, "Evaluation of Low-Level Features for Real-World Surveillance Event Detection," in IEEE Transactions on Circuits and Systems for Video Technology, vol. 27, no. 3, pp. 624-634, March 2017. doi: 10.1109/TCSVT.2016.2589838 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7514916&isnumber=7870721 X. Han; B. Singh; V. Morariu; L. S. Davis, "VRFP: On-the-fly Video Retrieval using Web Images and Fast Fisher Vector Products," in IEEE Transactions on Multimedia , vol.PP, no.99, pp.1-1 doi: 10.1109/TMM.2017.2671414 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7858779&isnumber=4456689 C. Li; Z. Huang; Y. Yang; J. Cao; X. Sun; H. T. 
Shen, "Hierarchical Latent Concept Discovery for Video Event Detection," in IEEE Transactions on Image Processing , vol.PP, no.99, pp.1-1 doi: 10.1109/TIP.2017.2670782 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7858791&isnumber=4358840 Z. Ma; X. Chang; Y. Yang; N. Sebe; A. Hauptmann, "The Many Shades of Negativity," in IEEE Transactions on Multimedia , vol.PP, no.99, pp.1-1 doi: 10.1109/TMM.2017.2659221 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7835107&isnumber=4456689 D. Zhang; J. Han; L. Jiang; S. Ye; X. Chang, "Revealing Event Saliency in Unconstrained Video Collection," in IEEE Transactions on Image Processing , vol.PP, no.99, pp.1-1 doi: 10.1109/TIP.2017.2658957 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7835130&isnumber=4358840 C. L. Chou, H. T. Chen and S. Y. Lee, "Multimodal Video-to-Near-Scene Annotation," in IEEE Transactions on Multimedia, vol. 19, no. 2, pp. 354-366, Feb. 2017. doi: 10.1109/TMM.2016.2614426 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7579212&isnumber=7820230 -------------------------------------------------------------------- 2016 (119) -------------------------------------------------------------------- Youxian Zheng and Yuan Zhang, "Abrupt shot boundary detection with combined features and SVM," 2016 2nd IEEE International Conference on Computer and Communications (ICCC), Chengdu, China, 2016, pp. 409-413. doi: 10.1109/CompComm.2016.7924733 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7924733&isnumber=7924647 C. Pingping, Y. Guan, X. Ding and Z.
Yu, "Shot boundary detection with sparse presentation," 2016 IEEE 13th International Conference on Signal Processing (ICSP), Chengdu, China, 2016, pp. 900-904. doi: 10.1109/ICSP.2016.7877960 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7877960&isnumber=7877780 A. Dandashi, J. Aljaam and S. Foufou, "Audio-Visual Video Classification System Design: For Arabic News Domain," 2016 International Conference on Computational Science and Computational Intelligence (CSCI), Las Vegas, NV, USA, 2016, pp. 745-751. doi: 10.1109/CSCI.2016.0145 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7881438&isnumber=7881293 A. Mazaheri, B. Gong and M. Shah, "Learning a Multi-concept Video Retrieval Model with Multiple Latent Variables," 2016 IEEE International Symposium on Multimedia (ISM), San Jose, CA, USA, 2016, pp. 615-620. doi: 10.1109/ISM.2016.0132 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7823699&isnumber=7823367 N. Katayama, H. Mo and S. Satoh, "Unsupervised Estimation of Video Continuity Model from Large-Scale Video Archives and Its Application to Shot Boundary Detection," 2016 IEEE International Symposium on Multimedia (ISM), San Jose, CA, USA, 2016, pp. 52-59. doi: 10.1109/ISM.2016.0019 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7823586&isnumber=7823367 J. Cao, L. Yu, M. Chen and X. Cui, "A Key Frame Selection Algorithm Based on Sliding Window and Image Features," 2016 IEEE 22nd International Conference on Parallel and Distributed Systems (ICPADS), Wuhan, China, 2016, pp. 956-962. doi: 10.1109/ICPADS.2016.0128 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7823843&isnumber=7823715 Y. Yan and M. L. Shyu, "Enhancing Rare Class Mining in Multimedia Big Data by Concept Correlation," 2016 IEEE International Symposium on Multimedia (ISM), San Jose, CA, USA, 2016, pp. 281-286. doi: 10.1109/ISM.2016.0062 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7823629&isnumber=7823367 J. Xu, L. Song and R. 
Xie, "Shot boundary detection using convolutional neural networks," 2016 Visual Communications and Image Processing (VCIP), Chengdu, China, 2016, pp. 1-4. doi: 10.1109/VCIP.2016.7805554 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7805554&isnumber=7805413 D. O. Gorodnichy, D. Bissessar, E. Granger and R. Laganiére, "Recognizing People and Their Activities in Surveillance Video: Technology State of Readiness and Roadmap," 2016 13th Conference on Computer and Robot Vision (CRV), Victoria, BC, Canada, 2016, pp. 250-259. doi: 10.1109/CRV.2016.43. http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7801529&isnumber=7801481 B. Miller and S. McCloskey, "Metric Feature Indexing for Interactive Multimedia Search," 2016 13th Conference on Computer and Robot Vision (CRV), Victoria, BC, Canada, 2016, pp. 109-115. doi: 10.1109/CRV.2016.22. http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7801510&isnumber=7801481 Dalton Meitei Thounaojam, Thongam Khelchandra, Kh. Manglem Singh, and Sudipta Roy. 2016. A Genetic Algorithm and Fuzzy Logic Approach for Video Shot Boundary Detection. Intell. Neuroscience 2016 (March 2016), 14-. DOI: http://dx.doi.org/10.1155/2016/8469428 JianWen Tao, Wenjun Hu, and Shiting Wen. 2016. Multi-source adaptation joint kernel sparse representation for visual classification. Neural Netw. 76, C (April 2016), 135-151. DOI: http://dx.doi.org/10.1016/j.neunet.2016.01.008 Yanan Liu, Xiaoqing Feng, and Zhiguang Zhou. 2016. Multimodal video classification with stacked contractive autoencoders. Signal Process. 120, C (March 2016), 761-766. DOI=http://dx.doi.org/10.1016/j.sigpro.2015.01.001 Mohammad A. Al-Jarrah and Faruq A. Al-Omari. 2016. Fast Video Shot Boundary Detection Technique based on Stochastic Model. Int. J. Comput. Vis. Image Process. 6, 2 (July 2016), 1-17. DOI: https://doi.org/10.4018/IJCVIP.2016070101 Christos Tzelepis, Damianos Galanopoulos, Vasileios Mezaris, and Ioannis Patras. 2016. 
Learning to detect video events from zero or very few video examples. Image Vision Comput. 53, C (September 2016), 35-44. DOI: https://doi.org/10.1016/j.imavis.2015.09.005 Sinnu Susan Thomas, Sumana Gupta, and Venkatesh K. Subramanian. 2016. Perceptual synoptic view of pixel, object and semantic based attributes of video. J. Vis. Comun. Image Represent. 38, C (July 2016), 367-377. DOI: http://dx.doi.org/10.1016/j.jvcir.2016.03.015 Jiyin He, Pernilla Qvarfordt, Martin Halvey, and Gene Golovchinsky. 2016. Beyond actions. Inf. Process. Manage. 52, 6 (November 2016), 1200-1226. DOI: https://doi.org/10.1016/j.ipm.2016.05.007 Lixia Hong, Qingyue Jin, Xusheng Li, and Yizhen Huang. 2016. Image and medical annotations using non-homogeneous 2D ruler learning models. Comput. Electr. Eng. 50, C (February 2016), 102-110. DOI=http://dx.doi.org/10.1016/j.compeleceng.2016.01.011 Mohamed Elhoseiny, Jingen Liu, Hui Cheng, Harpreet Sawhney, and Ahmed Elgammal. 2016. Zero-shot Event Detection by multimodal distributional semantic embedding of videos. In Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence (AAAI'16). AAAI Press 3478-3486. Chuang Gan, Ming Lin, Yi Yang, Gerard de Melo, and Alexander G. Hauptmann. 2016. Concepts not alone: exploring pairwise relationships for zero-shot video activity recognition. In Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence (AAAI'16). AAAI Press 3487-3493. Jingya Wang, Xiatian Zhu, and Shaogang Gong. 2016. Video semantic clustering with sparse and incomplete tags. In Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence (AAAI'16). AAAI Press 3618-3624. Xiaojun Chang, Yi Yang, Guodong Long, Chengqi Zhang, and Alexander G. Hauptmann. 2016. Dynamic concept composition for zero-example event detection. In Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence (AAAI'16). AAAI Press 3464-3470. Diego Ortego, Juan C. SanMiguel, and José M. Martínez. 2016.
Rejection based multipath reconstruction for background estimation in video sequences with stationary objects. Comput. Vis. Image Underst. 147, C (June 2016), 23-37. DOI=http://dx.doi.org/10.1016/j.cviu.2016.03.012 Ruben Fernandez-Beltran and Filiberto Pla. 2016. Latent topics-based relevance feedback for video retrieval. Pattern Recogn. 51, C (March 2016), 72-84. DOI=http://dx.doi.org/10.1016/j.patcog.2015.09.007 JianWen Tao, Shiting Wen, and Wenjun Hu. 2016. Multi-source adaptation learning with global and local regularization by exploiting joint kernel sparse representation. Know.-Based Syst. 98, C (April 2016), 76-94. DOI: http://dx.doi.org/10.1016/j.knosys.2016.01.021 Yingying Zhu, Xiaoyan Huang, Qiang Huang, and Qi Tian. 2016. Large-scale video copy retrieval with temporal-concentration SIFT. Neurocomput. 187, C (April 2016), 83-91. DOI: http://dx.doi.org/10.1016/j.neucom.2015.09.114 Irfan Mehmood, Muhammad Sajjad, Seungmin Rho, and Sung Wook Baik. 2016. Divide-and-conquer based summarization framework for extracting affective video content. Neurocomput. 174, PA (January 2016), 393-403. DOI=http://dx.doi.org/10.1016/j.neucom.2015.05.126 Haojie Li, Lijuan Liu, Fuming Sun, Yu Bao, and Chenxin Liu. 2016. Multi-level feature representations for video semantic concept detection. Neurocomput. 172, C (January 2016), 64-70. DOI=http://dx.doi.org/10.1016/j.neucom.2014.09.096 Lei Bao, Cao Juan, Jintao Li, and Yongdong Zhang. 2016. Boosted Near-miss Under-sampling on SVM ensembles for concept detection in large-scale imbalanced datasets. Neurocomput. 172, C (January 2016), 198-206. DOI=http://dx.doi.org/10.1016/j.neucom.2014.05.096 Haojie Li, Bin Liu, Lei Yi, Yue Guan, and Zhong-Xuan Luo. 2016. On the tag localization of web video. Multimedia Syst. 22, 4 (July 2016), 405-412. DOI: http://dx.doi.org/10.1007/s00530-014-0404-y Foteini Markatopoulou, Vasileios Mezaris, and Ioannis Patras. 2016. 
Ordering of Visual Descriptors in a Classifier Cascade Towards Improved Video Concept Detection. In Proceedings, Part I, of the 22nd International Conference on MultiMedia Modeling - Volume 9516 (MMM 2016), Springer-Verlag New York, Inc., New York, NY, USA, 874-885. DOI: http://dx.doi.org/10.1007/978-3-319-27671-7_73 Peng Wang, Lifeng Sun, Shiqang Yang, and Alan F. Smeaton. 2016. Towards Training-Free Refinement for Semantic Indexing of Visual Media. In Proceedings, Part I, of the 22nd International Conference on MultiMedia Modeling - Volume 9516 (MMM 2016), Springer-Verlag New York, Inc., New York, NY, USA, 251-263. DOI: http://dx.doi.org/10.1007/978-3-319-27671-7_21 Christos Tzelepis, Vasileios Mezaris, and Ioannis Patras. 2016. Video Event Detection Using Kernel Support Vector Machine with Isotropic Gaussian Sample Uncertainty KSVM-iGSU. In Proceedings, Part I, of the 22nd International Conference on MultiMedia Modeling - Volume 9516 (MMM 2016), Qi Tian, Nicu Sebe, Guo-Jun Qi, Benoit Huet, Richang Hong, and Xueliang Liu (Eds.), Vol. 9516. Springer-Verlag New York, Inc., New York, NY, USA, 3-15. DOI: http://dx.doi.org/10.1007/978-3-319-27671-7_1 Xiao-Jun Chen, Yong-Zhao Zhan, Jia Ke, and Xiao-Bo Chen. 2016. Complex video event detection via pairwise fusion of trajectory and multi-label hypergraphs. Multimedia Tools Appl. 75, 22 (November 2016), 15079-15100. DOI: http://dx.doi.org/10.1007/s11042-015-2514-8 Saddam Bekhet, Amr Ahmed, Amjad Altadmri, and Andrew Hunter. 2016. Compressed video matching: Frame-to-frame revisited. Multimedia Tools Appl. 75, 23 (December 2016), 15763-15778. DOI: https://doi.org/10.1007/s11042-015-2887-8 Chuang Gan, Yi Yang, Linchao Zhu, Deli Zhao, and Yueting Zhuang. 2016. Recognizing an Action Using Its Name: A Knowledge-Based Approach. Int. J. Comput. Vision 120, 1 (October 2016), 61-77. DOI: http://dx.doi.org/10.1007/s11263-016-0893-6 Shyi-Chyi Cheng, Jui-Yuan Su, Kuei-Fang Hsiao, and Habib F. Rashvand. 2016. 
Latent semantic learning with time-series cross correlation analysis for video scene detection and classification. Multimedia Tools Appl. 75, 20 (October 2016), 12919-12940. DOI: http://dx.doi.org/10.1007/s11042-015-2548-y Luis Herranz and Shuqiang Jiang. 2016. Scalable storyboards in handheld devices: applications and evaluation metrics. Multimedia Tools Appl. 75, 20 (October 2016), 12597-12625. DOI: http://dx.doi.org/10.1007/s11042-014-2421-4 Debabrata Dutta, Sanjoy Kumar Saha, and Bhabatosh Chanda. 2016. A shot detection technique using linear regression of shot transition pattern. Multimedia Tools Appl. 75, 1 (January 2016), 93-113. DOI=http://dx.doi.org/10.1007/s11042-014-2273-y Mohamed Zarka, Anis Ben Ammar, and Adel M. Alimi. 2016. Fuzzy reasoning framework to improve semantic video interpretation. Multimedia Tools Appl. 75, 10 (May 2016), 5719-5750. DOI=http://dx.doi.org/10.1007/s11042-015-2537-1 Gabriel Sargent, Karina R. Perez-Daniel, Andrei Stoian, Jenny Benois-Pineau, Sofian Maabout, Henri Nicolas, Mariko Nakano Miyatake, and Jean Carrive. 2016. A scalable summary generation method based on cross-modal consensus clustering and OLAP cube modeling. Multimedia Tools Appl. 75, 15 (August 2016), 9073-9094. DOI: http://dx.doi.org/10.1007/s11042-015-2863-3 Matthijs Douze, Jérôme Revaud, Jakob Verbeek, Hervé Jégou, and Cordelia Schmid. 2016. Circulant Temporal Encoding for Video Retrieval and Temporal Alignment. Int. J. Comput. Vision 119, 3 (September 2016), 291-306. DOI: http://dx.doi.org/10.1007/s11263-015-0875-0 Heng Wang, Dan Oneata, Jakob Verbeek, and Cordelia Schmid. 2016. A Robust and Efficient Video Representation for Action Recognition. Int. J. Comput. Vision 119, 3 (September 2016), 219-238. DOI: http://dx.doi.org/10.1007/s11263-015-0846-5 Tarek Zlitni, Bassem Bouaziz, and Walid Mahdi. 2016. Automatic topics segmentation for TV news video using prior knowledge. Multimedia Tools Appl. 75, 10 (May 2016), 5645-5672. 
DOI=http://dx.doi.org/10.1007/s11042-015-2531-7 Abdelkader Hamadi, Philippe Mulhem, and Georges Quénot. 2016. A comparative study for multiple visual concepts detection in images and videos. Multimedia Tools Appl. 75, 15 (August 2016), 8973-8997. DOI: http://dx.doi.org/10.1007/s11042-015-2730-2 Maaike Boer, Klamer Schutte, and Wessel Kraaij. 2016. Knowledge based query expansion in complex multimedia event detection. Multimedia Tools Appl. 75, 15 (August 2016), 9025-9043. DOI: http://dx.doi.org/10.1007/s11042-015-2757-4 Chahid Ouali, Pierre Dumouchel, and Vishwa Gupta. 2016. A spectrogram-based audio fingerprinting system for content-based copy detection. Multimedia Tools Appl. 75, 15 (August 2016), 9145-9165. DOI: http://dx.doi.org/10.1007/s11042-015-3081-8 Vinh-Tiep Nguyen, Minh-Triet Tran, Thanh Duc Ngo, Duy-Dinh Le, and Duc Anh Duong. 2016. Searching a specific person in a specific location using deep features. In Proceedings of the Seventh Symposium on Information and Communication Technology (SoICT '16). ACM, New York, NY, USA, 79-86. DOI: https://doi.org/10.1145/3011077.3011138 S. Sadiq, Y. Yan, M. L. Shyu, S. C. Chen and H. Ishwaran, "Enhancing Multimedia Imbalanced Concept Detection Using VIMP in Random Forests," 2016 IEEE 17th International Conference on Information Reuse and Integration (IRI), Pittsburgh, PA, USA, 2016, pp. 601-608. doi: 10.1109/IRI.2016.87 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7785796&isnumber=7785148 A. Salvador, X. Giró-i-Nieto, F. Marqués and S. Satoh, "Faster R-CNN Features for Instance Search," 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Las Vegas, NV, USA, 2016, pp. 394-401. doi: 10.1109/CVPRW.2016.56 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7789546&isnumber=7789490 X. Chang, Y. L. Yu, Y. Yang and E. P. 
Xing, "They are Not Equally Reliable: Semantic Event Search Using Differentiated Concept Classifiers," 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 2016, pp. 1884-1893. doi: 10.1109/CVPR.2016.208 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7780577&isnumber=7780329 C. Gan, T. Yao, K. Yang, Y. Yang and T. Mei, "You Lead, We Exceed: Labor-Free Video Concept Learning by Jointly Exploiting Web Videos and Images," 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 2016, pp. 923-932. doi: 10.1109/CVPR.2016.106 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7780475&isnumber=7780329 K. Kikuchi, K. Ueki, T. Ogawa and T. Kobayashi, "Video semantic indexing using object detection-derived features," 2016 24th European Signal Processing Conference (EUSIPCO), Budapest, Hungary, 2016, pp. 1288-1292. doi: 10.1109/EUSIPCO.2016.7760456 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7760456&isnumber=7760191 Y. Zheng and Y. Zhang, "GPU-accelerated abrupt shot boundary detection," 2016 16th International Symposium on Communications and Information Technologies (ISCIT), Qingdao, China, 2016, pp. 141-145. doi: 10.1109/ISCIT.2016.7751609 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7751609&isnumber=7751579 Reinhard Sonnleitner and Gerhard Widmer. 2016. Robust quad-based audio fingerprinting. IEEE/ACM Trans. Audio, Speech and Lang. Proc. 24, 3 (March 2016), 409-421. DOI=http://dx.doi.org/10.1109/TASLP.2015.2509248 Anurag Kumar and Bhiksha Raj. 2016. Audio Event Detection using Weakly Labeled Data. In Proceedings of the 2016 ACM on Multimedia Conference (MM '16). ACM, New York, NY, USA, 1038-1047. DOI: https://doi.org/10.1145/2964284.2964310 Vedran Vukotić, Christian Raymond, and Guillaume Gravier. 2016. Multimodal and Crossmodal Representation Learning from Textual and Visual Features with Bidirectional Deep Neural Networks for Video Hyperlinking. 
In Proceedings of the 2016 ACM workshop on Vision and Language Integration Meets Multimedia Fusion (iV&L-MM '16). ACM, New York, NY, USA, 37-44. DOI: https://doi.org/10.1145/2983563.2983567 Ilias Gialampoukidis, Anastasia Moumtzidou, Theodora Tsikrika, Stefanos Vrochidis, and Ioannis Kompatsiaris. 2016. Retrieval of Multimedia Objects by Fusing Multiple Modalities. In Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval (ICMR '16). ACM, New York, NY, USA, 359-362. DOI: http://dx.doi.org/10.1145/2911996.2912068 Xiaoshan Yang, Tianzhu Zhang, and Changsheng Xu. 2016. Semantic Feature Mining for Video Event Understanding. ACM Trans. Multimedia Comput. Commun. Appl. 12, 4, Article 55 (August 2016), 22 pages. DOI: http://dx.doi.org/10.1145/2962719 Stavros Arestis-Chartampilas, Nikolaos Gkalelis, and Vasileios Mezaris. 2016. AKSDA-MSVM: A GPU-accelerated Multiclass Learning Framework for Multimedia. In Proceedings of the 2016 ACM on Multimedia Conference (MM '16). ACM, New York, NY, USA, 461-465. DOI: https://doi.org/10.1145/2964284.2967263 Pascal Mettes. 2016. Weakly-Supervised Recognition, Localization, and Explanation of Visual Entities. In Proceedings of the 2016 ACM on Multimedia Conference (MM '16). ACM, New York, NY, USA, 1459-1463. DOI: https://doi.org/10.1145/2964284.2971479 K. Raghurama Holla and B. H. Shekar. 2016. Video Retrieval based on Patterns of Oriented Edge Magnitude. In Proceedings of the Third International Symposium on Computer Vision and the Internet (VisionNet'16). ACM, New York, NY, USA, 115-120. DOI: http://dx.doi.org/10.1145/2983402.2983433 Lu Jiang. 2016. Web-scale Multimedia Search for Internet Video Content. In Proceedings of the Ninth ACM International Conference on Web Search and Data Mining (WSDM '16). ACM, New York, NY, USA, 701-701. DOI: http://dx.doi.org/10.1145/2835776.2855081 Jiang, L., 2016, April. Web-scale multimedia search for internet video content. 
In Proceedings of the 25th International Conference Companion on World Wide Web (pp. 311-316). International World Wide Web Conferences Steering Committee. Yi-Jie Lu. 2016. Zero-Example Multimedia Event Detection and Recounting with Unsupervised Evidence Localization. In Proceedings of the 2016 ACM on Multimedia Conference (MM '16). ACM, New York, NY, USA, 1464-1468. DOI: https://doi.org/10.1145/2964284.2971480 Chahid Ouali, Pierre Dumouchel, and Vishwa Gupta. 2016. Fast audio fingerprinting system using GPU and a clustering-based technique. IEEE/ACM Trans. Audio, Speech and Lang. Proc. 24, 6 (June 2016), 1106-1118. Yi-Jie Lu, Hao Zhang, Maaike de Boer, and Chong-Wah Ngo. 2016. Event Detection with Zero Example: Select the Right and Suppress the Wrong Concepts. In Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval (ICMR '16). ACM, New York, NY, USA, 127-134. DOI: http://dx.doi.org/10.1145/2911996.2912015 Zhiyong Cheng, Xuanchong Li, Jialie Shen, and Alexander G. Hauptmann. 2016. Which Information Sources are More Effective and Reliable in Video Search. In Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval (SIGIR '16). ACM, New York, NY, USA, 1069-1072. DOI: http://dx.doi.org/10.1145/2911451.2914765 Yuancheng Ye, Xuejian Rong, Xiaodong Yang, and Yingli Tian. 2016. Region Trajectories for Video Semantic Concept Detection. In Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval (ICMR '16). ACM, New York, NY, USA, 255-259. DOI: http://dx.doi.org/10.1145/2911996.2912046 Foteini Markatopoulou, Vasileios Mezaris, and Ioannis Patras. 2016. Deep Multi-task Learning with Label Correlation Constraint for Video Concept Detection. In Proceedings of the 2016 ACM on Multimedia Conference (MM '16). ACM, New York, NY, USA, 501-505. DOI: https://doi.org/10.1145/2964284.2967271 Eva Mohedano, Kevin McGuinness, Noel E.
O'Connor, Amaia Salvador, Ferran Marques, and Xavier Giro-i-Nieto. 2016. Bags of Local Convolutional Features for Scalable Instance Search. In Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval (ICMR '16). ACM, New York, NY, USA, 327-331. DOI: http://dx.doi.org/10.1145/2911996.2912061 Pascal Mettes, Dennis C. Koelma, and Cees G.M. Snoek. 2016. The ImageNet Shuffle: Reorganized Pre-training for Video Event Detection. In Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval (ICMR '16). ACM, New York, NY, USA, 175-182. DOI: http://dx.doi.org/10.1145/2911996.2912036 B. S. Rashmi and H. S. Nagendraswamy. 2016. Abrupt Shot Detection in Video using Weighted Edge Information. In Proceedings of the International Conference on Informatics and Analytics (ICIA-16). ACM, New York, NY, USA, Article 69, 5 pages. DOI: http://dx.doi.org/10.1145/2980258.2980406 Nakamasa Inoue and Koichi Shinoda. 2016. Adaptation of Word Vectors using Tree Structure for Visual Semantics. In Proceedings of the 2016 ACM on Multimedia Conference (MM '16). ACM, New York, NY, USA, 277-281. DOI: https://doi.org/10.1145/2964284.2967226 A. Habibian; T. Mensink; C. G. M. Snoek, "VideoStory Embeddings Recognize Events when Examples are Scarce," in IEEE Transactions on Pattern Analysis and Machine Intelligence , vol.PP, no.99, pp.1-1 doi: 10.1109/TPAMI.2016.2627563 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7740886&isnumber=4359286 X. Nie; Y. Yin; J. Sun; J. Liu; C. Cui, "Comprehensive Feature-based Robust Video Fingerprinting Using Tensor Model," in IEEE Transactions on Multimedia , vol.PP, no.99, pp.1-1 doi: 10.1109/TMM.2016.2629758 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7745950&isnumber=4456689 B. H. Shekar, K. P. Uma and K. R.
Holla, "Shot boundary detection using correlation based spectral residual saliency map," 2016 International Conference on Advances in Computing, Communications and Informatics (ICACCI), Jaipur, India, 2016, pp. 2242-2247. doi: 10.1109/ICACCI.2016.7732385 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7732385&isnumber=7732013 B. S. Rashmi and H. S. Nagendraswamy, "Video shot boundary detection using midrange local binary pattern," 2016 International Conference on Advances in Computing, Communications and Informatics (ICACCI), Jaipur, India, 2016, pp. 201-206. doi: 10.1109/ICACCI.2016.7732047 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7732047&isnumber=7732013 K. P. Uma, B. H. Shekar and K. R. Holla, "Video clip retrieval using local phase quantization," 2016 International Conference on Advances in Computing, Communications and Informatics (ICACCI), Jaipur, India, 2016, pp. 1522-1527. doi: 10.1109/ICACCI.2016.7732264 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7732264&isnumber=7732013 C. L. Chou; H. T. Chen; S. Y. Lee, "Multi-Modal Video-to-Near-Scene Annotation," in IEEE Transactions on Multimedia , vol.PP, no.99, pp.1-1 doi: 10.1109/TMM.2016.2614426 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7579212&isnumber=4456689 M. Yazdi and M. Fani, "Shot boundary detection with effective prediction of transitions' positions and spans by use of classifiers and adaptive thresholds," 2016 24th Iranian Conference on Electrical Engineering (ICEE), Shiraz, Iran, 2016, pp. 167-172. doi: 10.1109/IranianCEE.2016.7585511 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7585511&isnumber=7585374 Z. Li, X. Liu and S. Zhang, "Shot Boundary Detection based on Multilevel Difference of Colour Histograms," 2016 First International Conference on Multimedia and Image Processing (ICMIP), Bandar Seri Begawan, Brunei Darussalam, 2016, pp. 15-22. 
doi: 10.1109/ICMIP.2016.24 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7573060&isnumber=7573029 X. Chang; Y. L. Yu; Y. Yang; E. P. Xing, "Semantic Pooling for Complex Event Analysis in Untrimmed Videos," in IEEE Transactions on Pattern Analysis and Machine Intelligence , vol.PP, no.99, pp.1-1 doi: 10.1109/TPAMI.2016.2608901 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7565615&isnumber=4359286 M. Sang, Z. Sun and K. Jia, "Semantic Similarity Based Video Reranking," 2015 International Conference on Computational Intelligence and Communication Networks (CICN), Jabalpur, India, 2015, pp. 1420-1423. doi: 10.1109/CICN.2015.274 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7546332&isnumber=7546033 F. Markatopoulou, V. Mezaris and I. Patras, "Online multi-task learning for semantic concept detection in video," 2016 IEEE International Conference on Image Processing (ICIP), Phoenix, AZ, USA, 2016, pp. 186-190. doi: 10.1109/ICIP.2016.7532344 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7532344&isnumber=7532277 T. Sato, M. Iwamura, K. Kaneda and K. Kise, "Fast and Memory Saving Instance Search with Approximate Reverse Nearest Neighbor Search Using Reverse Lookup," 2016 IEEE Second International Conference on Multimedia Big Data (BigMM), Taipei, 2016, pp. 326-333. doi: 10.1109/BigMM.2016.76 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7545045&isnumber=7544979 J. Hou, X. Wu, F. Yu and Y. Jia, "Multimedia event detection via deep spatial-temporal neural networks," 2016 IEEE International Conference on Multimedia and Expo (ICME), Seattle, WA, USA, 2016, pp. 1-6. doi: 10.1109/ICME.2016.7552981 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7552981&isnumber=7552854 D. Ren, L. Zhuo, H. Long, P. Qu and J. Zhang, "MPEG-2 Video Copy Detection Method Based on Sparse Representation of Spatial and Temporal Features," 2016 IEEE Second International Conference on Multimedia Big Data (BigMM), Taipei, 2016, pp. 233-236. 
doi: 10.1109/BigMM.2016.21 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7545028&isnumber=7544979 Y. Huo, Y. Wang and H. Hu, "Effective algorithms for video shot and scene boundaries detection," 2016 IEEE/ACIS 15th International Conference on Computer and Information Science (ICIS), Okayama, Japan, 2016, pp. 1-6. doi: 10.1109/ICIS.2016.7550913 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7550913&isnumber=7550716 J. Pang et al., "Accelerate convolutional neural networks for binary classification via cascading cost-sensitive feature," 2016 IEEE International Conference on Image Processing (ICIP), Phoenix, AZ, USA, 2016, pp. 1037-1041. doi: 10.1109/ICIP.2016.7532515 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7532515&isnumber=7532277 R. B. Wang, H. Chen, J. L. Yao and Y. T. Guo, "Video Copy Detection Based On Temporal Contextual Hashing," 2016 IEEE Second International Conference on Multimedia Big Data (BigMM), Taipei, 2016, pp. 223-228. doi: 10.1109/BigMM.2016.12 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7545026&isnumber=7544979 A. Kumar and B. Raj, "Weakly supervised scalable audio content analysis," 2016 IEEE International Conference on Multimedia and Expo (ICME), Seattle, WA, USA, 2016, pp. 1-6. doi: 10.1109/ICME.2016.7552989 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7552989&isnumber=7552854 T. Y. Chang, S. C. Tai and G. S. Lin, "Manipulation classification for near-duplicate videos," 2016 IEEE International Conference on Consumer Electronics-Taiwan (ICCE-TW), Nantou County, Taiwan, 2016, pp. 1-2. doi: 10.1109/ICCE-TW.2016.7520976 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7520976&isnumber=7520694 W. Zhang, C. W. Ngo and X. Cao, "Hyperlink-Aware Object Retrieval," in IEEE Transactions on Image Processing, vol. 25, no. 9, pp. 4186-4198, Sept. 2016. doi: 10.1109/TIP.2016.2590321 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7508952&isnumber=7502214 J. C. SanMiguel; A. 
Cavallaro, "Energy Consumption Models for Smart-Camera Networks," in IEEE Transactions on Circuits and Systems for Video Technology , vol.PP, no.99, pp.1-1 doi: 10.1109/TCSVT.2016.2593598 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7517353&isnumber=4358651 X. Liu; L. Huang; C. Deng; B. Lang; D. Tao, "Query-Adaptive Hash Code Ranking for Large-Scale Multi-View Visual Search," in IEEE Transactions on Image Processing , vol.PP, no.99, pp.1-1 doi: 10.1109/TIP.2016.2593344 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7516672&isnumber=4358840 Y. Xian; X. Rong; X. Yang; Y. Tian, "Evaluation of Low-Level Features for Real-World Surveillance Event Detection," in IEEE Transactions on Circuits and Systems for Video Technology , vol.PP, no.99, pp.1-1 doi: 10.1109/TCSVT.2016.2589838 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7514916&isnumber=4358651 B. Safadi, P. Mulhem, G. Quénot and J. P. Chevallet, "Lifelog Semantic Annotation using deep visual features and metadata-derived descriptors," 2016 14th International Workshop on Content-Based Multimedia Indexing (CBMI), Bucharest, Romania, 2016, pp. 1-6. doi: 10.1109/CBMI.2016.7500247 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7500247&isnumber=7500235 A. Moumtzidou, I. Gialampoukidis, T. Mironidis, D. Liparas, S. Vrochidis and I. Kompatsiaris, "A multimedia interactive search engine based on graph-based and non-linear multimodal fusion," 2016 14th International Workshop on Content-Based Multimedia Indexing (CBMI), Bucharest, Romania, 2016, pp. 1-4. doi: 10.1109/CBMI.2016.7500276 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7500276&isnumber=7500235 C. Lyu et al., "Identifying group-wise consistent sub-networks via spatial sparse representation of natural stimulus FMRI data," 2016 IEEE 13th International Symposium on Biomedical Imaging (ISBI), Prague, Czech Republic, 2016, pp. 62-65. 
doi: 10.1109/ISBI.2016.7493211 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7493211&isnumber=7493185 E. Mezghani, M. Charfeddine, C. Ben Amar and H. Nicolas, "Audiovisual video characterization using audio watermarking scheme," 2015 15th International Conference on Intelligent Systems Design and Applications (ISDA), Marrakech, 2015, pp. 213-218. doi: 10.1109/ISDA.2015.7489227 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7489227&isnumber=7489153 M. Chakroun, A. Wali, Y. Aribi and A. M. Alimi, "Video event detection using auto-associative neural network and incremental SVM models," 2015 15th International Conference on Intelligent Systems Design and Applications (ISDA), Marrakech, 2015, pp. 563-568. doi: 10.1109/ISDA.2015.7489178 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7489178&isnumber=7489153 O. Ben Said, A. Wali and A. M. Alimi, "Interlinking video programs with Linked Open Data," 2015 15th International Conference on Intelligent Systems Design and Applications (ISDA), Marrakech, 2015, pp. 462-467. doi: 10.1109/ISDA.2015.7489159 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7489159&isnumber=7489153 J. Geng, Z. Miao, Q. Liang and S. Wang, "Linear multimodal fusion in video concept analysis based on node equilibrium model," 2015 3rd IAPR Asian Conference on Pattern Recognition (ACPR), Kuala Lumpur, 2015, pp. 316-320. doi: 10.1109/ACPR.2015.7486517 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7486517&isnumber=7486438 A. Agharwal, R. Kovvuri, R. Nevatia and C. G. M. Snoek, "Tag-based video retrieval by embedding semantic content in a continuous word space," 2016 IEEE Winter Conference on Applications of Computer Vision (WACV), Lake Placid, NY, USA, 2016, pp. 1-8. doi: 10.1109/WACV.2016.7477706 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7477706&isnumber=7477446 K. Ueki and T. 
Kobayashi, "Improving semantic video indexing: Efforts in Waseda TRECVID 2015 SIN system," 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Shanghai, China, 2016, pp. 1184-1188. doi: 10.1109/ICASSP.2016.7471863 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7471863&isnumber=7471614 Y. Wang, L. Neves and F. Metze, "Audio-based multimedia event detection using deep recurrent neural networks," 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Shanghai, China, 2016, pp. 2742-2746. doi: 10.1109/ICASSP.2016.7472176 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7472176&isnumber=7471614 Q. Chen, W. Jiang, Y. Zhao and Z. Zhao, "Part-based deep network for pedestrian detection in surveillance videos," 2015 Visual Communications and Image Processing (VCIP), Singapore, Singapore, 2015, pp. 1-4. doi: 10.1109/VCIP.2015.7457855 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7457855&isnumber=7457773 C. Ouali, P. Dumouchel and V. Gupta, "Fast Audio Fingerprinting System Using GPU and a Clustering-Based Technique," in IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 24, no. 6, pp. 1106-1118, June 2016. doi: 10.1109/TASLP.2016.2541303 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7431948&isnumber=7463555 M. Mazloom; X. Li; C. Snoek, "TagBook: A Semantic Video Representation without Supervision for Event Detection," in IEEE Transactions on Multimedia , vol.PP, no.99, pp.1-1 doi: 10.1109/TMM.2016.2559947 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7462268&isnumber=4456689 P. Kanungo and T.
Kar, "Cut detection using block based center symmetric local binary pattern," 2015 International Conference on Man and Machine Interfacing (MAMI), Bhubaneswar, India, 2015, pp. 1-5. doi: 10.1109/MAMI.2015.7456583 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7456583&isnumber=7456527 L. Yu; Z. Huang; J. Cao; H. T. Shen, "Scalable Video Event Retrieval by Visual State Binary Embedding," in IEEE Transactions on Multimedia , vol.PP, no.99, pp.1-1. doi: 10.1109/TMM.2016.2557059 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7457257&isnumber=4456689 S. Angadi and V. Naik, "Static video summarization - A minimum edge weight bipartite graph matching approach," 2015 IEEE International Conference on Computer Graphics, Vision and Information Security (CGVIS), Bhubaneshwar, Odisha, India, 2015, pp. 100-105. doi: 10.1109/CGVIS.2015.7449901 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7449901&isnumber=7449869 Himeur, Yassine; Sadi, Karima Ait, "A Rotation Invariant BSIF Descriptor for Video Copy Detection Using a Ring Decomposition," in Signal-Image Technology & Internet-Based Systems (SITIS), 2015 11th International Conference on , vol., no., pp.300-305, 23-27 Nov. 2015 doi: 10.1109/SITIS.2015.71 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7400580&isnumber=7400513 Himeur, Yassine; Sadi, Karima Ait, "Joint color and texture descriptor using ring decomposition for robust video copy detection in large databases," in Signal Processing and Information Technology (ISSPIT), 2015 IEEE International Symposium on , vol., no., pp.495-500, 7-10 Dec. 2015. 
doi: 10.1109/ISSPIT.2015.7394386 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7394386&isnumber=7394243 Wei, X.-S.; Wu, J.; Zhou, Z.-H., "Scalable Algorithms for Multi-Instance Learning," in Neural Networks and Learning Systems, IEEE Transactions on , vol.PP, no.99, pp.1-13 doi: 10.1109/TNNLS.2016.2519102 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7398097&isnumber=6104215 Zhang, X.; Zhang, H.; Zhang, Y.; Yang, Y.; Wang, M.; Luan, H.; Li, J.; Chua, T., "Deep Fusion of Multiple Semantic Cues for Complex Event Recognition," in Image Processing, IEEE Transactions on , vol.25, no.3, pp.1033-1046, March 2016 doi: 10.1109/TIP.2015.2511585 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7364255&isnumber=7383373 --------------------------------------------------------------------- 2015 (97) --------------------------------------------------------------------- F. Markatopoulou, V. Mezaris, N. Pittaras and I. Patras, "Local Features and a Two-Layer Stacking Architecture for Semantic Concept Detection in Video," in IEEE Transactions on Emerging Topics in Computing, vol. 3, no. 2, pp. 193-204, June 2015. doi: 10.1109/TETC.2015.2418714 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7073626&isnumber=7118282 X. Liu, Y. Mu, D. Zhang, B. Lang and X. Li, "Large-Scale Unsupervised Hashing with Shared Structure Learning," in IEEE Transactions on Cybernetics, vol. 45, no. 9, pp. 1811-1822, Sept. 2015. doi: 10.1109/TCYB.2014.2360856 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6960876&isnumber=7203181 Y. Han, Y. Yang, Y. Yan, Z. Ma, N. Sebe and X.
Zhou, "Semisupervised Feature Selection via Spline Regression for Video Semantic Recognition," in IEEE Transactions on Neural Networks and Learning Systems, vol. 26, no. 2, pp. 252-264, Feb. 2015. doi: 10.1109/TNNLS.2014.2314123 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6786497&isnumber=7010866 Rouhi, Amir H., "Evaluating Spatio-Temporal Parameters in Video Similarity Detection by Global Descriptors," in Digital Image Computing: Techniques and Applications (DICTA), 2015 International Conference on , vol., no., pp.1-8, 23-25 Nov. 2015 doi: 10.1109/DICTA.2015.7371255 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7371255&isnumber=7371204 Rouhi, Amir H., "Enhanced-IPMH as a Robust Visual Descriptor from H.264/AVC and Evaluation of Parameters Effects," in Digital Image Computing: Techniques and Applications (DICTA), 2015 International Conference on , vol., no., pp.1-8, 23-25 Nov. 2015 doi: 10.1109/DICTA.2015.7371254 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7371254&isnumber=7371204 Jiang, L., Yu, S. I., Meng, D., Mitamura, T., & Hauptmann, A. G. (2015). Text-to-video: a semantic search engine for internet videos. International Journal of Multimedia Information Retrieval, 1-16. Zhang, X.; Zhang, H.; Zhang, Y.D.; Yang, Y.; Wang, M.; Luan, H.; Li, J.T.; Chua, Tat-Seng, "Deep Fusion of Multiple Semantic Cues for Complex Event Recognition," in Image Processing, IEEE Transactions on , vol.PP, no.99, pp.1-1. doi: 10.1109/TIP.2015.2511585 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7364255&isnumber=4358840 Li, X.; Zhao, X.; Zhang, Z.; Wu, F.; Zhuang, Y.; Wang, J.; Li, X., "Joint Multilabel Classification With Community-Aware Label Graph Learning," in Image Processing, IEEE Transactions on , vol.25, no.1, pp.484-493, Jan. 2016.
doi: 10.1109/TIP.2015.2503700 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7337423&isnumber=7331739 Changyu Liu, Dapeng Li, Bin Lu, Juntao Xiong, Event Bank based Multimedia Representation via Latent Group Logistic Regression Minimization, Neurocomputing, Available online 15 December 2015, ISSN 0925-2312, http://dx.doi.org/10.1016/j.neucom.2015.12.002. http://www.sciencedirect.com/science/article/pii/S0925231215018883 Juan Manuel Barrios and Jose Manuel Saavedra. 2015. Score Propagation Based on Similarity Shot Graph for Improving Visual Object Retrieval. In Proceedings of the Third Edition Workshop on Speech, Language & Audio in Multimedia (SLAM '15). ACM, New York, NY, USA, 19-22. DOI=http://dx.doi.org/10.1145/2802558.2814644 Andrea Ceroni, Vassilios Solachidis, Claudia Niederée, Olga Papadopoulou, Nattiya Kanhabua, and Vasileios Mezaris. 2015. To Keep or not to Keep: An Expectation-oriented Photo Selection Method for Personal Photo Collections. In Proceedings of the 5th ACM on International Conference on Multimedia Retrieval (ICMR '15). ACM, New York, NY, USA, 187-194. DOI=http://dx.doi.org/10.1145/2671188.2749372 Sébastien Poullot, Shunsuke Tsukatani, Anh Phuong Nguyen, Hervé Jégou, and Shin'Ichi Satoh. 2015. Temporal Matching Kernel with Explicit Feature Maps. In Proceedings of the 23rd ACM international conference on Multimedia (MM '15). ACM, New York, NY, USA, 381-390. DOI=http://dx.doi.org/10.1145/2733373.2806228 Julia Bernd, Damian Borth, Carmen Carrano, Jaeyoung Choi, Benjamin Elizalde, Gerald Friedland, Luke Gottlieb, Karl Ni, Roger Pearce, Doug Poland, Khalid Ashraf, David A. Shamma, and Bart Thomee. 2015. Kickstarting the Commons: The YFCC100M and the YLI Corpora. In Proceedings of the 2015 Workshop on Community-Organized Multimodal Mining: Opportunities for Novel Solutions (MMCommons '15). ACM, New York, NY, USA, 1-6. 
DOI=http://dx.doi.org/10.1145/2814815.2816986 Shicheng Xu, Huan Li, Xiaojun Chang, Shoou-I Yu, Xingzhong Du, Xuanchong Li, Lu Jiang, Zexi Mao, Zhenzhong Lan, Susanne Burger, and Alexander Hauptmann. 2015. Incremental Multimodal Query Construction for Video Search. In Proceedings of the 5th ACM on International Conference on Multimedia Retrieval (ICMR '15). ACM, New York, NY, USA, 675-678. DOI=http://dx.doi.org/10.1145/2671188.2749413 Shoou-I Yu, Lu Jiang, Zhongwen Xu, Yi Yang, and Alexander G. Hauptmann. 2015. Content-Based Video Search over 1 Million Videos with 1 Core in 1 Second. In Proceedings of the 5th ACM on International Conference on Multimedia Retrieval (ICMR '15). ACM, New York, NY, USA, 419-426. DOI=http://dx.doi.org/10.1145/2671188.2749398 Klaus Schoeffmann, Marco A. Hudelist, and Jochen Huber. 2015. Video Interaction Tools: A Survey of Recent Work. ACM Comput. Surv. 48, 1, Article 14 (September 2015), 34 pages. DOI=http://dx.doi.org/10.1145/2808796 Guangnan Ye, Yitong Li, Hongliang Xu, Dong Liu, and Shih-Fu Chang. 2015. EventNet: A Large Scale Structured Concept Library for Complex Event Detection in Video. In Proceedings of the 23rd ACM international conference on Multimedia (MM '15). ACM, New York, NY, USA, 471-480. DOI=http://dx.doi.org/10.1145/2733373.2806221 Yonghong Tian, Mengren Qian, and Tiejun Huang. 2015. TASC: A Transformation-Aware Soft Cascading Approach for Multimodal Video Copy Detection. ACM Trans. Inf. Syst. 33, 2, Article 7 (February 2015), 34 pages. DOI=http://dx.doi.org/10.1145/2699662 Khalid Ashraf, Benjamin Elizalde, Forrest Iandola, Matthew Moskewicz, Julia Bernd, Gerald Friedland, and Kurt Keutzer. 2015. Audio-Based Multimedia Event Detection with DNNs and Sparse Sampling. In Proceedings of the 5th ACM on International Conference on Multimedia Retrieval (ICMR '15). ACM, New York, NY, USA, 611-614. DOI=http://dx.doi.org/10.1145/2671188.2749396 Lu Jiang, Shoou-I Yu, Deyu Meng, Yi Yang, Teruko Mitamura, and Alexander G. Hauptmann. 
2015. Fast and Accurate Content-based Semantic Search in 100M Internet Videos. In Proceedings of the 23rd ACM international conference on Multimedia (MM '15). ACM, New York, NY, USA, 49-58. DOI=http://dx.doi.org/10.1145/2733373.2806237 Xiaojun Chang, Yao-Liang Yu, Yi Yang, and Alexander G. Hauptmann. 2015. Searching Persuasively: Joint Event Detection and Evidence Recounting with Limited Supervision. In Proceedings of the 23rd ACM international conference on Multimedia (MM '15). ACM, New York, NY, USA, 581-590. DOI=http://dx.doi.org/10.1145/2733373.2806218 Sang Phan, Duy-Dinh Le, and Shin'ichi Satoh. 2015. Multimedia Event Detection Using Event-Driven Multiple Instance Learning. In Proceedings of the 23rd ACM international conference on Multimedia (MM '15). ACM, New York, NY, USA, 1255-1258. DOI=http://dx.doi.org/10.1145/2733373.2806330 Lu Jiang, Shoou-I Yu, Deyu Meng, Teruko Mitamura, and Alexander G. Hauptmann. 2015. Bridging the Ultimate Semantic Gap: A Semantic Search Engine for Internet Videos. In Proceedings of the 5th ACM on International Conference on Multimedia Retrieval (ICMR '15). ACM, New York, NY, USA, 27-34. DOI=http://dx.doi.org/10.1145/2671188.2749399 B. H. Shekar and K. P. Uma. 2015. Gabor Moments Based Shot Boundary Detection. In Proceedings of the Third International Symposium on Women in Computing and Informatics (WCI '15), Indu Nair (Ed.). ACM, New York, NY, USA, 359-364. DOI=http://dx.doi.org/10.1145/2791405.2791499 Stavros Arestis-Chartampilas, Nikolaos Gkalelis, and Vasileios Mezaris. 2015. GPU Accelerated Generalised Subclass Discriminant Analysis for Event and Concept Detection in Video. In Proceedings of the 23rd ACM international conference on Multimedia (MM '15). ACM, New York, NY, USA, 1219-1222. DOI=http://dx.doi.org/10.1145/2733373.2806321 Moitreya Chatterjee and Anton Leuski. 2015. A Novel Statistical Approach for Image and Video Retrieval and Its Adaption for Active Learning. 
In Proceedings of the 23rd ACM international conference on Multimedia (MM '15). ACM, New York, NY, USA, 935-938. DOI=http://dx.doi.org/10.1145/2733373.2806368 M. L. Smitha and B. H. Shekar. 2015. Illumination Invariant Text Recognition System Based On Contrast Limit Adaptive Histogram Equalization in Videos/Images. In Proceedings of the Third International Symposium on Women in Computing and Informatics (WCI '15), Indu Nair (Ed.). ACM, New York, NY, USA, 174-179. DOI=http://dx.doi.org/10.1145/2791405.2791498 Moitreya Chatterjee and Anton Leuski. 2015. CRMActive: An Active Learning Based Approach for Effective Video Annotation and Retrieval. In Proceedings of the 5th ACM on International Conference on Multimedia Retrieval (ICMR '15). ACM, New York, NY, USA, 535-538. DOI=http://dx.doi.org/10.1145/2671188.2749342 Eva Mohedano, Kevin McGuinness, Graham Healy, Noel E. O'Connor, Alan F. Smeaton, Amaia Salvador, Sergi Porta, and Xavier Giró-i-Nieto. 2015. Exploring EEG for Object Detection and Retrieval. In Proceedings of the 5th ACM on International Conference on Multimedia Retrieval (ICMR '15). ACM, New York, NY, USA, 591-594. DOI=http://dx.doi.org/10.1145/2671188.2749368 Nakamasa Inoue and Koichi Shinoda. 2015. Vocabulary Expansion Using Word Vectors for Video Semantic Indexing. In Proceedings of the 23rd ACM international conference on Multimedia (MM '15). ACM, New York, NY, USA, 851-854. 
DOI=http://dx.doi.org/10.1145/2733373.2806347 Jie Geng; Zhenjiang Miao; Xiao-Ping Zhang, "Efficient Heuristic Methods for Multimodal Fusion and Concept Fusion in Video Concept Detection," in Multimedia, IEEE Transactions on, vol.17, no.4, pp.498-511, April 2015 doi: 10.1109/TMM.2015.2398195 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7027217&isnumber=7060570 Pourian, N.; Manjunath, B.S., "PixNet: A Localized Feature Representation for Classification and Visual Search," in Multimedia, IEEE Transactions on , vol.17, no.5, pp.616-625, May 2015 doi: 10.1109/TMM.2015.2410734 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7055325&isnumber=7086387 Zhenzhong Lan; Ming Lin; Xuanchong Li; Hauptmann, A.G.; Raj, B., "Beyond Gaussian Pyramid: Multi-skip Feature Stacking for action recognition," in Computer Vision and Pattern Recognition (CVPR), 2015 IEEE Conference on , vol., no., pp.204-212, 7-12 June 2015 doi: 10.1109/CVPR.2015.7298616 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7298616&isnumber=7298593 Markatopoulou, F.; Mezaris, V.; Pittaras, N.; Patras, I., "Local Features and a Two-Layer Stacking Architecture for Semantic Concept Detection in Video," in Emerging Topics in Computing, IEEE Transactions on , vol.3, no.2, pp.193-204, June 2015 doi: 10.1109/TETC.2015.2418714 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7073626&isnumber=7118282 Yahong Han; Yi Yang; Yan Yan; Zhigang Ma; Sebe, N.; Xiaofang Zhou, "Semisupervised Feature Selection via Spline Regression for Video Semantic Recognition," in Neural Networks and Learning Systems, IEEE Transactions on , vol.26, no.2, pp.252-264, Feb. 
2015 doi: 10.1109/TNNLS.2014.2314123 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6786497&isnumber=7010866 Kalpakis, G.; Tsikrika, T.; Markatopoulou, F.; Pittaras, N.; Vrochidis, S.; Mezaris, V.; Patras, I.; Kompatsiaris, I., "Concept Detection in Multimedia Web Resources About Home Made Explosives," in Availability, Reliability and Security (ARES), 2015 10th International Conference on , vol., no., pp.632-641, 24-27 Aug. 2015 doi: 10.1109/ARES.2015.85 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7299974&isnumber=7299862 Ling Shao; Fan Zhu; Xuelong Li, "Transfer Learning for Visual Categorization: A Survey," in Neural Networks and Learning Systems, IEEE Transactions on , vol.26, no.5, pp.1019-1034, May 2015 doi: 10.1109/TNNLS.2014.2330900 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6847217&isnumber=7086401 Nguyen, Vinh-Tiep; Nguyen, Dinh-Luan; Tran, Minh-Triet; Le, Duy-Dinh; Duong, Duc Anh; Satoh, Shin'ichi, "Query-adaptive late fusion with neural network for instance search," in Multimedia Signal Processing (MMSP), 2015 IEEE 17th International Workshop on , vol., no., pp.1-6, 19-21 Oct. 
2015 doi: 10.1109/MMSP.2015.7340795 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7340795&isnumber=7340786 Yaowei Wang; Yonghong Tian; Limin Su; Xiaoyu Fang; Ziwei Xia; Tiejun Huang, "Detecting Rare Actions and Events from Surveillance Big Data with Bag of Dynamic Trajectories," in Multimedia Big Data (BigMM), 2015 IEEE International Conference on , vol., no., pp.128-135, 20-22 April 2015 doi: 10.1109/BigMM.2015.74 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7153866&isnumber=7153824 Ting Yao; Yingwei Pan; Chong-Wah Ngo; Houqiang Li; Tao Mei, "Semi-supervised Domain Adaptation with Subspace Learning for visual recognition," in Computer Vision and Pattern Recognition (CVPR), 2015 IEEE Conference on , vol., no., pp.2142-2150, 7-12 June 2015 doi: 10.1109/CVPR.2015.7298826 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7298826&isnumber=7298593 Markatopoulou, Foteini; Mezaris, Vasileios; Patras, Ioannis, "Cascade of classifiers based on binary, non-binary and deep convolutional network descriptors for video concept detection," in Image Processing (ICIP), 2015 IEEE International Conference on , vol., no., pp.1786-1790, 27-30 Sept. 2015 doi: 10.1109/ICIP.2015.7351108 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7351108&isnumber=7350743 Xishan Zhang; Yang Yang; Yongdong Zhang; Huanbo Luan; Jintao Li; Hanwang Zhang; Tat-Seng Chua, "Enhancing Video Event Recognition Using Automatically Constructed Semantic-Visual Knowledge Base," in Multimedia, IEEE Transactions on , vol.17, no.9, pp.1562-1575, Sept. 2015. 
doi: 10.1109/TMM.2015.2449660 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7132742&isnumber=7182813 Jie Xu; Tekin, C.; Zhang, S.; van der Schaar, M., "Distributed Multi-Agent Online Learning Based on Global Feedback," in Signal Processing, IEEE Transactions on , vol.63, no.9, pp.2225-2238, May 1, 2015 doi: 10.1109/TSP.2015.2403288 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7041172&isnumber=7067505 Inoue, N.; Shinoda, K., "Fast Coding of Feature Vectors using Neighbor-To-Neighbor Search," in Pattern Analysis and Machine Intelligence, IEEE Transactions on , vol.PP, no.99, pp.1-1 doi: 10.1109/TPAMI.2015.2481390 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7274762&isnumber=4359286 Arabaci, M.A.; Esen, E., "Video copy detection using motion co-occurrence feature," in Signal Processing and Communications Applications Conference (SIU), 2015 23rd , vol., no., pp.1946-1949, 16-19 May 2015. doi: 10.1109/SIU.2015.7130243 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7130243&isnumber=7129794 Zhongwen Xu; Yi Yang; Hauptmann, A.G., "A discriminative CNN video representation for event detection," in Computer Vision and Pattern Recognition (CVPR), 2015 IEEE Conference on , vol., no., pp.1798-1807, 7-12 June 2015 doi: 10.1109/CVPR.2015.7298789 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7298789&isnumber=7298593 García-Martín, A.; Martínez, J.M., "People detection in surveillance: classification and evaluation," in Computer Vision, IET , vol.9, no.5, pp.779-788, 10 2015 doi: 10.1049/iet-cvi.2014.0148 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7270482&isnumber=7270452 Han, J.; Ji, X.; Hu, X.; Guo, L.; Liu, T., "Arousal Recognition Using Audio-Visual Features and FMRI-Based Brain Response," in Affective Computing, IEEE Transactions on , vol.6, no.4, pp.337-347, Oct.-Dec. 1 2015.
doi: 10.1109/TAFFC.2015.2411280 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7056522&isnumber=7335704 Chien-Li Chou; Hua-Tsung Chen; Suh-Yin Lee, "Pattern-Based Near-Duplicate Video Retrieval and Localization on Web-Scale Videos," in Multimedia, IEEE Transactions on , vol.17, no.3, pp.382-395, March 2015. doi: 10.1109/TMM.2015.2391674 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7008558&isnumber=7041253 Hyungtae Lee; Morariu, V.I.; Davis, L.S., "Clauselets: Leveraging Temporally Related Actions for Video Event Analysis," in Applications of Computer Vision (WACV), 2015 IEEE Winter Conference on , vol., no., pp.1161-1168, 5-9 Jan. 2015 doi: 10.1109/WACV.2015.159 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7046013&isnumber=7045853 Guozhu Liang; Shivakumara, P.; Tong Lu; Chew Lim Tan, "Multi-Spectral Fusion Based Approach for Arbitrarily Oriented Scene Text Detection in Video Images," in Image Processing, IEEE Transactions on , vol.24, no.11, pp.4488-4501, Nov. 2015 doi: 10.1109/TIP.2015.2465169 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7180356&isnumber=7131605 Hamadi, A.; Mulhem, P.; Quenot, G., "Temporal re-scoring vs. temporal descriptors for semantic indexing of videos," in Content-Based Multimedia Indexing (CBMI), 2015 13th International Workshop on , vol., no., pp.1-4, 10-12 June 2015 doi: 10.1109/CBMI.2015.7153626 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7153626&isnumber=7153597 Ting-Chu Lin; Min-Chun Yang; Chia-Yin Tsai; Wang, Y.-C.F., "Query-Adaptive Multiple Instance Learning for Video Instance Retrieval," in Image Processing, IEEE Transactions on , vol.24, no.4, pp.1330-1340, April 2015. 
doi: 10.1109/TIP.2015.2403236 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7041233&isnumber=7038243 Ouali, C.; Dumouchel, P.; Gupta, V., "Efficient spectrogram-based binary image feature for audio copy detection," in Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on , vol., no., pp.1792-1796, 19-24 April 2015 doi: 10.1109/ICASSP.2015.7178279 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7178279&isnumber=7177909 Jingxin Xu; Denman, S.; Sridharan, S.; Fookes, C., "An Efficient and Robust System for Multiperson Event Detection in Real-World Indoor Surveillance Scenes," in Circuits and Systems for Video Technology, IEEE Transactions on , vol.25, no.6, pp.1063-1076, June 2015 doi: 10.1109/TCSVT.2014.2367352 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6948205&isnumber=7116636 Li, Xianfeng; Zhan, Yongzhao; Xu, Sen, "Video Shot Annotation Based on Hypergraph Random Walk Algorithm," in Intelligent Human-Machine Systems and Cybernetics (IHMSC), 2015 7th International Conference on , vol.2, no., pp.167-170, 26-27 Aug. 2015 doi: 10.1109/IHMSC.2015.92 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7334942&isnumber=7334774 Yale Song; Vallmitjana, J.; Stent, A.; Jaimes, A., "TVSum: Summarizing web videos using titles," in Computer Vision and Pattern Recognition (CVPR), 2015 IEEE Conference on , vol., no., pp.5179-5187, 7-12 June 2015. doi: 10.1109/CVPR.2015.7299154 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7299154&isnumber=7298593 Ondel, L.; Anguera, X.; Luque, J., "MASK+: Data-driven regions selection for acoustic fingerprinting," in Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on , vol., no., pp.335-339, 19-24 April 2015. 
doi: 10.1109/ICASSP.2015.7177986 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7177986&isnumber=7177909 Etter, D.; Domeniconi, C., "Multi2Rank: Multimedia Multiview Ranking," in Multimedia Big Data (BigMM), 2015 IEEE International Conference on , vol., no., pp.80-87, 20-22 April 2015 doi: 10.1109/BigMM.2015.47 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7153859&isnumber=7153824 Chuang Gan; Naiyan Wang; Yi Yang; Dit-Yan Yeung; Hauptmann, A.G., "DevNet: A Deep Event Network for multimedia event detection and evidence recounting," in Computer Vision and Pattern Recognition (CVPR), 2015 IEEE Conference on , vol., no., pp.2568-2577, 7-12 June 2015 doi: 10.1109/CVPR.2015.7298872 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7298872&isnumber=7298593 Nan Nan; Guizhong Liu, "Video Copy Detection Based on Path Merging and Query Content Prediction," in Circuits and Systems for Video Technology, IEEE Transactions on, vol.25, no.10, pp.1682-1695, Oct. 2015. doi: 10.1109/TCSVT.2015.2395771 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7024903&isnumber=7284726 Wang, Jinzhuo; Wang, Wenmin; Wang, Ronggang; Gao, Wen, "A compact shot representation for video semantic indexing," in Image Processing (ICIP), 2015 IEEE International Conference on , vol., no., pp.3265-3269, 27-30 Sept. 
2015 doi: 10.1109/ICIP.2015.7351407 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7351407&isnumber=7350743 Ceroni, A.; Solachidis, V.; Mingxin Fu; Kanhabua, N.; Papadopoulou, O.; Niederee, C.; Mezaris, V., "Investigating human behaviors in selecting personal photos to preserve memories," in Multimedia & Expo Workshops (ICMEW), 2015 IEEE International Conference on, vol., no., pp.1-6, June 29 2015-July 3 2015 doi: 10.1109/ICMEW.2015.7169750 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7169750&isnumber=7169738 Kyoungmin Lee; Kolsch, M., "Shot Boundary Detection with Graph Theory Using Keypoint Features and Color Histograms," in Applications of Computer Vision (WACV), 2015 IEEE Winter Conference on , vol., no., pp.1177-1184, 5-9 Jan. 2015 doi: 10.1109/WACV.2015.161 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7046015&isnumber=7045853 Budnik, M.; Gutierrez-Gomez, E.-L.; Safadi, B.; Quenot, G., "Learned features versus engineered features for semantic video indexing," in Content-Based Multimedia Indexing (CBMI), 2015 13th International Workshop on , vol., no., pp.1-6, 10-12 June 2015 doi: 10.1109/CBMI.2015.7153637 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7153637&isnumber=7153597 Huang, D.; Cabral, R.; de la Torre, F., "Robust Regression," in Pattern Analysis and Machine Intelligence, IEEE Transactions on , vol.PP, no.99, pp.1-1 doi: 10.1109/TPAMI.2015.2448091 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7130636&isnumber=4359286 Kuan-Ting Lai; Dong Liu; Shih-Fu Chang; Ming-Syan Chen, "Learning Sample Specific Weights for Late Fusion," in Image Processing, IEEE Transactions on , vol.24, no.9, pp.2772-2783, Sept. 
2015 doi: 10.1109/TIP.2015.2423560 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7086303&isnumber=7110434 Kumar, A.; Raj, B., "A novel ranking method for multiple classifier systems," in Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on , vol., no., pp.1931-1935, 19-24 April 2015 doi: 10.1109/ICASSP.2015.7178307 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7178307&isnumber=7177909 Pourian, N.; Manjunath, B.S., "Retrieval of Images with Objects of Specific Size, Location, and Spatial Configuration," in Applications of Computer Vision (WACV), 2015 IEEE Winter Conference on , vol., no., pp.960-967, 5-9 Jan. 2015 doi: 10.1109/WACV.2015.133 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7045987&isnumber=7045853 Etter, D.; Domeniconi, C., "SemRank: Semantic rank learning for multimedia retrieval," in Semantic Computing (ICSC), 2015 IEEE International Conference on , vol., no., pp.57-64, 7-9 Feb. 2015 doi: 10.1109/ICOSC.2015.7050778 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7050778&isnumber=7050753 Amer, M.; Todorovic, S., "Sum Product Networks for Activity Recognition," in Pattern Analysis and Machine Intelligence, IEEE Transactions on , vol.PP, no.99, pp.1-1 doi: 10.1109/TPAMI.2015.2465955 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7182341&isnumber=4359286 Wenjing Tong; Li Song; Xiaokang Yang; Hui Qu; Rong Xie, "CNN-based shot boundary detection and video annotation," in Broadband Multimedia Systems and Broadcasting (BMSB), 2015 IEEE International Symposium on , vol., no., pp.1-5, 17-19 June 2015 doi: 10.1109/BMSB.2015.7177222 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7177222&isnumber=7177182 Junwei Han; Changyuan Chen; Ling Shao; Xintao Hu; Jungong Han; Tianming Liu, "Learning Computational Models of Video Memorability from fMRI Brain Imaging," in Cybernetics, IEEE Transactions on , vol.45, no.8, pp.1692-1703, Aug. 
2015 doi: 10.1109/TCYB.2014.2358647 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6919270&isnumber=7156182 Hsin-Yu Ha; Shu-Ching Chen; Mei-Ling Shyu, "Utilizing Indirect Associations in Multimedia Semantic Retrieval," in Multimedia Big Data (BigMM), 2015 IEEE International Conference on, vol., no., pp.72-79, 20-22 April 2015. doi: 10.1109/BigMM.2015.89 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7153858&isnumber=7153824 Hsin-Yu Ha; Shu-Ching Chen; Mei-Ling Shyu, "Negative-Based Sampling for Multimedia Retrieval," in Information Reuse and Integration (IRI), 2015 IEEE International Conference on, vol., no., pp.64-71, 13-15 Aug. 2015. doi: 10.1109/IRI.2015.20 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7300956&isnumber=7300933 Wei Zhang; Chong-Wah Ngo, "Topological Spatial Verification for Instance Search," in Multimedia, IEEE Transactions on , vol.17, no.8, pp.1236-1247, Aug. 2015 doi: 10.1109/TMM.2015.2440997 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7117400&isnumber=7159118 Bastan, M; Cam, H; Gudukbay, U; Ulusoy, O, "An MPEG-7 Compatible Video Retrieval System with Integrated Support for Complex Multimodal Queries," in MultiMedia, IEEE , vol.PP, no.99, pp.1-1. 
doi: 10.1109/MMUL.2009.74 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5306056&isnumber=5255202 Fang Liu; Yi Wan, "Improving the video shot boundary detection using the HSV color space and image subsampling," in Advanced Computational Intelligence (ICACI), 2015 Seventh International Conference on , vol., no., pp.351-354, 27-29 March 2015 doi: 10.1109/ICACI.2015.7184728 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7184728&isnumber=7184712 Ville Viitaniemi, Mats Sjöberg, Markus Koskela, Satoru Ishikawa and Jorma Laaksonen, Chapter 12 - Advances in visual concept detection: Ten years of TRECVID, In Advances in Independent Component Analysis and Learning Machines, edited by Ella Bingham, Samuel Kaski, Jorma Laaksonen and Jouko Lampinen, Academic Press, 2015, Pages 249-278, ISBN 9780128028063, http://dx.doi.org/10.1016/B978-0-12-802806-3.00012-9. http://www.sciencedirect.com/science/article/pii/B9780128028063000129 Ouali, C.; Dumouchel, P.; Gupta, V., "GPU implementation of an audio fingerprints similarity search algorithm," in Content-Based Multimedia Indexing (CBMI), 2015 13th International Workshop on , vol., no., pp.1-6, 10-12 June 2015 doi: 10.1109/CBMI.2015.7153625 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7153625&isnumber=7153597 Safadi, B.; Quenot, G., "A factorized model for multiple SVM and multi-label classification for large scale multimedia indexing," in Content-Based Multimedia Indexing (CBMI), 2015 13th International Workshop on , vol., no., pp.1-6, 10-12 June 2015 doi: 10.1109/CBMI.2015.7153610 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7153610&isnumber=7153597 Xintao Hu; Lei Guo; Junwei Han; Tianming Liu, "Decoding Semantics Categorization during Natural Viewing of Video Streams," in Autonomous Mental Development, IEEE Transactions on , vol.7, no.3, pp.201-210, Sept. 
2015 doi: 10.1109/TAMD.2015.2415413 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7070746&isnumber=7317835 Kaavya, S.; LakshmiPriya, G.G., "Multimedia Indexing and Retrieval: Recent research work and their challenges," in Signal Processing, Communication and Networking (ICSCN), 2015 3rd International Conference on , vol., no., pp.1-5, 26-28 March 2015 doi: 10.1109/ICSCN.2015.7219851 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7219851&isnumber=7219823 Yan Yan; Yi Yang; Deyu Meng; Gaowen Liu; Wei Tong; Hauptmann, A.G.; Sebe, N., "Event Oriented Dictionary Learning for Complex Event Detection," in Image Processing, IEEE Transactions on , vol.24, no.6, pp.1867-1878, June 2015 doi: 10.1109/TIP.2015.2413294 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7061499&isnumber=7065385 Shinde, S.R.; Chiddarwar, G.G., "Recent advances in content based video copy detection," in Pervasive Computing (ICPC), 2015 International Conference on , vol., no., pp.1-6, 8-10 Jan. 2015. 
doi: 10.1109/PERVASIVE.2015.7087093 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7087093&isnumber=7086957 Vrochidis, S.; Kompatsiaris, I.; Casamayor, G.; Arapakis, I.; Busch, R.; Alexiev, V.; Jamin, E.; Jugov, M.; Heise, N.; Forrellat, T.; Liparas, D.; Wanner, L.; Miliaraki, I.; Aleksic, V.; Simov, K.; Mas Soro, A.; Eckhoff, M.; Wagner, T.; Puigbo, M., "MULTISENSOR: Development of multimedia content integration technologies for journalism, media monitoring and international exporting decision support," in Multimedia & Expo Workshops (ICMEW), 2015 IEEE International Conference on , vol., no., pp.1-4, June 29 2015-July 3 2015 doi: 10.1109/ICMEW.2015.7169818 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7169818&isnumber=7169738 Tsai, T.J.; Friedland, G.; Anguera, X., "An information-theoretic metric of fingerprint effectiveness," in Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on , vol., no., pp.340-344, 19-24 April 2015 doi: 10.1109/ICASSP.2015.7177987 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7177987&isnumber=7177909 Liu, X.; Lin, L.; Jin, H., "Contextualized Trajectory Parsing with Spatio-Temporal Graph," in Pattern Analysis and Machine Intelligence, IEEE Transactions on , vol.PP, no.99, pp.1-1. doi: 10.1109/TPAMI.2013.84 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6517196&isnumber=4359286 Xianglong Liu; Yadong Mu; Danchen Zhang; Bo Lang; Xuelong Li, "Large-Scale Unsupervised Hashing with Shared Structure Learning," in Cybernetics, IEEE Transactions on , vol.45, no.9, pp.1811-1822, Sept. 
2015 doi: 10.1109/TCYB.2014.2360856 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6960876&isnumber=7203181 Xintao Hu; Cheng Lv; Gong Cheng; Jinglei Lv; Lei Guo; Junwei Han; Tianming Liu, "Sparsity-Constrained fMRI Decoding of Visual Saliency in Naturalistic Video Streams," in Autonomous Mental Development, IEEE Transactions on , vol.7, no.2, pp.65-75, June 2015 doi: 10.1109/TAMD.2015.2409835 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7056490&isnumber=7121039 Hajimirsadeghi, H.; Wang Yan; Vahdat, A.; Mori, G., "Visual recognition by counting instances: A multi-instance cardinality potential kernel," in Computer Vision and Pattern Recognition (CVPR), 2015 IEEE Conference on , vol., no., pp.2596-2605, 7-12 June 2015 doi: 10.1109/CVPR.2015.7298875 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7298875&isnumber=7298593 Mihir Jain, Jan C. van Gemert, Thomas Mensink, and Cees G. M. Snoek, "Objects2action: Classifying and localizing actions without any video example," in Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile, 2015. Markus Nagel, Thomas Mensink, and Cees G. M. Snoek, "Event Fisher Vectors: Robust Encoding Visual Diversity of Visual Streams," in Proceedings of the British Machine Vision Conference, Swansea, UK, 2015. Amirhossein Habibian, Thomas Mensink, and Cees G. M. Snoek, "Discovering Semantic Vocabularies for Cross-Media Retrieval," in Proceedings of the ACM International Conference on Multimedia Retrieval, Shanghai, China, 2015. Masoud Mazloom, Amirhossein Habibian, Dong Liu, Cees G. M. Snoek, and Shih-Fu Chang, "Encoding Concept Prototypes for Video Event Detection and Summarization," in Proceedings of the ACM International Conference on Multimedia Retrieval, Shanghai, China, 2015. Pascal Mettes, Jan C. van Gemert, Spencer Cappallo, Thomas Mensink, and Cees G. M. 
Snoek, "Bag-of-Fragments: Selecting and encoding video fragments for event detection and recounting," in Proceedings of the ACM International Conference on Multimedia Retrieval, Shanghai, China, 2015. Svetlana Kordumova, Xirong Li, and Cees G. M. Snoek, "Best Practices for Learning Video Concept Detectors from Social Media Examples," Multimedia Tools and Applications, vol. 74, iss. 4, pp. 1291-1315, 2015. --------------------------------------------------------------------- 2014 (87) --------------------------------------------------------------------- Amjad Altadmri and Amr Ahmed. 2014. A framework for automatic semantic video annotation. Multimedia Tools Appl. 72, 2 (September 2014), 1167-1191. DOI=10.1007/s11042-013-1363-6 http://dx.doi.org/10.1007/s11042-013-1363-6 Amid, E.; Mesaros, A.; Palomaki, K.J.; Laaksonen, J.; Kurimo, M., "Unsupervised feature extraction for multimedia event detection and ranking using audio content," Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on , vol., no., pp.5939,5943, 4-9 May 2014 doi: 10.1109/ICASSP.2014.6854743 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6854743&isnumber=6853544 Bhattacharya, Subhabrata; Kalayeh, Mahdi M.; Sukthankar, Rahul; Shah, Mubarak, "Recognition of Complex Events: Exploiting Temporal Dynamics between Underlying Concepts," Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on , vol., no., pp.2243,2250, 23-28 June 2014 doi: 10.1109/CVPR.2014.287, URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6909684&isnumber=6909393 Bhattacharya, S.; Mehran, R.; Sukthankar, R.; Shah, M., "Classification of Cinematographic Shots Using Lie Algebra and its Application to Complex Event Recognition," Multimedia, IEEE Transactions on , vol.16, no.3, pp.686,696, April 2014 doi: 10.1109/TMM.2014.2300833, URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6714427&isnumber=6766693 Subhabrata Bhattacharya, Felix X. 
Yu, and Shih-Fu Chang. 2014. Minimally Needed Evidence for Complex Event Recognition in Unconstrained Videos. In Proceedings of International Conference on Multimedia Retrieval (ICMR '14). ACM, New York, NY, USA, , Pages 105 , 8 pages. DOI=10.1145/2578726.2578740 http://doi.acm.org/10.1145/2578726.2578740 Ethem F. Can and R. Manmatha. 2014. Modeling Concept Dependencies for Event Detection. In Proceedings of International Conference on Multimedia Retrieval (ICMR '14). ACM, New York, NY, USA, , Pages 289 , 8 pages. DOI=10.1145/2578726.2578763 http://doi.acm.org/10.1145/2578726.2578763 Ning Chen; Jun Zhu; Fuchun Sun; Bo Zhang, "Learning Harmonium Models With Infinite Latent Features," Neural Networks and Learning Systems, IEEE Transactions on , vol.25, no.3, pp.520,532, March 2014 doi: 10.1109/TNNLS.2013.2276398, URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6741394&isnumber=6740874 Jiawei Chen, Yin Cui, Guangnan Ye, Dong Liu, and Shih-Fu Chang. 2014. Event-Driven Semantic Concept Discovery by Exploiting Weakly Tagged Internet Images. In Proceedings of International Conference on Multimedia Retrieval (ICMR '14). ACM, New York, NY, USA, , Pages 1 , 8 pages. DOI=10.1145/2578726.2578729 http://doi.acm.org/10.1145/2578726.2578729 Yu Cheng; Brown, L.; Fan, Q.; Feris, R.; Pankanti, S.; Tao Zhang, "RiskWheel: Interactive visual analytics for surveillance event detection," Multimedia and Expo (ICME), 2014 IEEE International Conference on , vol., no., pp.1,6, 14-18 July 2014 doi: 10.1109/ICME.2014.6890286, URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6890286&isnumber=6890121 Ozgun Cirakman, Bilge Gunsel, Neslihan Serap Sengor, and Sezer Kutluk. 2014. Content-based copy detection by a subspace learning based video fingerprinting scheme. Multimedia Tools Appl. 71, 3 (August 2014), 1381-1409. 
DOI=10.1007/s11042-012-1269-8 http://dx.doi.org/10.1007/s11042-012-1269-8 Dang, C.T.; Radha, H., "Heterogeneity Image Patch Index and Its Application to Consumer Video Summarization," Image Processing, IEEE Transactions on , vol.23, no.6, pp.2704,2718, June 2014 doi: 10.1109/TIP.2014.2320814 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6807803&isnumber=6807541 Dehghan, Afshin; Idrees, Haroon; Shah, Mubarak, "Improving Semantic Concept Detection through the Dictionary of Visually-Distinct Elements," Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on , vol., no., pp.2585,2592, 23-28 June 2014 doi: 10.1109/CVPR.2014.331 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6909727&isnumber=6909393 David Etter and Carlotta Domeniconi. 2014. Semi-Supervised Rank Learning for Multimedia Known-Item Search. In Proceedings of International Conference on Multimedia Retrieval (ICMR '14). ACM, New York, NY, USA, , Pages 257 , 8 pages. DOI=10.1145/2578726.2578759 http://doi.acm.org/10.1145/2578726.2578759 Guangyu Gao and Huadong Ma. 2014. To accelerate shot boundary detection by reducing detection region and scope. Multimedia Tools Appl. 71, 3 (August 2014), 1749-1770. DOI=10.1007/s11042-012-1301-z http://dx.doi.org/10.1007/s11042-012-1301-z Zan Gao, Long-Fei Zhang, Ming-Yu Chen, Alexander Hauptmann, Hua Zhang, and An-Ni Cai. 2014. Enhanced and hierarchical structure algorithm for data imbalance problem in semantic extraction under massive video dataset. Multimedia Tools Appl. 68, 3 (February 2014), 641-657. DOI=10.1007/s11042-012-1071-7 http://dx.doi.org/10.1007/s11042-012-1071-7 Nikolaos Gkalelis and Vasileios Mezaris. 2014. Video event detection using generalized subclass discriminant analysis and linear support vector machines. In Proceedings of International Conference on Multimedia Retrieval (ICMR '14). ACM, New York, NY, USA, , Pages 25 , 8 pages. 
DOI=10.1145/2578726.2578745 http://doi.acm.org/10.1145/2578726.2578745 Amirhossein Habibian, Masoud Mazloom, and Cees G. M. Snoek. 2014. On-the-Fly Video Event Search by Semantic Signatures. In Proceedings of International Conference on Multimedia Retrieval (ICMR '14). ACM, New York, NY, USA, , Pages 518 , 3 pages. DOI=10.1145/2578726.2582615 http://doi.acm.org/10.1145/2578726.2582615 Amirhossein Habibian, Thomas Mensink, and Cees G. M. Snoek. 2014. Composite Concept Discovery for Zero-Shot Video Event Detection. In Proceedings of International Conference on Multimedia Retrieval (ICMR '14). ACM, New York, NY, USA, , Pages 17 , 8 pages. DOI=10.1145/2578726.2578746 http://doi.acm.org/10.1145/2578726.2578746 Amirhossein Habibian, Thomas Mensink, and Cees G. M. Snoek, "VideoStory: A New Multimedia Embedding for Few-Example Recognition and Translation of Events," in Proceedings of the ACM International Conference on Multimedia, Orlando, Florida, USA, 2014, pp. 17-26. Amirhossein Habibian and Cees G. M. Snoek, "Recommendations for Recognizing Video Events by Concept Vocabularies," Computer Vision and Image Understanding, vol. 124, pp. 110-122, 2014. Amirhossein Habibian and Cees G. M. Snoek. 2014. Stop-Frame Removal Improves Web Video Classification. In Proceedings of International Conference on Multimedia Retrieval (ICMR '14). ACM, New York, NY, USA, , Pages 499 , 4 pages. DOI=10.1145/2578726.2578803 http://doi.acm.org/10.1145/2578726.2578803 Abdelkader Hamadi, Philippe Mulhem, and Georges Quénot. 2014. Infrequent concept pairs detection in multimedia documents. In Proceedings of International Conference on Multimedia Retrieval (ICMR '14). ACM, New York, NY, USA, , Pages 435 , 4 pages. DOI=10.1145/2578726.2578787 http://doi.acm.org/10.1145/2578726.2578787 Junwei Han, Xiang Ji, Xintao Hu, Jungong Han, and Tianming Liu. 2014. Clustering and retrieval of video shots based on natural stimulus fMRI. Neurocomput. 144 (November 2014), 128-137. 
DOI=10.1016/j.neucom.2013.11.052 http://dx.doi.org/10.1016/j.neucom.2013.11.052 Junwei Han, Kaiming Li, Ling Shao, Xintao Hu, Sheng He, Lei Guo, Jungong Han, and Tianming Liu. 2014. Video abstraction based on fMRI-driven visual attention model. Inf. Sci. 281 (October 2014), 781-796. DOI=10.1016/j.ins.2013.12.039 http://dx.doi.org/10.1016/j.ins.2013.12.039 Nakamasa Inoue and Koichi Shinoda, "n-Gram Models for Video Semantic Indexing," Proc. ACM Multimedia, pp. 777-780, 2014. Jain, A.; Xujun Peng; Xiaodan Zhuang; Natarajan, P.; Huaigu Cao, "Text detection and recognition in natural scenes and consumer videos," Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on , vol., no., pp.1245,1249, 4-9 May 2014 doi: 10.1109/ICASSP.2014.6853796, URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6853796&isnumber=6853544 Lu Jiang, Teruko Mitamura, Shoou-I Yu, and Alexander G. Hauptmann. 2014. Zero-Example Event Search using MultiModal Pseudo Relevance Feedback. In Proceedings of International Conference on Multimedia Retrieval (ICMR '14). ACM, New York, NY, USA, , Pages 297 , 8 pages. DOI=10.1145/2578726.2578764 http://doi.acm.org/10.1145/2578726.2578764 Lu Jiang, Wei Tong, Deyu Meng, and Alexander G. Hauptmann. 2014. Towards Efficient Learning of Optimal Spatial Bag-of-Words Representations. In Proceedings of International Conference on Multimedia Retrieval (ICMR '14). ACM, New York, NY, USA, , Pages 121 , 8 pages. DOI=10.1145/2578726.2578739 http://doi.acm.org/10.1145/2578726.2578739 I-Hong Jhuo, Guangnan Ye, Shenghua Gao, Dong Liu, Yu-Gang Jiang, D. T. Lee, and Shih-Fu Chang. 2014. Discovering joint audio-visual codewords for video event detection. Mach. Vision Appl. 25, 1 (January 2014), 33-47. DOI=10.1007/s00138-013-0567-0 http://dx.doi.org/10.1007/s00138-013-0567-0 Ilseo Kim and Chin-Hui Lee. 2014. An Efficient Gradient-based Approach to Optimizing Average Precision Through Maximal Figure-of-Merit Learning. J. Signal Process. 
Syst. 74, 3 (March 2014), 285-295. DOI=10.1007/s11265-013-0748-0 http://dx.doi.org/10.1007/s11265-013-0748-0 Semin Kim, Jae Young Choi, Seungwan Han, and Yong Man Ro. 2014. Adaptive weighted fusion with new spatial and temporal fingerprints for improved video copy detection. Image Commun. 29, 7 (August 2014), 788-806. DOI=10.1016/j.image.2014.05.002 http://dx.doi.org/10.1016/j.image.2014.05.002 Svetlana Kordumova, Christoph Kofler, Dennis C. Koelma, Bouke Huurnink, Bauke Freiburg, Joris Kleinveld, Manuel van Rijn, Marco van Deursen, Martha Larson, and Cees G. M. Snoek. 2014. SocialZap: Catch-up on Interesting Television Fragments Discovered from Social Media. In Proceedings of International Conference on Multimedia Retrieval (ICMR '14). ACM, New York, NY, USA, , Pages 538 , 3 pages. DOI=10.1145/2578726.2582622 http://doi.acm.org/10.1145/2578726.2582622 Zhen-Zhong Lan, Lei Bao, Shoou-I Yu, Wei Liu, and Alexander G. Hauptmann. 2014. Multimedia classification and event detection using double fusion. Multimedia Tools Appl. 71, 1 (July 2014), 333-347. DOI=10.1007/s11042-013-1391-2 http://dx.doi.org/10.1007/s11042-013-1391-2 Gaowen Liu, Yan Yan, Chenqiang Gao, Wei Tong, Alexander Hauptmann, and Nicu Sebe. 2014. The Mystery of Faces: Investigating Face Contribution for Multimedia Event Detection. In Proceedings of International Conference on Multimedia Retrieval (ICMR '14). ACM, New York, NY, USA, , Pages 467 , 4 pages. DOI=10.1145/2578726.2578795 http://doi.acm.org/10.1145/2578726.2578795 Tianming Liu; Xintao Hu; Xiaojin Li; Mo Chen; Junwei Han; Lei Guo, "Merging Neuroimaging and Multimedia: Methods, Opportunities, and Challenges," Human-Machine Systems, IEEE Transactions on , vol.44, no.2, pp.270,280, April 2014 doi: 10.1109/THMS.2013.2296871, URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6742574&isnumber=6766254 Xianglong Liu, Junfeng He, and Bo Lang. 2014. Multiple feature kernel hashing for large-scale visual search. Pattern Recogn. 
47, 2 (February 2014), 748-757. DOI=10.1016/j.patcog.2013.08.022 http://dx.doi.org/10.1016/j.patcog.2013.08.022 Zhigang Ma; Yi Yang; Sebe, N.; Hauptmann, A.G., "Knowledge Adaptation with Partially Shared Features for Event Detection Using Few Exemplars," Pattern Analysis and Machine Intelligence, IEEE Transactions on , vol.36, no.9, pp.1789,1802, Sept. 2014 doi: 10.1109/TPAMI.2014.2306419 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6740842&isnumber=6868318 Masoud Mazloom, Efstratios Gavves, and Cees G. M. Snoek, "Conceptlets: Selective Semantics for Classifying Video Events," IEEE Transactions on Multimedia, vol. 16, iss. 8, pp. 2214-2228, 2014. Masoud Mazloom, Xirong Li, and Cees G. M. Snoek. 2014. Few-Example Video Event Retrieval using Tag Propagation. In Proceedings of International Conference on Multimedia Retrieval (ICMR '14). ACM, New York, NY, USA, , Pages 459 , 4 pages. DOI=10.1145/2578726.2578793 http://doi.acm.org/10.1145/2578726.2578793 Scott McCloskey and Jingchen Liu. 2014. Metadata-Weighted Score Fusion for Multimedia Event Detection. In Proceedings of the 2014 Canadian Conference on Computer and Robot Vision (CRV '14). IEEE Computer Society, Washington, DC, USA, 299-305. DOI=10.1109/CRV.2014.47 http://dx.doi.org/10.1109/CRV.2014.47 Tao Mei, Yong Rui, Shipeng Li, and Qi Tian. 2014. Multimedia search reranking: A literature survey. ACM Comput. Surv. 46, 3, Article 38 (January 2014), 38 pages. DOI=10.1145/2536798 http://doi.acm.org/10.1145/2536798 Tao Meng, Yang Liu, Mei-Ling Shyu, Yilin Yan, and Chi-Min Shu. 2014. Enhancing Multimedia Semantic Concept Mining and Retrieval by Incorporating Negative Correlations. In Proceedings of the 2014 IEEE International Conference on Semantic Computing (ICSC '14). IEEE Computer Society, Washington, DC, USA, 28-35. 
DOI=10.1109/ICSC.2014.30 http://dx.doi.org/10.1109/ICSC.2014.30 Merialdo, B.; Niaz, U., "Uploader models for video concept detection," Content-Based Multimedia Indexing (CBMI), 2014 12th International Workshop on , vol., no., pp.1,4, 18-20 June 2014 doi: 10.1109/CBMI.2014.6849847, URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6849847&isnumber=6849811 Metze, F.; Rawat, S.; Yipei Wang, "Improved audio features for large-scale multimedia event detection," Multimedia and Expo (ICME), 2014 IEEE International Conference on , vol., no., pp.1,6, 14-18 July 2014 doi: 10.1109/ICME.2014.6890234 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6890234&isnumber=6890121 Murata, M.; Nagano, H.; Mukai, R.; Kashino, K.; Satoh, S., "BM25 With Exponential IDF for Instance Search," Multimedia, IEEE Transactions on , vol.16, no.6, pp.1690,1699, Oct. 2014 doi: 10.1109/TMM.2014.2323945 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6820744&isnumber=6898894 Gregory K. Myers, Ramesh Nallapati, Julien Hout, Stephanie Pancoast, Ramakant Nevatia, Chen Sun, Amirhossein Habibian, Dennis C. Koelma, Koen E. Sande, Arnold W. Smeulders, and Cees G. Snoek. 2014. Evaluating multimedia features and fusion for example-based event detection. Mach. Vision Appl. 25, 1 (January 2014), 17-32. DOI=10.1007/s00138-013-0527-8 http://dx.doi.org/10.1007/s00138-013-0527-8 Niaz, U.; Merialdo, B., "Improving video concept detection through label space partitioning," Multimedia and Expo (ICME), 2014 IEEE International Conference on , vol., no., pp.1,6, 14-18 July 2014 doi: 10.1109/ICME.2014.6890258, URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6890258&isnumber=6890121 Usman Niaz and Bernard Merialdo. 2014. Selective Multi-cotraining for Video Concept Detection. In Proceedings of International Conference on Multimedia Retrieval (ICMR '14). ACM, New York, NY, USA, , Pages 443 , 4 pages. 
DOI=10.1145/2578726.2578789 http://doi.acm.org/10.1145/2578726.2578789 Sangmin Oh, Scott Mccloskey, Ilseo Kim, Arash Vahdat, Kevin J. Cannons, Hossein Hajimirsadeghi, Greg Mori, A. G. Perera, Megha Pandey, and Jason J. Corso. 2014. Multimedia event detection with multimodal feature fusion and temporal concept localization. Mach. Vision Appl. 25, 1 (January 2014), 49-69. DOI=10.1007/s00138-013-0525-x http://dx.doi.org/10.1007/s00138-013-0525-x Ouali, C.; Dumouchel, P.; Gupta, V., "A robust audio fingerprinting method for content-based copy detection," Content-Based Multimedia Indexing (CBMI), 2014 12th International Workshop on , vol., no., pp.1,6, 18-20 June 2014 doi: 10.1109/CBMI.2014.6849814, URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6849814&isnumber=6849811 Sang Phan, Thanh Duc Ngo, Vu Lam, Son Tran, Duy-Dinh Le, Duc Anh Duong, and Shin'ichi Satoh. 2014. Multimedia Event Detection Using Segment-Based Approach for Motion Feature. J. Signal Process. Syst. 74, 1 (January 2014), 19-31. DOI=10.1007/s11265-013-0825-4 http://dx.doi.org/10.1007/s11265-013-0825-4 Trung Quy Phan; Shivakumara, P.; Bhowmick, S.; Shimiao Li; Chew Lim Tan; Pal, U., "Semiautomatic Ground Truth Generation for Text Detection and Recognition in Video Images," Circuits and Systems for Video Technology, IEEE Transactions on , vol.24, no.8, pp.1277,1287, Aug. 2014 doi: 10.1109/TCSVT.2014.2305515, URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6739120&isnumber=6869080 G.G. Lakshmi Priya and S. Domnic. 2014. Shot boundary-based keyframe extraction for video summarisation. Int. J. Comput. Intell. Stud. 3, 2/3 (June 2014), 157-175. 
DOI=10.1504/IJCISTUDIES.2014.062728 http://dx.doi.org/10.1504/IJCISTUDIES.2014.062728 Mengren Qian; Luntian Mou; Jia Li; Yonghong Tian, "Video picture-in-picture detection using spatio-temporal slicing," Multimedia and Expo Workshops (ICMEW), 2014 IEEE International Conference on , vol., no., pp.1,6, 14-18 July 2014 doi: 10.1109/ICMEW.2014.6890580 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6890580&isnumber=6890528 Xueming Qian, Danping Guo, Xingsong Hou, Zhi Li, Huan Wang, Guizhong Liu, and Zhe Wang. 2014. HWVP: hierarchical wavelet packet descriptors and their applications in scene categorization and semantic concept retrieval. Multimedia Tools Appl. 69, 3 (April 2014), 897-920. DOI=10.1007/s11042-012-1151-8 http://dx.doi.org/10.1007/s11042-012-1151-8 J. Rest, F. A. Grootjen, M. Grootjen, R. Wijn, O. Aarts, M. L. Roelofs, G. J. Burghouts, H. Bouma, L. Alic, and W. Kraaij. 2014. Requirements for multimedia metadata schemes in surveillance applications for security. Multimedia Tools Appl. 70, 1 (May 2014), 573-598. DOI=10.1007/s11042-013-1575-9 http://dx.doi.org/10.1007/s11042-013-1575-9 C. Okan Sakar, Olcay Kursun, and Fikret Gurgen. 2014. Ensemble canonical correlation analysis. Applied Intelligence 40, 2 (March 2014), 291-304. DOI=10.1007/s10489-013-0464-2 http://dx.doi.org/10.1007/s10489-013-0464-2 Yuan Shen; Zhenjiang Miao, "Multihuman Tracking Based on a Spatial–Temporal Appearance Match," Circuits and Systems for Video Technology, IEEE Transactions on , vol.24, no.3, pp.361,373, March 2014 doi: 10.1109/TCSVT.2013.2280073, URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6587854&isnumber=6754192 Kimiaki Shirahama, Marcin Grzegorzek, and Kuniaki Uehara. 2014. Multimedia Event Detection Using Hidden Conditional Random Fields. In Proceedings of International Conference on Multimedia Retrieval (ICMR '14). ACM, New York, NY, USA, , Pages 9 , 8 pages. 
DOI=10.1145/2578726.2578742 http://doi.acm.org/10.1145/2578726.2578742 Kimiaki Shirahama, Yuta Matsuoka, and Kuniaki Uehara. 2014. Hybrid negative example selection using visual and conceptual features. Multimedia Tools Appl. 71, 3 (August 2014), 967-989. DOI=10.1007/s11042-011-0886-y http://dx.doi.org/10.1007/s11042-011-0886-y Sidiropoulos, P.; Mezaris, V.; Kompatsiaris, I., "Video Tomographs and a Base Detector Selection Strategy for Improving Large-Scale Video Concept Detection," Circuits and Systems for Video Technology, IEEE Transactions on , vol.24, no.7, pp.1251,1264, July 2014 doi: 10.1109/TCSVT.2014.2302554 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6727470&isnumber=6846390 Sjoberg, M.; Laaksonen, J., "Using semantic features to improve large-scale visual concept detection," Content-Based Multimedia Indexing (CBMI), 2014 12th International Workshop on , vol., no., pp.1,6, 18-20 June 2014 doi: 10.1109/CBMI.2014.6849817, URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6849817&isnumber=6849811 Sabin Tiberius Strat, Alexandre Benoit, Patrick Lambert, and Alice Caplier. 2014. Retina enhanced SURF descriptors for spatio-temporal concept detection. Multimedia Tools Appl. 69, 2 (March 2014), 443-469. DOI=10.1007/s11042-012-1280-0 http://dx.doi.org/10.1007/s11042-012-1280-0 Sun, Chen; Nevatia, Ram, "DISCOVER: Discovering Important Segments for Classification of Video Events and Recounting," Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on , vol., no., pp.2569,2576, 23-28 June 2014 doi: 10.1109/CVPR.2014.329 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6909725&isnumber=6909393 Chen Sun, Brian Burns, Ram Nevatia, Cees Snoek, Bob Bolles, Greg Myers, Wen Wang, and Eric Yeh. 2014. ISOMER: Informative Segment Observations for Multimedia Event Recounting. In Proceedings of International Conference on Multimedia Retrieval (ICMR '14). ACM, New York, NY, USA, , Pages 241 , 8 pages. 
DOI=10.1145/2578726.2578757 http://doi.acm.org/10.1145/2578726.2578757 Fuming Sun; Jinhui Tang; Haojie Li; Guo-Jun Qi; Huang, T.S., "Multi-Label Image Categorization With Sparse Factor Representation," Image Processing, IEEE Transactions on , vol.23, no.3, pp.1028,1037, March 2014 doi: 10.1109/TIP.2014.2298978, URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6705666&isnumber=6717077 Jinhui Tang and Xian-Sheng Hua. 2014. Typicality ranking: beyond accuracy for video semantic annotation. Multimedia Tools Appl. 70, 2 (May 2014), 647-660. DOI=10.1007/s11042-011-0892-0 http://dx.doi.org/10.1007/s11042-011-0892-0 Ran Tao, Efstratios Gavves, Cees G. M. Snoek, and Arnold W. M. Smeulders, "Locality in Generic Instance Search from One Example," in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, Ohio, USA, 2014. Wei Tong, Yi Yang, Lu Jiang, Shoou-I Yu, Zhenzhong Lan, Zhigang Ma, Waito Sze, Ehsan Younessian, and Alexander G. Hauptmann. 2014. E-LAMP: integration of innovative ideas for multimedia event detection. Mach. Vision Appl. 25, 1 (January 2014), 5-15. DOI=10.1007/s00138-013-0529-6 http://dx.doi.org/10.1007/s00138-013-0529-6 Trichet, R.; Nevatia, R., "Video segmentation and feature co-occurrences for activity classification," Applications of Computer Vision (WACV), 2014 IEEE Winter Conference on , vol., no., pp.385,392, 24-26 March 2014 doi: 10.1109/WACV.2014.6836074 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6836074&isnumber=6835728 Chun-Yu Tsai, Michelle L. Alexander, Nnenna Okwara, and John R. Kender. 2014. Highly Efficient Multimedia Event Recounting from User Semantic Preferences. In Proceedings of International Conference on Multimedia Retrieval (ICMR '14). ACM, New York, NY, USA, , Pages 419 , 4 pages. 
DOI=10.1145/2578726.2578783 http://doi.acm.org/10.1145/2578726.2578783 van Hout, J.; Yeh, E.; Koelma, D.C.; Snoek, C.G.M.; Chen Sun; Nevatia, R.; Wong, J.; Myers, G.K., "Late fusion and calibration for multimedia event detection using few examples," Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on , vol., no., pp.4598,4602, 4-9 May 2014 doi: 10.1109/ICASSP.2014.6854473 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6854473&isnumber=6853544 Feng Wang; Zhanhu Sun; Yu-Gang Jiang; Chong-Wah Ngo, "Video Event Detection Using Motion Relativity and Feature Selection," Multimedia, IEEE Transactions on , vol.16, no.5, pp.1303,1315, Aug. 2014 doi: 10.1109/TMM.2014.2315780, URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6783709&isnumber=6856249 Haidong Wang and Guizhong Liu. 2014. Priority and delay aware packet management framework for real-time video transport over 802.11e WLANs. Multimedia Tools Appl. 69, 3 (April 2014), 621-641. DOI=10.1007/s11042-012-1131-z http://dx.doi.org/10.1007/s11042-012-1131-z Mei Wang, Xiaoling Xia, Jiajin Le, and Xiangdong Zhou. 2014. Effective automatic image annotation via integrated discriminative and generative models. Inf. Sci. 262 (March 2014), 159-171. 
DOI=10.1016/j.ins.2013.11.005 http://dx.doi.org/10.1016/j.ins.2013.11.005 Wu, Shuang; Bondugula, Sravanthi; Luisier, Florian; Zhuang, Xiaodan; Natarajan, Pradeep, "Zero-Shot Event Detection Using Multi-modal Fusion of Weakly Supervised Concepts," Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on , vol., no., pp.2665,2672, 23-28 June 2014 doi: 10.1109/CVPR.2014.341 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6909737&isnumber=6909393 Shuang Wu; Xiaodan Zhuang; Natarajan, P., "Effective representations for leveraging language content in multimedia event detection," Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on , vol., no., pp.7123,7127, 4-9 May 2014 doi: 10.1109/ICASSP.2014.6854982, URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6854982&isnumber=6853544 Hongtao Xie; Yongdong Zhang; Jianlong Tan; Li Guo; Jintao Li, "Contextual Query Expansion for Image Retrieval," Multimedia, IEEE Transactions on , vol.16, no.4, pp.1104,1114, June 2014 doi: 10.1109/TMM.2014.2305909, URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6739088&isnumber=6814813 Xu, Zhongwen; Tsang, Ivor W.; Yang, Yi; Ma, Zhigang; Hauptmann, Alexander G., "Event Detection Using Multi-level Relevance Labels and Multiple Features," Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on , vol., no., pp.97,104, 23-28 June 2014 doi: 10.1109/CVPR.2014.20, URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6909414&isnumber=6909393 Bo Yang and Ramakant Nevatia. 2014. Multi-Target Tracking by Online Learning a CRF Model of Appearance and Motion Patterns. Int. J. Comput. Vision 107, 2 (April 2014), 203-217. DOI=10.1007/s11263-013-0666-4 http://dx.doi.org/10.1007/s11263-013-0666-4 Haojin Yang, Bernhard Quehl, and Harald Sack. 2014. A framework for improved video text detection and recognition. Multimedia Tools Appl. 69, 1 (March 2014), 217-245. 
DOI=10.1007/s11042-012-1250-6 http://dx.doi.org/10.1007/s11042-012-1250-6 Turgay Yilmaz, Adnan Yazici, and Masaru Kitsuregawa. 2014. RELIEF-MM: effective modality weighting for multimedia information retrieval. Multimedia Syst. 20, 4 (July 2014), 389-413. DOI=10.1007/s00530-014-0360-6 http://dx.doi.org/10.1007/s00530-014-0360-6 Ruijie Zhang, Fushan Wei, and Bicheng Li. 2014. E2LSH based multiple kernel approach for object detection. Neurocomput. 124 (January 2014), 105-110. DOI=10.1016/j.neucom.2013.07.027 http://dx.doi.org/10.1016/j.neucom.2013.07.027 Xianguo Zhang; Tiejun Huang; Yonghong Tian; Wen Gao, "Background-Modeling-Based Adaptive Prediction for Surveillance Video Coding," Image Processing, IEEE Transactions on , vol.23, no.2, pp.769,784, Feb. 2014 doi: 10.1109/TIP.2013.2294549 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6680670&isnumber=6685907 Cencen Zhong and Zhenjiang Miao. 2014. Graph regularized GM-pLSA and its applications to video content analysis. Multimedia Syst. 20, 4 (July 2014), 429-445. DOI=10.1007/s00530-014-0378-9 http://dx.doi.org/10.1007/s00530-014-0378-9 Jun Zhu, Ning Chen, and Eric P. Xing. 2014. Bayesian inference with posterior regularization and applications to infinite latent SVMs. J. Mach. Learn. Res. 15, 1 (January 2014), 1799-1847. Liang Zhuolin, Nakamasa Inoue, and Koichi Shinoda, "Velocity Pyramid for Multimedia Event Detection," In Proc. MMM, 2014. --------------------------------------------------------------------- 2013 (94) --------------------------------------------------------------------- Robin Aly, Aiden Doherty, Djoerd Hiemstra, Franciska Jong, and Alan F. Smeaton. 2013. The uncertain representation ranking framework for concept-based video retrieval. Inf. Retr. 16, 5 (October 2013), 557-583. 
DOI=10.1007/s10791-012-9207-y http://dx.doi.org/10.1007/s10791-012-9207-y Arpit, D.; Shuang Wu; Natarajan, P.; Prasad, R.; Natarajan, P., "Ridge Regression based classifiers for large scale class imbalanced datasets," Applications of Computer Vision (WACV), 2013 IEEE Workshop on , vol., no., pp.267,274, 15-17 Jan. 2013 doi: 10.1109/WACV.2013.6475028 Ilaria Bartolini, Marco Patella, and Corrado Romani. 2013. SHIATSU: tagging and retrieving videos without worries. Multimedia Tools Appl. 63, 2 (March 2013), 357-385. DOI=10.1007/s11042-011-0948-1 http://dx.doi.org/10.1007/s11042-011-0948-1 Subhabrata Bhattacharya. 2013. Recognition of complex events in open-source web-scale videos: a bottom up approach. In Proceedings of the 21st ACM international conference on Multimedia (MM '13). ACM, New York, NY, USA, 1035-1038. DOI=10.1145/2502081.2502210 http://doi.acm.org/10.1145/2502081.2502210 Qiang Chen, Yang Cai, Lisa Brown, Ankur Datta, Quanfu Fan, Rogerio Feris, Shuicheng Yan, Alex Hauptmann, and Sharath Pankanti. 2013. Spatio-temporal fisher vector coding for surveillance event detection. In Proceedings of the 21st ACM international conference on Multimedia (MM '13). ACM, New York, NY, USA, 589-592. DOI=10.1145/2502081.2502155 http://doi.acm.org/10.1145/2502081.2502155 Michael G. Christel. 2013. Multimedia: from information source to components of transformational games. In Proceedings of the 19th Brazilian symposium on Multimedia and the web (WebMedia '13). ACM, New York, NY, USA, 1-2. DOI=10.1145/2526188.2528279 http://doi.acm.org/10.1145/2526188.2528279 Roghayeh Dadashi and Hamidreza Rashidy Kanan. 2013. AVCD-FRA: A novel solution to automatic video cut detection using fuzzy-rule-based approach. Comput. Vis. Image Underst. 117, 7 (July 2013), 807-817. DOI=10.1016/j.cviu.2013.03.002 http://dx.doi.org/10.1016/j.cviu.2013.03.002 Jeffrey Dalton, James Allan, and Pranav Mirajkar. 2013. Zero-shot video retrieval using content and concepts. 
In Proceedings of the 22nd ACM international conference on Conference on information & knowledge management (CIKM '13). ACM, New York, NY, USA, 1857-1860. DOI=10.1145/2505515.2507880 http://doi.acm.org/10.1145/2505515.2507880 Pradipto Das, Rohini K. Srihari, and Jason J. Corso. 2013. Translating related words to videos and back through latent topics. In Proceedings of the sixth ACM international conference on Web search and data mining (WSDM '13). ACM, New York, NY, USA, 485-494. DOI=10.1145/2433396.2433456 http://doi.acm.org/10.1145/2433396.2433456 Del Fabro, M.; Schoeffmann, K.; Guggenberger, M.; Taschwer, M., "A filtering tool to support interactive search in Internet video archives," Content-Based Multimedia Indexing (CBMI), 2013 11th International Workshop on , vol., no., pp.7,10, 17-19 June 2013 doi: 10.1109/CBMI.2013.6576544 Ajay Divakaran, Omar Javed, Saad Ali, Harpreet Sawhney, Qian Yu, Jingen Liu, Hui Cheng, and Amir Tamrakar. 2013. Video event recognition using concept attributes. In Proceedings of the 2013 IEEE Workshop on Applications of Computer Vision (WACV) (WACV '13). IEEE Computer Society, Washington, DC, USA, 339-346. DOI=10.1109/WACV.2013.6475038 http://dx.doi.org/10.1109/WACV.2013.6475038 de Rooij, O.; Worring, M., "Active Bucket Categorization for High Recall Video Retrieval," Multimedia, IEEE Transactions on , vol.15, no.4, pp.898,907, June 2013 doi: 10.1109/TMM.2013.2237894 Xiaohua Duan; Liang Lin; Hongyang Chao, "Discovering Video Shot Categories by Unsupervised Stochastic Graph Partition," Multimedia, IEEE Transactions on , vol.15, no.1, pp.167,180, Jan. 2013 doi: 10.1109/TMM.2012.2225029 Mennan Guder and Nihan Kesim Cicekli. 2013. Interactive Event Recognition in Video. In Proceedings of the 2013 IEEE International Symposium on Multimedia (ISM '13). IEEE Computer Society, Washington, DC, USA, 100-101. DOI=10.1109/ISM.2013.24 http://dx.doi.org/10.1109/ISM.2013.24 Mennan Guder and Nihan Kesim Cicekli. 2013. 
Dichotomic Decision Cascading for Video Shot Boundary Detection. In Proceedings of the 2013 IEEE International Symposium on Multimedia (ISM '13). IEEE Computer Society, Washington, DC, USA, 227-230. DOI=10.1109/ISM.2013.43 http://dx.doi.org/10.1109/ISM.2013.43 Jinlin Guo; Zhengwei Qiu; Gurrin, C., "Exploring the optimal visual vocabulary sizes for semantic concept detection," Content-Based Multimedia Indexing (CBMI), 2013 11th International Workshop on , vol., no., pp.109,114, 17-19 June 2013 doi: 10.1109/CBMI.2013.6576565 Amirhossein Habibian, Koen E. A. van de Sande, Cees G. M. Snoek. Recommendations for Video Event Recognition Using Concept Vocabularies. ACM International Conference on Multimedia Retrieval, 2013. Amirhossein Habibian, Cees G. M. Snoek. Video2Sentence and Vice Versa. ACM International Conference on Multimedia, 2013. Hamadi, A.; Mulhem, P.; Quenot, G., "Conceptual feedback for semantic multimedia indexing," Content-Based Multimedia Indexing (CBMI), 2013 11th International Workshop on , vol., no., pp.53,58, 17-19 June 2013 doi: 10.1109/CBMI.2013.6576552 Hamadi, A.; Quenot, G.; Mulhem, P., "Clustering based rescoring for semantic indexing of multimedia documents," Content-Based Multimedia Indexing (CBMI), 2013 11th International Workshop on , vol., no., pp.41,46, 17-19 June 2013 doi: 10.1109/CBMI.2013.6576550 Junwei Han; Xiang Ji; Xintao Hu; Dajiang Zhu; Kaiming Li; Xi Jiang; Guangbin Cui; Lei Guo; Tianming Liu, "Representing and Retrieving Video Shots in Human-Centric Brain Imaging Space," Image Processing, IEEE Transactions on , vol.22, no.7, pp.2723,2736, July 2013 doi: 10.1109/TIP.2013.2256919 Xintao Hu; Tuo Zhang; Junwei Han; Lei Guo; Tianming Liu, "Functional brain interactions during free viewing of video stream," Biomedical Imaging (ISBI), 2013 IEEE 10th International Symposium on , vol., no., pp.1082,1085, 7-11 April 2013 doi: 10.1109/ISBI.2013.6556666 Chang Huang; Yuan Li; Nevatia, R., "Multiple Target Tracking by Learning-Based 
Hierarchical Association of Detection Responses," Pattern Analysis and Machine Intelligence, IEEE Transactions on , vol.35, no.4, pp.898,910, April 2013 doi: 10.1109/TPAMI.2012.159 Nakamasa Inoue and Koichi Shinoda, "q-Gaussian Mixture Models for Image And Video Semantic Indexing," Elsevier Journal of Visual Communication and Image Representation, vol.24, no.8, pp.1450-1457, 2013. Shuiwang Ji; Wei Xu; Ming Yang; Kai Yu, "3D Convolutional Neural Networks for Human Action Recognition," Pattern Analysis and Machine Intelligence, IEEE Transactions on , vol.35, no.1, pp.221,231, Jan. 2013 doi: 10.1109/TPAMI.2012.59 Su Jiang, Yao Zhao, Shikui Wei, Rongrong Ni, and Zhenfeng Zhu. 2013. Frame filtering and path verification for improving video copy detection. In Proceedings of the Fifth International Conference on Internet Multimedia Computing and Service (ICIMCS '13). ACM, New York, NY, USA, 34-37. DOI=10.1145/2499788.2499829 http://doi.acm.org/10.1145/2499788.2499829 Ilseo Kim, Sangmin Oh, Arash Vahdat, Kevin Cannons, A.G. Amitha Perera, and Greg Mori. 2013. Segmental multi-way local pooling for video recognition. In Proceedings of the 21st ACM international conference on Multimedia (MM '13). ACM, New York, NY, USA, 637-640. DOI=10.1145/2502081.2502167 http://doi.acm.org/10.1145/2502081.2502167 Yusuke Kamishima, Nakamasa Inoue, and Koichi Shinoda, "Event detection in consumer videos using GMM supervectors and SVMs," EURASIP Journal on Image and Video Processing, vol.2013, no.1 , pp.1-13, 2013. Svetlana Kordumova, Xirong Li, Cees G. M. Snoek. Evaluating Sources and Strategies for Learning Video Concepts from Social Media. International Workshop on Content-Based Multimedia Indexing, 2013. 
Guorong Li; Qingming Huang; Lei Qin; Shuqiang Jiang, "SSOCBT: A Robust Semisupervised Online CovBoost Tracker That Uses Samples Differently," Circuits and Systems for Video Technology, IEEE Transactions on , vol.23, no.4, pp.695,709, April 2013 doi: 10.1109/TCSVT.2012.2221257 Jingen Liu; Qian Yu; Javed, O.; Ali, S.; Tamrakar, A.; Divakaran, A.; Hui Cheng; Sawhney, H., "Video event recognition using concept attributes," Applications of Computer Vision (WACV), 2013 IEEE Workshop on , vol., no., pp.339,346, 15-17 Jan. 2013 doi: 10.1109/WACV.2013.6475038 Bo Lu, Guoren Wang, Ye Yuan, and Dong Han. 2013. Semantic concept detection for video based on extreme learning machine. Neurocomput. 102 (February 2013), 176-183. DOI=10.1016/j.neucom.2012.02.043 http://dx.doi.org/10.1016/j.neucom.2012.02.043 Masoud Mazloom, Amirhossein Habibian, and Cees G.M. Snoek. 2013. Querying for video events by semantic signatures from few examples. In Proceedings of the 21st ACM international conference on Multimedia (MM '13). ACM, New York, NY, USA, 609-612. DOI=10.1145/2502081.2502160 http://doi.acm.org/10.1145/2502081.2502160 Kevin McGuinness, Noel E. O'Connor, Robin Aly, Franciska De Jong, Ken Chatfield, Omkar M. Parkhi, Relja Arandjelovic, Andrew Zisserman, Matthijs Douze, and Cordelia Schmid. 2013. The AXES PRO video search system. In Proceedings of the 3rd ACM conference on International conference on multimedia retrieval (ICMR '13). ACM, New York, NY, USA, 307-308. DOI=10.1145/2461466.2461519 http://doi.acm.org/10.1145/2461466.2461519 Sara Memar, Lilly Suriani Affendey, Norwati Mustapha, Shyamala C. Doraisamy, and Mohammadreza Ektefa. 2013. An integrated semantic-based approach in concept based video retrieval. Multimedia Tools Appl. 64, 1 (May 2013), 77-95. 
DOI=10.1007/s11042-011-0848-4 http://dx.doi.org/10.1007/s11042-011-0848-4 Li, M.; Monga, V., "Compact Video Fingerprinting Via Structural Graphical Models," Information Forensics and Security, IEEE Transactions on , vol.PP, no.99, pp.1,1, 0 doi: 10.1109/TIFS.2013.2278100 Xirong Li, Cees G. M. Snoek, Marcel Worring, Dennis C. Koelma, Arnold W. M. Smeulders. Bootstrapping Visual Categorization with Relevant Negatives. IEEE Transactions on Multimedia, Volume 15 (4), page 933-945, 2013. Zhenyang Li, Efstratios Gavves, Koen E. A. van de Sande, Cees G. M. Snoek, Arnold W. M. Smeulders. Codemaps Segment, Classify and Search Objects Locally. IEEE International Conference on Computer Vision, 2013. Suzanne Little, Iveel Jargalsaikhan, Kathy Clawson, Marcos Nieto, Hao Li, Cem Direkoglu, Noel E. O'Connor, Alan F. Smeaton, Jun Liu, Bryan Scotney, Hui Wang, Seán Gaines, Aitor Rodriguez, Pedro Sanchez, Ana Martínez Llorens, Karina Villarroel Peniza, Roberto Gimenez, Raúl Santos de La Cámara, Anna Mereu, Celso Prados, and Emmanouil Kafetzakis. 2013. Interactive surveillance event detection at TRECVid2012. In Proceedings of the 3rd ACM conference on International conference on multimedia retrieval (ICMR '13). ACM, New York, NY, USA, 301-302. DOI=10.1145/2461466.2461516 http://doi.acm.org/10.1145/2461466.2461516 Suzanne Little, Iveel Jargalsaikhan, Kathy Clawson, Marcos Nieto, Hao Li, Cem Direkoglu, Noel E. O'Connor, Alan F. Smeaton, Bryan Scotney, Hui Wang, and Jun Liu. 2013. An information retrieval approach to identifying infrequent events in surveillance video. In Proceedings of the 3rd ACM conference on International conference on multimedia retrieval (ICMR '13). ACM, New York, NY, USA, 223-230. DOI=10.1145/2461466.2461503 http://doi.acm.org/10.1145/2461466.2461503 Hong Liu; Hong Lu; Xiangyang Xue, "A Segmentation and Graph-Based Video Sequence Matching Method for Video Copy Detection," Knowledge and Data Engineering, IEEE Transactions on , vol.25, no.8, pp.1706,1718, Aug. 
2013 doi: 10.1109/TKDE.2012.92 Wu Liu, Feibin Yang, Yongdong Zhang, Qinghua Huang, and Tao Mei. 2013. LAVES: an instant mobile video search system based on layered audio-video indexing. In Proceedings of the 21st ACM international conference on Multimedia (MM '13). ACM, New York, NY, USA, 409-410. DOI=10.1145/2502081.2502244 http://doi.acm.org/10.1145/2502081.2502244 Liu, X.; Lin, L.; Jin, H., "Contextualized Trajectory Parsing with Spatio-Temporal Graph," Pattern Analysis and Machine Intelligence, IEEE Transactions on , vol.PP, no.99, pp.1,1, 0 doi: 10.1109/TPAMI.2013.84 Lu, Z.; Shi, Y., "Fast Video Shot Boundary Detection Based on SVD and Pattern Matching," Image Processing, IEEE Transactions on , vol.PP, no.99, pp.1,1, 0 doi: 10.1109/TIP.2013.2282081 Ma, Z.; Yang, Y.; Sebe, N.; Zheng, K.; Hauptmann, A.G., "Multimedia Event Detection Using A Classifier-Specific Intermediate Representation," Multimedia, IEEE Transactions on , vol.PP, no.99, pp.1,1, 0 doi: 10.1109/TMM.2013.2264928 Zhigang Ma, Yi Yang, Zhongwen Xu, Nicu Sebe, and Alexander G. Hauptmann. 2013. We are not equally negative: fine-grained labeling for multimedia event detection. In Proceedings of the 21st ACM international conference on Multimedia (MM '13). ACM, New York, NY, USA, 293-302. DOI=10.1145/2502081.2502119 http://doi.acm.org/10.1145/2502081.2502119 Masoud Mazloom, Efstratios Gavves, Koen E. A. van de Sande, Cees G. M. Snoek. Searching Informative Concept Banks for Video Event Detection. ACM International Conference on Multimedia Retrieval, 2013. Masoud Mazloom, Amirhossein Habibian, Cees G. M. Snoek. Querying for Video Events by Semantic Signatures from Few Examples. ACM International Conference on Multimedia, 2013. 
Tao Mei, Lin-Xie Tang, Jinhui Tang, and Xian-Sheng Hua. 2013. Near-lossless semantic video summarization and its applications to video analysis. ACM Trans. Multimedia Comput. Commun. Appl. 9, 3, Article 16 (July 2013), 23 pages. DOI=10.1145/2487268.2487269 http://doi.acm.org/10.1145/2487268.2487269 Gregory K. Meyers, Ramesh Nallapati, Julien van Hout, Stephanie Pancoast, Ram Nevatia, Chen Sun, Amirhossein Habibian, Dennis C. Koelma, Koen E. A. van de Sande, Arnold W. M. Smeulders, Cees G. M. Snoek. Evaluating Multimedia Features and Fusion for Example-Based Event Detection. Machine Vision and Applications, In press, 2013. Davide Modolo, Cees G. M. Snoek. Can Object Detectors Aid Internet Video Event Retrieval? IS&T/SPIE Symposium on Electronic Imaging, 2013. Niaz, U.; Merialdo, B., "Leveraging from group classification for video concept detection," Content-Based Multimedia Indexing (CBMI), 2013 11th International Workshop on , vol., no., pp.173,178, 17-19 June 2013 doi: 10.1109/CBMI.2013.6576577 Oneata, Dan and Verbeek, Jakob and Schmid, Cordelia, "Action and Event Recognition with Fisher Vectors on a Compact Feature Set", in IEEE International Conference on Computer Vision (ICCV), Dec 2013, Sydney, Australia, http://hal.inria.fr/hal-00873662 Fabio Poiesi, Riccardo Mazzon, and Andrea Cavallaro. 2013. Multi-target tracking on confidence maps: An application to people tracking. Comput. Vis. Image Underst. 117, 10 (October 2013), 1257-1272. DOI=10.1016/j.cviu.2012.08.008 http://dx.doi.org/10.1016/j.cviu.2012.08.008 Priyadharssini, B.A.; Sivagami, S.V.; Muneeswaran, K., "Maximum a posteriori adaptation method for video semantic indexing," Emerging Trends in Computing, Communication and Nanotechnology (ICE-CCN), 2013 International Conference on , vol., no., pp.58,61, 25-26 March 2013 doi: 10.1109/ICE-CCN.2013.6528613 Vignesh Ramanathan, Percy Liang, and Li Fei-Fei. 2013. Video Event Understanding Using Natural Language Descriptions. 
In Proceedings of the 2013 IEEE International Conference on Computer Vision (ICCV '13). IEEE Computer Society, Washington, DC, USA, 905-912. DOI=10.1109/ICCV.2013.117 http://dx.doi.org/10.1109/ICCV.2013.117 Vignesh Ramanathan, Bangpeng Yao, and Li Fei-Fei. 2013. Social Role Discovery in Human Events. In Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR '13). IEEE Computer Society, Washington, DC, USA, 2475-2482. DOI=10.1109/CVPR.2013.320 http://dx.doi.org/10.1109/CVPR.2013.320 Miriam Redi and Bernard Merialdo. 2013. Direct modeling of image keypoints distribution through copula-based image signatures. In Proceedings of the 3rd ACM conference on International conference on multimedia retrieval (ICMR '13). ACM, New York, NY, USA, 183-190. DOI=10.1145/2461466.2461498 http://doi.acm.org/10.1145/2461466.2461498 Ren, Y.J.; O'Gorman, L.; Wu, L.J.; Chang, F.; Wood, T.L.; Zhang, J.R., "Authenticating Lossy Surveillance Video," Information Forensics and Security, IEEE Transactions on , vol.8, no.10, pp.1678,1687, Oct. 2013 doi: 10.1109/TIFS.2013.2279542 Safadi, B.; Quenot, G., "Descriptor optimization for multimedia indexing and retrieval," Content-Based Multimedia Indexing (CBMI), 2013 11th International Workshop on , vol., no., pp.65,71, 17-19 June 2013 doi: 10.1109/CBMI.2013.6576554 Shen, Y.; Miao, Z., "Multi-Human Tracking Based on Spatial-Temporal Appearance Match," Circuits and Systems for Video Technology, IEEE Transactions on , vol.PP, no.99, pp.1,1, 0 doi: 10.1109/TCSVT.2013.2280073 Shinoda, K.; Inoue, N., "Reusing Speech Techniques for Video Semantic Indexing [Applications Corner]," Signal Processing Magazine, IEEE , vol.30, no.2, pp.118,122, March 2013 doi: 10.1109/MSP.2012.2230520 Mats Sjoberg, Markus Koskela, Satoru Ishikawa, and Jorma Laaksonen. Large-scale Visual Concept Detection with Explicit Kernel Maps and Power Mean SVM. 
In Proceedings of ACM International Conference on Multimedia Retrieval (ICMR2013), pages 239--246, Dallas, Texas, USA, April 2013. ACM. Slimi, J.; Ben Ammar, A.; Alimi, A.M., "Interactive video data visualization system based on semantic organization," Content-Based Multimedia Indexing (CBMI), 2013 11th International Workshop on , vol., no., pp.161,166, 17-19 June 2013 doi: 10.1109/CBMI.2013.6576575 Strat, S.T.; Benoit, A.; Lambert, P., "Bags of Trajectory Words for video indexing," Content-Based Multimedia Indexing (CBMI), 2014 12th International Workshop on , vol., no., pp.1,6, 18-20 June 2014 doi: 10.1109/CBMI.2014.6849820 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6849820&isnumber=6849811 Strat, S.T.; Benoit, A.; Lambert, P., "Retina enhanced SIFT descriptors for video indexing," Content-Based Multimedia Indexing (CBMI), 2013 11th International Workshop on , vol., no., pp.201,206, 17-19 June 2013 doi: 10.1109/CBMI.2013.6576582 Chen Sun and Ram Nevatia. 2013. ACTIVE: Activity Concept Transitions in Video Event Classification. In Proceedings of the 2013 IEEE International Conference on Computer Vision (ICCV '13). IEEE Computer Society, Washington, DC, USA, 913-920. DOI=10.1109/ICCV.2013.453 http://dx.doi.org/10.1109/ICCV.2013.453 Chen Sun; Nevatia, R., "Large-scale web video event classification by use of Fisher Vectors," Applications of Computer Vision (WACV), 2013 IEEE Workshop on , vol., no., pp.15,22, 15-17 Jan. 2013 doi: 10.1109/WACV.2013.6474994 Kevin Tang, Bangpeng Yao, Li Fei-Fei, and Daphne Koller. 2013. Combining the Right Features for Complex Event Recognition. In Proceedings of the 2013 IEEE International Conference on Computer Vision (ICCV '13). IEEE Computer Society, Washington, DC, USA, 2696-2703. 
DOI=10.1109/ICCV.2013.335 http://dx.doi.org/10.1109/ICCV.2013.335 Yonghong Tian; Tiejun Huang; Menglin Jiang; Wen Gao, "Video Copy-Detection and Localization with a Scalable Cascading Framework," MultiMedia, IEEE , vol.20, no.3, pp.72,86, July-Sept. Christos Tzelepis, Nikolaos Gkalelis, Vasileios Mezaris, and Ioannis Kompatsiaris. 2013. Improving event detection using related videos and relevance degree support vector machines. In Proceedings of the 21st ACM international conference on Multimedia (MM '13). ACM, New York, NY, USA, 673-676. DOI=10.1145/2502081.2502176 http://doi.acm.org/10.1145/2502081.2502176 Tian, Y.; Wang, Y.; Hu, Z.; Huang, T., "Selective Eigenbackground for Background Modeling and Subtraction in Crowded Scenes," Circuits and Systems for Video Technology, IEEE Transactions on , vol.PP, no.99, pp.1,1, 0 doi: 10.1109/TCSVT.2013.2248239 Arash Vahdat, Kevin Cannons, Greg Mori, Sangmin Oh, and Ilseo Kim. 2013. Compositional Models for Video Event Detection: A Multiple Kernel Learning Latent Variable Approach. In Proceedings of the 2013 IEEE International Conference on Computer Vision (ICCV '13). IEEE Computer Society, Washington, DC, USA, 1185-1192. DOI=10.1109/ICCV.2013.463 http://dx.doi.org/10.1109/ICCV.2013.463 Carles Ventura. 2013. Visual object analysis using regions and interest points. In Proceedings of the 21st ACM international conference on Multimedia (MM '13). ACM, New York, NY, USA, 1075-1078. DOI=10.1145/2502081.2502220 http://doi.acm.org/10.1145/2502081.2502220 Zhiyong Wang, Genliang Guan, Yu Qiu, Li Zhuo, and Dagan Feng. 2013. Semantic context based refinement for news video annotation. Multimedia Tools Appl. 67, 3 (December 2013), 607-627. 
DOI=10.1007/s11042-012-1060-x http://dx.doi.org/10.1007/s11042-012-1060-x Xiao-Yong Wei; Zhen-Qun Yang, "Coaching the Exploration and Exploitation in Active Learning for Interactive Video Retrieval," Image Processing, IEEE Transactions on , vol.22, no.3, pp.955,968, March 2013 doi: 10.1109/TIP.2012.2222902 Song Wu and Michael Lew. 2013. Evaluation of salient point methods. In Proceedings of the 21st ACM international conference on Multimedia (MM '13). ACM, New York, NY, USA, 685-688. DOI=10.1145/2502081.2502179 http://doi.acm.org/10.1145/2502081.2502179 Zhongwen Xu, Yi Yang, Ivor Tsang, Nicu Sebe, and Alexander G. Hauptmann. 2013. Feature Weighting via Optimal Thresholding for Video Analysis. In Proceedings of the 2013 IEEE International Conference on Computer Vision (ICCV '13). IEEE Computer Society, Washington, DC, USA, 3440-3447. DOI=10.1109/ICCV.2013.427 http://dx.doi.org/10.1109/ICCV.2013.427 Yi Yang, Zhigang Ma, Zhongwen Xu, Shuicheng Yan, and Alexander G. Hauptmann. 2013. How Related Exemplars Help Complex Event Detection in Web Videos?. In Proceedings of the 2013 IEEE International Conference on Computer Vision (ICCV '13). IEEE Computer Society, Washington, DC, USA, 2104-2111. DOI=10.1109/ICCV.2013.456 http://dx.doi.org/10.1109/ICCV.2013.456 Yi Yang; Jingkuan Song; Zi Huang; Zhigang Ma; Sebe, N.; Hauptmann, A.G., "Multi-Feature Fusion via Hierarchical Regression for Multimedia Analysis," Multimedia, IEEE Transactions on , vol.15, no.3, pp.572,581, April 2013 doi: 10.1109/TMM.2012.2234731 Yang Yang, Yi Yang, and Heng Tao Shen. 2013. Effective transfer tagging from image to video. ACM Trans. Multimedia Comput. Commun. Appl. 9, 2, Article 14 (May 2013), 20 pages. 
DOI=10.1145/2457450.2457456 http://doi.acm.org/10.1145/2457450.2457456 Ting Yao; Chong-Wah Ngo; Tao Mei, "Circular Reranking for Visual Search," Image Processing, IEEE Transactions on , vol.22, no.4, pp.1644,1655, April 2013 doi: 10.1109/TIP.2012.2236341 Yi, J.; Peng, Y.; Xiao, J., "Exploiting Semantic and Visual Context for Effective Video Annotation," Multimedia, IEEE Transactions on , vol.15, no.6, pp.1400,1414, Oct. 2013 doi: 10.1109/TMM.2013.2250266 Qian Yu, Jingen Liu, Hui Cheng, Ajay Divakaran, and Harpreet Sawhney. 2013. Semantic pooling for complex event detection. In Proceedings of the 21st ACM international conference on Multimedia (MM '13). ACM, New York, NY, USA, 733-736. DOI=10.1145/2502081.2502191 http://doi.acm.org/10.1145/2502081.2502191 Wei Zhang and Chong-Wah Ngo. 2013. Searching visual instances with topology checking and context modeling. In Proceedings of the 3rd ACM conference on International conference on multimedia retrieval (ICMR '13). ACM, New York, NY, USA, 57-64. DOI=10.1145/2461466.2461477 http://doi.acm.org/10.1145/2461466.2461477 Zhang, X.; Huang, T.; Tian, Y.; Geng, M.; Ma, S.; Gao, W., "Fast and Efficient Transcoding Based on Low-complexity Background Modeling and Adaptive Block Classification," Multimedia, IEEE Transactions on , vol.PP, no.99, pp.1,1, 0 doi: 10.1109/TMM.2013.2280117 Wan-Lei Zhao; Chong-Wah Ngo, "Flip-Invariant SIFT for Copy and Object Detection," Image Processing, IEEE Transactions on , vol.22, no.3, pp.980,991, March 2013 doi: 10.1109/TIP.2012.2226043 Zheng-Jun Zha, Tao Mei, Richang Hong, and Zhiwei Gu. 2013. Marginalized multi-layer multi-instance kernel for video concept detection. Signal Process. 93, 8 (August 2013), 2119-2125. 
DOI=10.1016/j.sigpro.2012.08.026 http://dx.doi.org/10.1016/j.sigpro.2012.08.026 Xiangmin Zhou; Lei Chen, "ASVTDECTOR: A practical near duplicate video retrieval system," Data Engineering (ICDE), 2013 IEEE 29th International Conference on , vol., no., pp.1348,1351, 8-12 April 2013 doi: 10.1109/ICDE.2013.6544941 Cai-Zhi Zhu, Xiao Zhou, and Shin'Ichi Satoh. 2013. Bag-of-Words Against Nearest-Neighbor Search for Visual Object Retrieval. In Proceedings of the 2013 2nd IAPR Asian Conference on Pattern Recognition (ACPR '13). IEEE Computer Society, Washington, DC, USA, 626-630. DOI=10.1109/ACPR.2013.56 http://dx.doi.org/10.1109/ACPR.2013.56 Xiaofeng Zhu; Zi Huang; Jiangtao Cui; Heng Tao Shen, "Video-to-Shot Tag Propagation by Graph Sparse Group Lasso," Multimedia, IEEE Transactions on , vol.15, no.3, pp.633,646, April 2013 doi: Xiaodan Zhuang, Shuang Wu, and Pradeep Natarajan. 2013. Compact bag-of-words visual representation for effective linear classification. In Proceedings of the 21st ACM international conference on Multimedia (MM '13). ACM, New York, NY, USA, 521-524. DOI=10.1145/2502081.2502138 http://doi.acm.org/10.1145/2502081.2502138 --------------------------------------------------------------------- 2012 (81) --------------------------------------------------------------------- Aly, Robin and Doherty, Aiden and Hiemstra, Djoerd and de Jong, Franciska and Smeaton, Alan. The uncertain representation ranking framework for concept-based video retrieval. Information Retrieval 2012, pp.1-27, doi = {10.1007/s10791-012-9207-y}, Springer Netherlands Aly, Robin; Hiemstra, Djoerd; de Jong, Franciska; Apers, Peter. Simulating the future of concept-based video retrieval under improved detector performance. Multimedia Tools and Applications, 2012-09-01, Springer Netherlands, pp. 203-231. Vol. 60, Issue. 
1, dx.doi.org/10.1007/s11042-011-0818-x, Doi: 10.1007/s11042-011-0818-x Anguera, Xavier; Garzon, Antonio; Adamek, Tomasz; , "MASK: Robust Local Features for Audio Fingerprinting," Multimedia and Expo (ICME), 2012 IEEE International Conference on , vol., no., pp.455-460, 9-13 July 2012 doi: 10.1109/ICME.2012.137 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6298443&isnumber=6298237 Werner Bailer. 2012. Sequence kernels for clustering and visualizing near duplicate video segments. In Proceedings of the 18th international conference on Advances in Multimedia Modeling (MMM'12), Klaus Schoeffmann, Bernard Merialdo, Alexander G. Hauptmann, Chong-Wah Ngo, and Yiannis Andreopoulos (Eds.). Springer-Verlag, Berlin, Heidelberg, 383-394. DOI=10.1007/978-3-642-27355-1_36 http://dx.doi.org/10.1007/978-3-642-27355-1_36 Werner Bailer, "Learning Multiple Sequence-based Kernels for Video Concept Detection," in IEEE International Symposium on Multimedia, Irvine, CA, USA, Dec. 2012, pp. 73-77. Mohammed Belkhatir and Bashar Tahayna. 2012. Near-duplicate video detection featuring coupled temporal and perceptual visual structures and logical inference based matching. Inf. Process. Manage. 48, 3 (May 2012), 489-501. DOI=10.1016/j.ipm.2011.03.003 http://dx.doi.org/10.1016/j.ipm.2011.03.003 Bredin, Herve; , "Community-driven hierarchical fusion of numerous classifiers: Application to video semantic indexing," Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on , vol., no., pp.2329-2332, 25-30 March 2012 doi: 10.1109/ICASSP.2012.6288381 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6288381&isnumber=6287775 Andrei Bursuc, Titus Zaharia, and Francoise Prêteux. 2012. Retrieval of multiple instances of objects in videos. In Proceedings of the 18th international conference on Advances in Multimedia Modeling (MMM'12), Klaus Schoeffmann, Bernard Merialdo, Alexander G. Hauptmann, Chong-Wah Ngo, and Yiannis Andreopoulos (Eds.). 
Springer-Verlag, Berlin, Heidelberg, 358-369. DOI=10.1007/978-3-642-27355-1_34 http://dx.doi.org/10.1007/978-3-642-27355-1_34 Shu Chen; McGuinness, K.; Aly, R.; O'Connor, N.E.; de Jong, F.; , "The AXES-lite video search engine," Image Analysis for Multimedia Interactive Services (WIAMIS), 2012 13th International Workshop on , vol., no., pp.1-4, 23-25 May 2012 doi: 10.1109/WIAMIS.2012.6226778 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6226778&isnumber=6226742 Choudhury, A.; Medioni, G.; , "A Framework for Robust Online Video Contrast Enhancement Using Modularity Optimization," Circuits and Systems for Video Technology, IEEE Transactions on , vol.22, no.9, pp.1266-1279, Sept. 2012 doi: 10.1109/TCSVT.2012.2198136 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6196206&isnumber=6291822 Codella, Noel C.F.; Natsev, Apostol; Hua, Gang; Hill, Matthew; Cao, Liangliang; Gong, Leiguang; Smith, John R.; , "Video Event Detection Using Temporal Pyramids of Visual Semantics with Kernel Optimization and Model Subspace Boosting," Multimedia and Expo (ICME), 2012 IEEE International Conference on , vol., no., pp.747-752, 9-13 July 2012 doi: 10.1109/ICME.2012.190 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6298492&isnumber=6298237 Tiago O. Cunha, Flávio G. H. de Souza, Arnaldo de A. Araújo, and Gisele L. Pappa. 2012. Rushes video summarization based on spatio-temporal features. In Proceedings of the 27th Annual ACM Symposium on Applied Computing (SAC '12). ACM, New York, NY, USA, 45-50. 
DOI=10.1145/2245276.2245287 http://doi.acm.org/10.1145/2245276.2245287 Lixin Duan; Tsang, I.W.; Dong Xu; , "Domain Transfer Multiple Kernel Learning," Pattern Analysis and Machine Intelligence, IEEE Transactions on , vol.34, no.3, pp.465-479, March 2012 doi: 10.1109/TPAMI.2011.114 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6136518&isnumber=6136512 Dumont, Emilie; Quenot, Georges; , "A Local Temporal Context-Based Approach for TV News Story Segmentation," Multimedia and Expo (ICME), 2012 IEEE International Conference on , vol., no., pp.973-978, 9-13 July 2012 doi: 10.1109/ICME.2012.3 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6298529&isnumber=6298237 Ralph Ewerth, Markus Muehling, and Bernd Freisleben. 2012. Robust Video Content Analysis via Transductive Learning. ACM Trans. Intell. Syst. Technol. 3, 3, Article 41 (May 2012), 26 pages. DOI=10.1145/2168752.2168755 http://doi.acm.org/10.1145/2168752.2168755 Álvaro García-Martín and José M. Martínez. 2012. On collaborative people detection and tracking in complex scenarios. Image Vision Comput. 30, 4-5 (May 2012), 345-354. DOI=10.1016/j.imavis.2012.03.005 http://dx.doi.org/10.1016/j.imavis.2012.03.005 Álvaro García-Martín, José M. Martínez, and Jesús Bescós. 2012. A corpus for benchmarking of people detection algorithms. Pattern Recogn. Lett. 33, 2 (January 2012), 152-156. DOI=10.1016/j.patrec.2011.09.038 http://dx.doi.org/10.1016/j.patrec.2011.09.038 Efstratios Gavves, Cees G. M. Snoek, and Arnold W. M. Smeulders, "Convex Reduction of High-Dimensional Kernels for Visual Classification," in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Providence, Rhode Island, USA, 2012. Efstratios Gavves, Cees G. M. Snoek, and Arnold W. M. Smeulders, "Visual Synonyms for Landmark Image Retrieval," Computer Vision and Image Understanding, vol. 116, iss. 2, pp. 238-249, 2012. 
Bo Geng; Yangxi Li; Dacheng Tao; Meng Wang; Zheng-Jun Zha; Chao Xu; , "Parallel Lasso for Large-Scale Video Concept Detection," Multimedia, IEEE Transactions on , vol.14, no.1, pp.55-65, Feb. 2012 doi: 10.1109/TMM.2011.2174781 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6069863&isnumber=6130620 Jinlin Guo, Colum Foley, Cathal Gurrin, and Songyang Lao. 2011. Semantic concept detection in imbalanced datasets based on different under-sampling strategies. In Proceedings of the 2011 IEEE International Conference on Multimedia and Expo (ICME '11). IEEE Computer Society, Washington, DC, USA, 1-6. DOI=10.1109/ICME.2011.6011923 http://dx.doi.org/10.1109/ICME.2011.6011923 Vishwa Nath Gupta, Gilles Boulianne, and Patrick Cardinal. 2012. CRIM's content-based audio copy detection system for TRECVID 2009. Multimedia Tools Appl. 60, 2 (September 2012), 371-387. DOI=10.1007/s11042-010-0608-x http://dx.doi.org/10.1007/s11042-010-0608-x Hamadi, A.; Quenot, G.; Mulhem, P.; , "Two-layers re-ranking approach based on contextual information for visual concepts detection in videos," Content-Based Multimedia Indexing (CBMI), 2012 10th International Workshop on , vol., no., pp.1-6, 27-29 June 2012 doi: 10.1109/CBMI.2012.6269837 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6269837&isnumber=6269791 R. Cameron Harvey and Mohamed Hefeeda. 2012. Spatio-temporal video copy detection. In Proceedings of the 3rd Multimedia Systems Conference (MMSys '12). ACM, New York, NY, USA, 35-46. 
DOI=10.1145/2155555.2155562 http://doi.acm.org/10.1145/2155555.2155562 Xintao Hu; Kaiming Li; Junwei Han; Xiansheng Hua; Lei Guo; Tianming Liu; , "Bridging the Semantic Gap via Functional Brain Imaging," Multimedia, IEEE Transactions on , vol.14, no.2, pp.314-325, April 2012 doi: 10.1109/TMM.2011.2172201 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6046230&isnumber=6170997 Huang, Po-Sen; Mertens, Robert; Divakaran, Ajay; Friedland, Gerald; Hasegawa-Johnson, Mark; , "How to put it into words - using random forests to extract symbol level descriptions from audio content for concept detection," Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on , vol., no., pp.505-508, 25-30 March 2012 doi: 10.1109/ICASSP.2012.6287927 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6287927&isnumber=6287775 Bouke Huurnink, Cees G. M. Snoek, Maarten de Rijke, and Arnold W. M. Smeulders, "Content-Based Analysis Improves Audiovisual Archive Retrieval," IEEE Transactions on Multimedia, vol. 14, iss. 4, pp. 1166-1178, 2012. Inoue, N.; Shinoda, K.; , "A Fast and Accurate Video Semantic-Indexing System Using Fast MAP Adaptation and GMM Supervectors," Multimedia, IEEE Transactions on , vol.14, no.4, pp.1196-1205, Aug. 2012 doi: 10.1109/TMM.2012.2191395 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6172243&isnumber=6239700 Nakamasa Inoue and Koichi Shinoda, "q-Gaussian Mixture Models Based on Non-Extensive Statistics for Image And Video Semantic Indexing," In Proc. ACCV, pp.499-510, 2012. 
Jegou, Herve; Delhumeau, Jonathan; Yuan, Jiangbo; Gravier, Guillaume; Gros, Patrick; , "BABAZ: A large scale audio search system for video copy detection," Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on , vol., no., pp.2369-2372, 25-30 March 2012 doi: 10.1109/ICASSP.2012.6288391 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6288391&isnumber=6287775 Jiang, Menglin; Tian, Yonghong; Huang, Tiejun; , "Video Copy Detection Using a Soft Cascade of Multimodal Features," Multimedia and Expo (ICME), 2012 IEEE International Conference on , vol., no., pp.374-379, 9-13 July 2012 doi: 10.1109/ICME.2012.189 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6298426&isnumber=6298237 Yusuke Kamishima, Nakamasa Inoue, Koichi Shinoda, and Shunsuke Sato, "Multimedia Event Detection Using GMM Supervectors and SVMs," In Proc. ICIP, pp.3089–3092, 2012. Zhen-zhong Lan, Lei Bao, Shoou-I Yu, Wei Liu, and Alexander G. Hauptmann. 2012. Double fusion for multimedia event detection. In Proceedings of the 18th international conference on Advances in Multimedia Modeling (MMM'12), Klaus Schoeffmann, Bernard Merialdo, Alexander G. Hauptmann, Chong-Wah Ngo, and Yiannis Andreopoulos (Eds.). Springer-Verlag, Berlin, Heidelberg, 173-185. DOI=10.1007/978-3-642-27355-1_18 http://dx.doi.org/10.1007/978-3-642-27355-1_18 Huan Li, Yuan Shi, Yang Liu, Alexander G. Hauptmann, and Zhang Xiong. 2012. Cross-domain video concept detection: A joint discriminative and generative active learning approach. Expert Syst. Appl. 39, 15 (November 2012), 12220-12228. DOI=10.1016/j.eswa.2012.04.054 http://dx.doi.org/10.1016/j.eswa.2012.04.054 Xirong Li, Cees G. M. Snoek, Marcel Worring, and Arnold W. M. Smeulders, "Fusing Concept Detection and Geo Context for Visual Search," in Proceedings of the ACM International Conference on Multimedia Retrieval, Hong Kong, China, 2012. Xirong Li, Cees G. M. Snoek, Marcel Worring, and Arnold W. M. 
Smeulders, "Harvesting Social Images for Bi-Concept Search," IEEE Transactions on Multimedia, vol. 14, iss. 4, pp. 1091-1104, 2012. Li, Zhi and Liu, Guizhong; "Video scene analysis in 3D wavelet transform domain", Multimedia Tools and Applications, Springer Netherlands,dx.doi.org/10.1007/s11042-010-0594-z, 2012. Jingen Liu. 2012. Evaluation of low-level features and their combinations for complex event detection in open source videos. In Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (CVPR '12). IEEE Computer Society, Washington, DC, USA, 3681-3688. Gjorgji Madjarov, Dragi Kocev, Dejan Gjorgjevikj, and Sašo Džeroski. 2012. An extensive experimental comparison of methods for multi-label learning. Pattern Recogn. 45, 9 (September 2012), 3084-3104. DOI=10.1016/j.patcog.2012.03.004 http://dx.doi.org/10.1016/j.patcog.2012.03.004 Mansencal, B.; Benois-Pineau, J.; Vieux, R.; Domenger, J.-P.; , "Search of objects of interest in videos," Content-Based Multimedia Indexing (CBMI), 2012 10th International Workshop on , vol., no., pp.1-6, 27-29 June 2012 doi: 10.1109/CBMI.2012.6269809 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6269809&isnumber=6269791 Meng, Tao; Shyu, Mei-Ling; , "Leveraging Concept Association Network for Multimedia Rare Concept Mining and Retrieval," Multimedia and Expo (ICME), 2012 IEEE International Conference on , vol., no., pp.860-865, 9-13 July 2012 doi: 10.1109/ICME.2012.134 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6298511&isnumber=6298237 Merler, M.; Huang, B.; Lexing Xie; Gang Hua; Natsev, A.; , "Semantic Model Vectors for Complex Video Event Recognition," Multimedia, IEEE Transactions on , vol.14, no.1, pp.88-101, Feb. 
2012 doi: 10.1109/TMM.2011.2168948 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6024471&isnumber=6130620 Hyun-seok Min; Jae Young Choi; De Neve, W.; Yong Man Ro; , "Near-Duplicate Video Clip Detection Using Model-Free Semantic Concept Detection and Adaptive Semantic Distance Measurement," Circuits and Systems for Video Technology, IEEE Transactions on , vol.22, no.8, pp.1174-1187, Aug. 2012 doi: 10.1109/TCSVT.2012.2197080 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6193167&isnumber=6255841 Muehling, Markus and Ewerth, Ralph and Zhou, Jun and Freisleben, Bernd; Multimodal Video Concept Detection via Bag of Auditory Words and Multiple Kernel Learning; in Advances in Multimedia Modeling, Lecture Notes in Computer Science, Eds.: Schoeffmann, Klaus and Merialdo, Bernard and Hauptmann, Alexander and Ngo, Chong-Wah and Andreopoulos, Yiannis and Breiteneder, Christian; Springer Berlin / Heidelberg, isbn 978-3-642-27354-4, pp 40-50, vol. 7131, http://dx.doi.org/10.1007/978-3-642-27355-1_7, 2012. Natarajan, P.; Shuang Wu; Vitaladevuni, S.; Xiaodan Zhuang; Tsakalidis, S.; Unsang Park; Prasad, R.; Natarajan, P.; , "Multimodal feature fusion for robust event detection in web videos," Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on , vol., no., pp.1298-1305, 16-21 June 2012 doi: 10.1109/CVPR.2012.6247814 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6247814&isnumber=6247647 Parkhi, O.M.; Vedaldi, A.; Zisserman, A.; , "On-the-fly specific person retrieval," Image Analysis for Multimedia Interactive Services (WIAMIS), 2012 13th International Workshop on , vol., no., pp.1-4, 23-25 May 2012 doi: 10.1109/WIAMIS.2012.6226775 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6226775&isnumber=6226742 Jesse Read, Albert Bifet, Geoff Holmes, and Bernhard Pfahringer. 2012. Scalable and efficient multi-label classification for evolving data streams. Mach. Learn. 88, 1-2 (July 2012), 243-272.
DOI=10.1007/s10994-012-5279-6 http://dx.doi.org/10.1007/s10994-012-5279-6 Redi, Miriam; Merialdo, Bernard; , "Fitting Gaussian copulae for efficient visual codebooks generation," Content-Based Multimedia Indexing (CBMI), 2012 10th International Workshop on , vol., no., pp.1-6, 27-29 June 2012 doi: 10.1109/CBMI.2012.6269794 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6269794&isnumber=6269791 Redi, Miriam; Merialdo, Bernard;, "A Multimedia Retrieval Framework Based on Automatic Graded Relevance Judgments" in Advances in Multimedia Modeling, Lecture Notes in Computer Science, 2012, Springer Berlin / Heidelberg, pp. 300-311,dx.doi.org/10.1007/978-3-642-27355-1_29 Doi: 10.1007/978-3-642-27355-1_29 Miriam Redi and Bernard Merialdo. 2012. Exploring two spaces with one feature: kernelized multidimensional modeling of visual alphabets. In Proceedings of the 2nd ACM International Conference on Multimedia Retrieval (ICMR '12). ACM, New York, NY, USA, , Article 20 , 8 pages. DOI=10.1145/2324796.2324821 http://doi.acm.org/10.1145/2324796.2324821 Jennifer Ren, Fangzhe Chang, Thomas Wood, and John R. Zhang. 2012. Efficient video copy detection via aligning video signature time series. In Proceedings of the 2nd ACM International Conference on Multimedia Retrieval (ICMR '12). ACM, New York, NY, USA, , Article 14 , 8 pages. DOI=10.1145/2324796.2324814 http://doi.acm.org/10.1145/2324796.2324814 Bahjat Safadi, Stephane Ayache, and Georges Quenot. 2012. Active cleaning for video corpus annotation. In Proceedings of the 18th international conference on Advances in Multimedia Modeling (MMM'12), Klaus Schoeffmann, Bernard Merialdo, Alexander G. Hauptmann, Chong-Wah Ngo, and Yiannis Andreopoulos (Eds.). Springer-Verlag, Berlin, Heidelberg, 518-528. DOI=10.1007/978-3-642-27355-1_48 http://dx.doi.org/10.1007/978-3-642-27355-1_48 Kimiaki Shirahama, Yuta Matsuoka, and Kuniaki Uehara. 2012. 
Event retrieval in video archives using rough set theory and partially supervised learning. Multimedia Tools Appl. 57, 1 (March 2012), 145-173. DOI=10.1007/s11042-011-0727-z http://dx.doi.org/10.1007/s11042-011-0727-z Mats Sjöberg, Markus Koskela, Satoru Ishikawa, and Jorma Laaksonen. Real-time Large-scale Visual Concept Detection with Linear Classifiers. In Proceedings of 21st International Conference on Pattern Recognition, Tsukuba, Japan, November 2012. Tamrakar, A.; Ali, S.; Qian Yu; Jingen Liu; Javed, O.; Divakaran, A.; Hui Cheng; Sawhney, H.; , "Evaluation of low-level features and their combinations for complex event detection in open source videos," Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on , vol., no., pp.3681-3688, 16-21 June 2012 doi: 10.1109/CVPR.2012.6248114 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6248114&isnumber=6247647 Claudiu Tănase and Bernard Merialdo. 2012. Efficient spatio-temporal edge descriptor. In Proceedings of the 18th international conference on Advances in Multimedia Modeling (MMM'12), Klaus Schoeffmann, Bernard Merialdo, Alexander G. Hauptmann, Chong-Wah Ngo, and Yiannis Andreopoulos (Eds.). Springer-Verlag, Berlin, Heidelberg, 210-221. DOI=10.1007/978-3-642-27355-1_21 http://dx.doi.org/10.1007/978-3-642-27355-1_21 Tang, K.; Li Fei-Fei; Koller, D.; , "Learning latent temporal structure for complex event detection," Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on , vol., no., pp.1250-1257, 16-21 June 2012 doi: 10.1109/CVPR.2012.6247808 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6247808&isnumber=6247647 Sheng Tang; Yan-Tao Zheng; Yu Wang; Tat-Seng Chua; , "Sparse Ensemble Learning for Concept Detection," Multimedia, IEEE Transactions on , vol.14, no.1, pp.43-54, Feb. 
2012 doi: 10.1109/TMM.2011.2168198 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6020805&isnumber=6130620 Tuan Hue Thi, Li Cheng, Jian Zhang, Li Wang, and Shinichi Satoh. 2012. Editors Choice Article: Structured learning of local features for human action classification and localization. Image Vision Comput. 30, 1 (January 2012), 1-14. DOI=10.1016/j.imavis.2011.12.006 http://dx.doi.org/10.1016/j.imavis.2011.12.006 Xinmei Tian, Dacheng Tao, and Yong Rui. 2012. Sparse transfer learning for interactive video search reranking. ACM Trans. Multimedia Comput. Commun. Appl. 8, 3, Article 26 (August 2012), 19 pages. DOI=10.1145/2240136.2240139 http://doi.acm.org/10.1145/2240136.2240139 Mercan Topkara, Shimei Pan, Jennifer Lai, Ahmet Dirik, Steven Wood, and Jeff Boston. 2012. "You've got video": increasing clickthrough when sharing enterprise video with email. In Proceedings of the 2012 ACM annual conference on Human Factors in Computing Systems (CHI '12). ACM, New York, NY, USA, 565-568. DOI=10.1145/2207676.2207755 http://doi.acm.org/10.1145/2207676.2207755 Valdés, Víctor; Martínez, José; "On-line video abstract generation of multimedia news" in Multimedia Tools and Applications, 2012-08-01, Springer Netherlands, pp. 795-832, vol. 59, No. 3 dx.doi.org/10.1007/s11042-011-0774-5 Doi: 10.1007/s11042-011-0774-5 Victor Valdes and Jose M. Martinez. 2012. Automatic evaluation of video summaries. ACM Trans. Multimedia Comput. Commun. Appl. 8, 3, Article 25 (August 2012), 21 pages. DOI=10.1145/2240136.2240138 http://doi.acm.org/10.1145/2240136.2240138 Robert Villa and Joemon M. Jose. 2012. A study of awareness in multimedia search. Inf. Process. Manage. 48, 1 (January 2012), 32-46. DOI=10.1016/j.ipm.2011.03.005 http://dx.doi.org/10.1016/j.ipm.2011.03.005 Feng Wang; Chong-Wah Ngo; , "Summarizing Rushes Videos by Motion, Object, and Event Understanding," Multimedia, IEEE Transactions on , vol.14, no.1, pp.76-87, Feb.
2012 doi: 10.1109/TMM.2011.2165531 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5993544&isnumber=6130620 Wang, Lezi; Dong, Yuan; Bai, Hongliang; Zhang, Jiwei; Huang, Chong; Liu, Wei; , "Contented-Based Large Scale Web Audio Copy Detection," Multimedia and Expo (ICME), 2012 IEEE International Conference on , vol., no., pp.961-966, 9-13 July 2012 doi: 10.1109/ICME.2012.17 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6298527&isnumber=6298237 Weng, Ming-Fang; Chuang, Yung-Yu; , "Cross-Domain Multicue Fusion for Concept-Based Video Indexing," Pattern Analysis and Machine Intelligence, IEEE Transactions on , vol.34, no.10, pp.1927-1941, Oct. 2012 doi: 10.1109/TPAMI.2011.273 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6112775&isnumber=6269017 Shaoxi Xu, Sheng Tang, Yongdong Zhang, Jintao Li, and Yan-Tao Zheng. 2012. Exploring multi-modality structure for cross domain adaptation in video concept annotation. Neurocomput. 95 (October 2012), 11-21. DOI=10.1016/j.neucom.2011.05.041 http://dx.doi.org/10.1016/j.neucom.2011.05.041 Yang, J.; Tong, W.; Hauptmann, A. G.; , "A Framework for Classifier Adaptation for Large-Scale Multimedia Data," Proceedings of the IEEE , vol.100, no.9, pp.2639-2657, Sept. 2012 doi: 10.1109/JPROC.2012.2204009 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6257407&isnumber=6269941 Turgay Yilmaz, Elvan Gulen, Adnan Yazici, and Masaru Kitsuregawa. 2012. A RELIEF-based modality weighting approach for multimodal information retrieval. In Proceedings of the 2nd ACM International Conference on Multimedia Retrieval (ICMR '12). ACM, New York, NY, USA, , Article 54 , 8 pages. DOI=10.1145/2324796.2324858 http://doi.acm.org/10.1145/2324796.2324858 Ehsan Younessian and Deepu Rajan. 2012. Scene signatures for unconstrained news video stories. In Proceedings of the 18th international conference on Advances in Multimedia Modeling (MMM'12), Klaus Schoeffmann, Bernard Merialdo, Alexander G. 
Hauptmann, Chong-Wah Ngo, and Yiannis Andreopoulos (Eds.). Springer-Verlag, Berlin, Heidelberg, 77-88. DOI=10.1007/978-3-642-27355-1_10 http://dx.doi.org/10.1007/978-3-642-27355-1_10 Jin Yuan, Huanbo Luan, Dejun Hou, Han Zhang, Yan-Tao Zheng, Zheng-Jun Zha, and Tat-Seng Chua. 2012. Video browser showdown by NUS. In Proceedings of the 18th international conference on Advances in Multimedia Modeling (MMM'12), Klaus Schoeffmann, Bernard Merialdo, Alexander G. Hauptmann, Chong-Wah Ngo, and Yiannis Andreopoulos (Eds.). Springer-Verlag, Berlin, Heidelberg, 642-645. DOI=10.1007/978-3-642-27355-1_64 http://dx.doi.org/10.1007/978-3-642-27355-1_64 Zhang, John R.; Ren, Jennifer Y.; Chang, Fangzhe; Wood, Thomas L.; Kender, John R.; , "Fast Near-Duplicate Video Retrieval via Motion Time Series Matching," Multimedia and Expo (ICME), 2012 IEEE International Conference on , vol., no., pp.842-847, 9-13 July 2012 doi: 10.1109/ICME.2012.111 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6298508&isnumber=6298237 Zheng-Jun Zha; Meng Wang; Yan-Tao Zheng; Yi Yang; Richang Hong; Tat-Seng Chua; , "Interactive Video Indexing With Statistical Active Learning," Multimedia, IEEE Transactions on , vol.14, no.1, pp.17-27, Feb. 2012 doi: 10.1109/TMM.2011.2174782 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6069865&isnumber=6130620 Zheng-Jun Zha, Tao Mei, Yan-Tao Zheng, Zengfu Wang, and Xian-Sheng Hua. 2012. A comprehensive representation scheme for video semantic ontology and its applications in semantic concept detection. Neurocomput. 95 (October 2012), 29-39. 
DOI=10.1016/j.neucom.2011.05.044 http://dx.doi.org/10.1016/j.neucom.2011.05.044 Yongchao Zhang; Mingxing Xu; Pratt, E.; , "Energy classification-assisted fingerprint system for content-based audio copy detection," Communications (COMM), 2012 9th International Conference on , vol., no., pp.35-38, 21-23 June 2012 doi: 10.1109/ICComm.2012.6262598 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6262598&isnumber=6262524 Zhong, Cencen; Miao, Zhenjiang; , "Data-specific concept correlation estimation for video annotation refinement," Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on , vol., no., pp.961-964, 25-30 March 2012 doi: 10.1109/ICASSP.2012.6288044 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6288044&isnumber=6287775 Cencen Zhong; Zhenjiang Miao; , "A Two-View Concept Correlation Based Video Annotation Refinement," Signal Processing Letters, IEEE , vol.19, no.5, pp.259-262, May 2012 doi: 10.1109/LSP.2012.2189386 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6159059&isnumber=6167335 Cai-Zhi Zhu and Shin'ichi Satoh. 2012. Large vocabulary quantization for searching instances from videos. In Proceedings of the 2nd ACM International Conference on Multimedia Retrieval (ICMR '12). ACM, New York, NY, USA, , Article 52 , 8 pages. DOI=10.1145/2324796.2324856 http://doi.acm.org/10.1145/2324796.2324856 --------------------------------------------------------------------- 2011 (75) --------------------------------------------------------------------- Almeida, J.; Leite, N.J.; da S Torres, R.; , "Comparison of video sequences with histograms of motion patterns," Image Processing (ICIP), 2011 18th IEEE International Conference on , vol., no., pp.3673-3676, 11-14 Sept. 2011 doi: 10.1109/ICIP.2011.6116516 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6116516&isnumber=6115588 Xavier Anguera, Juan Manuel Barrios, Tomasz Adamek, and Nuria Oliver. 2011. 
Multimodal fusion for video copy detection. In Proceedings of the 19th ACM international conference on Multimedia (MM '11). ACM, New York, NY, USA, 1221-1224. DOI=10.1145/2072298.2071979 http://doi.acm.org/10.1145/2072298.2071979 Baber, J.; Afzulpurkar, N.; Dailey, M.N.; Bakhtyar, M.; , "Shot boundary detection from videos using entropy and local descriptor," Digital Signal Processing (DSP), 2011 17th International Conference on , vol., no., pp.1-6, 6-8 July 2011 doi: 10.1109/ICDSP.2011.6004918 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6004918&isnumber=6004864 Chidansh Bhatt and Mohan Kankanhalli. 2011. Probabilistic temporal multimedia data mining. ACM Trans. Intell. Syst. Technol. 2, 2, Article 17 (February 2011), 19 pages. DOI=10.1145/1899412.1899421 http://doi.acm.org/10.1145/1899412.1899421 Werner Bailer. A Feature Sequence Kernel for Video Concept Classification. Proceedings of 17th Multimedia Modeling Conference, Taipei, TW, Jan. 2011, pp. 359-369. Werner Bailer. Sequence-based Kernels for Online Concept Detection in Video. AIEMPro '11: Proceedings of the 4th international workshop on Automated information extraction in media production, Scottsdale, AZ, USA, Dec. 2011. Bali, O.; Karray, H.; Ben Ammar, A.; Alimi, A.M.; , "Toward Interactive TV," Computational Intelligence and Intelligent Informatics (ISCIII), 2011 5th International Symposium on , vol., no., pp.31-36, 15-17 Sept. 
2011 doi: 10.1109/ISCIII.2011.6069737 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6069737&isnumber=6069732 Barrios, J.M.; Bustos, B.; , "P-VCD: A pivot-based approach for Content-Based Video Copy Detection," Multimedia and Expo (ICME), 2011 IEEE International Conference on , vol., no., pp.1-6, 11-15 July 2011 doi: 10.1109/ICME.2011.6012212 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6012212&isnumber=6011826 Chaisorn, L.; Yan-Tao Zheng; Sim, K.; , "Known-item Search (KIS) in video: Survey, experience and trend," Information, Communications and Signal Processing (ICICS) 2011 8th International Conference on , vol., no., pp.1-4, 13-16 Dec. 2011 doi: 10.1109/ICICS.2011.6173547 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6173547&isnumber=6173124 Chao Chen; Lin Lin; Mei-Ling Shyu; , "Utilization of Co-occurrence Relationships between Semantic Concepts in Re-ranking for Information Retrieval," Multimedia (ISM), 2011 IEEE International Symposium on , vol., no., pp.53-60, 5-7 Dec. 2011 doi: 10.1109/ISM.2011.18 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6123325&isnumber=6123309 Tianlong Chen, Shuqiang Jiang, Lingyang Chu, and Qingming Huang. 2011. Detection and location of near-duplicate video sub-clips by finding dense subgraphs. In Proceedings of the 19th ACM international conference on Multimedia (MM '11). ACM, New York, NY, USA, 1173-1176. DOI=10.1145/2072298.2071967 http://doi.acm.org/10.1145/2072298.2071967 Xiangang Cheng and Liang-Tien Chia. 2011. Spatially-coherent pyramid matching based on max-pooling. In Proceedings of the 19th ACM international conference on Multimedia (MM '11). ACM, New York, NY, USA, 1445-1448. DOI=10.1145/2072298.2072036 http://doi.acm.org/10.1145/2072298.2072036 Xiangang Cheng; Liang-Tien Chia; , "Stratification-Based Keyframe Cliques for Effective and Efficient Video Representation," Multimedia, IEEE Transactions on , vol.13, no.6, pp.1333-1342, Dec. 
2011 doi: 10.1109/TMM.2011.2167222 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6009224&isnumber=6069890 Xiangang Cheng, Yiqun Hu, and Liang-Tien Chia. 2011. Exploiting local dependencies with spatial-scale space (S-Cube) for near-duplicate retrieval. Comput. Vis. Image Underst. 115, 6 (June 2011), 750-758. DOI=10.1016/j.cviu.2011.02.003 http://dx.doi.org/10.1016/j.cviu.2011.02.003 Daniyal, F.; Cavallaro, A.; , "Abnormal motion detection in crowded scenes using local spatio-temporal analysis," Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on , vol., no., pp.1944-1947, 22-27 May 2011 doi: 10.1109/ICASSP.2011.5946889 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5946889&isnumber=5946226 Youdong Ding; Jianfei Zhang; Jun Li; Xiaocheng Wei; , "A Bag-of-Feature Model for Video Semantic Annotation," Image and Graphics (ICIG), 2011 Sixth International Conference on , vol., no., pp.696-701, 12-15 Aug. 2011 doi: 10.1109/ICIG.2011.135 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6005612&isnumber=6005527 F. Daniyal, A. Cavallaro. Abnormal motion detection in crowded scenes using local spatio-temporal analysis, In Proc. of IEEE Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP), pages 3913-3916. Prague, Czech Republic, 22-27 May 2011. Pınar Duygulu and Muhammet Baştan. 2011. Multimedia translation for linking visual data to semantics in videos. Mach. Vision Appl. 22, 1 (January 2011), 99-115. DOI=10.1007/s00138-009-0217-8 http://dx.doi.org/10.1007/s00138-009-0217-8 Nizar Elleuch, Mohamed Zarka, Anis Ben Ammar, and Adel M. Alimi. 2011. A fuzzy ontology: based framework for reasoning in visual video content analysis and indexing. In Proceedings of the Eleventh International Workshop on Multimedia Data Mining (MDMKDD '11). ACM, New York, NY, USA, , Article 1 , 8 pages.
DOI=10.1145/2237827.2237828 http://doi.acm.org/10.1145/2237827.2237828 Bailan Feng, Juan Cao, Xiuguo Bao, Lei Bao, Yongdong Zhang, Shouxun Lin, and Xiaochun Yun. 2011. Graph-based multi-space semantic correlation propagation for video retrieval. Vis. Comput. 27, 1 (January 2011), 21-34. DOI=10.1007/s00371-010-0510-6 http://dx.doi.org/10.1007/s00371-010-0510-6 Huamin Feng; Chao Jiang; Xinghua Yang; , "An audio classification and speech recognition system for video content analysis," Multimedia Technology (ICMT), 2011 International Conference on , vol., no., pp.5272-5276, 26-28 July 2011 doi: 10.1109/ICMT.2011.6002093 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6002093&isnumber=6001647 Bauke Freiburg, Jaap Kamps, and Cees G. M. Snoek, "Crowdsourcing Visual Detectors for Video Search," in Proceedings of the ACM International Conference on Multimedia, Scottsdale, AZ, USA, 2011. Garcia-Martin, A.; Hauptmann, A.; Martinez, J.M.; , "People detection based on appearance and motion models," Advanced Video and Signal-Based Surveillance (AVSS), 2011 8th IEEE International Conference on , vol., no., pp.256-260, Aug. 30 2011-Sept.
2 2011 doi: 10.1109/AVSS.2011.6027333 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6027333&isnumber=6027273 Gkalelis, N.; Mezaris, V.; Kompatsiaris, I.; , "High-level event detection in video exploiting discriminant concepts," Content-Based Multimedia Indexing (CBMI), 2011 9th International Workshop on , vol., no., pp.85-90, 13-15 June 2011 doi: 10.1109/CBMI.2011.5972525 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5972525&isnumber=5972508 Jinlin Guo; Foley, Colum; Gurrin, Cathal; Songyang Lao; , "Semantic concept detection in imbalanced datasets based on different under-sampling strategies," Multimedia and Expo (ICME), 2011 IEEE International Conference on , vol., no., pp.1-6, 11-15 July 2011 doi: 10.1109/ICME.2011.6011923 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6011923&isnumber=6011826 Hiep Van Hoang, Duy-Dinh Le, Shin'ichi Satoh, and Quang Hong Nguyen. 2011. Improving retake detection by adding motion feature. In Proceedings of the 16th international conference on Image analysis and processing - Volume Part II (ICIAP'11), Giuseppe Maino and Gian Luca Foresti (Eds.), Vol. Part II. Springer-Verlag, Berlin, Heidelberg, 150-157. Marijn Huijbregts and Franciska de Jong. 2011. Robust speech/non-speech classification in heterogeneous multimedia content. Speech Commun. 53, 2 (February 2011), 143-153. DOI=10.1016/j.specom.2010.08.008 http://dx.doi.org/10.1016/j.specom.2010.08.008 Wolfgang Hürst, Cees G. M. Snoek, Willem-Jan Spoel, and Mate Tomin, "Size Matters! How Thumbnail Number, Size, and Motion Influence Mobile Video Retrieval," in International Conference on MultiMedia Modeling, Taipei, Taiwan, 2011. Nakamasa Inoue and Koichi Shinoda. 2011. A fast MAP adaptation technique for gmm-supervector-based video semantic indexing systems. In Proceedings of the 19th ACM international conference on Multimedia (MM '11). ACM, New York, NY, USA, 1357-1360. 
DOI=10.1145/2072298.2072014 http://doi.acm.org/10.1145/2072298.2072014 Xiang Ji; Junwei Han; Xintao Hu; Kaiming Li; Fan Deng; Jun Fang; Lei Guo; Tianming Liu; , "Retrieving video shots in semantic brain imaging space using manifold-ranking," Image Processing (ICIP), 2011 18th IEEE International Conference on , vol., no., pp.3633-3636, 11-14 Sept. 2011 doi: 10.1109/ICIP.2011.6116505 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6116505&isnumber=6115588 Wei Jiang and Alexander Loui. 2011. Laplacian adaptive context-based SVM for video concept detection. In Proceedings of the 3rd ACM SIGMM international workshop on Social media (WSM '11). ACM, New York, NY, USA, 15-20. DOI=10.1145/2072609.2072615 http://doi.acm.org/10.1145/2072609.2072615 Ilseo Kim; Chin-Hui Lee; , "Optimization of average precision with Maximal Figure-of-Merit Learning," Machine Learning for Signal Processing (MLSP), 2011 IEEE International Workshop on , vol., no., pp.1-6, 18-21 Sept. 2011 doi: 10.1109/MLSP.2011.6064638 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6064638&isnumber=6064496 Ksibi, A.; Elleuch, N.; Ben Ammar, A.; Alimi, A.M.; , "Semi-automatic soft collaborative annotation for semantic video indexing," EUROCON - International Conference on Computer as a Tool (EUROCON), 2011 IEEE , vol., no., pp.1-6, 27-29 April 2011 doi: 10.1109/EUROCON.2011.5929417 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5929417&isnumber=5929030 Duy-Dinh Le; Satoh, S.; , "A Comprehensive Study of Feature Representations for Semantic Concept Detection," Semantic Computing (ICSC), 2011 Fifth IEEE International Conference on , vol., no., pp.235-238, 18-21 Sept. 2011 doi: 10.1109/ICSC.2011.92 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6061339&isnumber=6061289 Duy-Dinh Le; Satoh, S.; , "Indexing Faces in Broadcast News Video Archives," Data Mining Workshops (ICDMW), 2011 IEEE 11th International Conference on , vol., no., pp.519-526, 11-11 Dec. 
2011 doi: 10.1109/ICDMW.2011.101 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6137423&isnumber=6137352 Lin Lin; Chao Chen; Mei-Ling Shyu; Shu-Ching Chen; , "Weighted Subspace Filtering and Ranking Algorithms for Video Concept Retrieval," MultiMedia, IEEE , vol.18, no.3, pp.32-43, March 2011 doi: 10.1109/MMUL.2011.35 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5765919&isnumber=5986505 Nan Liu; Yao Zhao; Zhenfeng Zhu; Hanqing Lu; , "Exploiting Visual-Audio-Textual Characteristics for Automatic TV Commercial Block Detection and Segmentation," Multimedia, IEEE Transactions on , vol.13, no.5, pp.961-973, Oct. 2011 doi: 10.1109/TMM.2011.2160334 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5928417&isnumber=6018340 Yuan Liu; Tao Mei; , "Optimizing Visual Search Reranking via Pairwise Learning," Multimedia, IEEE Transactions on , vol.13, no.2, pp.280-291, April 2011 doi: 10.1109/TMM.2010.2103931 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5680970&isnumber=5732768 Mezaris, V.; Sidiropoulos, P.; Kompatsiaris, I.; , "Improving Interactive Video Retrieval by Exploiting Automatically-Extracted Video Structural Semantics," Semantic Computing (ICSC), 2011 Fifth IEEE International Conference on , vol., no., pp.224-227, 18-21 Sept. 2011 doi: 10.1109/ICSC.2011.29 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6061337&isnumber=6061289 Min, Hyun-seok; Jae Young Choi; De Neve, Wesley; Ro, Yong Man; , "Leveraging an image folksonomy and the Signature Quadratic Form Distance for semantic-based detection of near-duplicate video clips," Multimedia and Expo (ICME), 2011 IEEE International Conference on , vol., no., pp.1-6, 11-15 July 2011 doi: 10.1109/ICME.2011.6011937 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6011937&isnumber=6011826 Hyun-seok Min, Jae Young Choi, Wesley De Neve, and Yong Man Ro. 2011. 
Bimodal fusion of low-level visual features and high-level semantic features for near-duplicate video clip detection. Image Commun. 26, 10 (November 2011), 612-627. DOI=10.1016/j.image.2011.04.001 http://dx.doi.org/10.1016/j.image.2011.04.001 Lin Pang, Juan Cao, Lei Bao, Yongdong Zhang, and Shouxun Lin. 2011. Towards hierarchical context: unfolding visual community potential for interactive video retrieval. Multimedia Tools Appl. 55, 1 (October 2011), 151-178. DOI=10.1007/s11042-010-0605-0 http://dx.doi.org/10.1007/s11042-010-0605-0 Sanjay Purushotham, Qi Tian, and C.-C. Jay Kuo. 2011. Picture-in-picture copy detection using spatial coding techniques. In Proceedings of the 2011 ACM international workshop on Automated media analysis and production for novel TV services (AIEMPro '11), Jean-Pierre Evain, Gerald Friedland, Masanori Sano, and Patrick Gros (Eds.). ACM, New York, NY, USA, 25-30. DOI=10.1145/2072552.2072559 http://doi.acm.org/10.1145/2072552.2072559 Rajendran, D.; Shivakumara, P.; Bolan Su; Shijian Lu; Chew Lim Tan; , "A New Fourier-Moments Based Video Word and Character Extraction Method for Recognition," Document Analysis and Recognition (ICDAR), 2011 International Conference on , vol., no., pp.1165-1169, 18-21 Sept. 2011 doi: 10.1109/ICDAR.2011.235 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6065493&isnumber=6065247 Ranathunga, Lochandaka; Zainuddin, Roziati; Abdullah, Nor. Performance evaluation of the combination of Compacted Dither Pattern Codes with Bhattacharyya classifier in video visual concept depiction. Multimedia Tools and Applications, 2011-08-01, Springer Netherlands, pp. 263-289, Vol. 54, Issue 2, dx.doi.org/10.1007/s11042-010-0522-2, 10.1007/s11042-010-0522-2 Miriam Redi and Bernard Merialdo. 2011. Marginal-based visual alphabets for local image descriptors aggregation. In Proceedings of the 19th ACM international conference on Multimedia (MM '11). ACM, New York, NY, USA, 1429-1432. 
DOI=10.1145/2072298.2072032 http://doi.acm.org/10.1145/2072298.2072032 Miriam Redi and Bernard Merialdo. 2011. Saliency moments for image categorization. In Proceedings of the 1st ACM International Conference on Multimedia Retrieval (ICMR '11). ACM, New York, NY, USA, , Article 39 , 8 pages. DOI=10.1145/1991996.1992035 http://doi.acm.org/10.1145/1991996.1992035 Reede Ren, John Collomosse, and Joemon Jose. 2011. A BOVW based query generative model. In Proceedings of the 17th international conference on Advances in multimedia modeling - Volume Part I (MMM'11), Kuo-Tien Lee, Jun-Wei Hsieh, Wen-Hsiang Tsai, Hong-Yuan Mark Liao, and Tsuhan Chen (Eds.), Vol. Part I. Springer-Verlag, Berlin, Heidelberg, 118-128. Roopalakshmi, R.; Reddy, G.R.M.; , "A Novel CBCD Approach Using MPEG-7 Motion Activity Descriptors," Multimedia (ISM), 2011 IEEE International Symposium on , vol., no., pp.179-184, 5-7 Dec. 2011 doi: 10.1109/ISM.2011.36 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6123343&isnumber=6123309 Roopalakshmi, R.; Reddy, G.R.M.; , "Towards a new approach to video copy detection using acoustic features," Internet Multimedia Systems Architecture and Application (IMSAA), 2011 IEEE 5th International Conference on , vol., no., pp.1-5, 12-13 Dec. 2011 doi: 10.1109/IMSAA.2011.6156336 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6156336&isnumber=6156331 Bahjat Safadi and Georges Quénot. 2011. Re-ranking by local re-scoring for video indexing and retrieval. In Proceedings of the 20th ACM international conference on Information and knowledge management (CIKM '11), Bettina Berendt, Arjen de Vries, Wenfei Fan, Craig Macdonald, Iadh Ounis, and Ian Ruthven (Eds.). ACM, New York, NY, USA, 2081-2084. DOI=10.1145/2063576.2063895 http://doi.acm.org/10.1145/2063576.2063895 Bahjat Safadi and Georges Quénot. 2011. Re-ranking for multimedia indexing and retrieval. 
In Proceedings of the 33rd European conference on Advances in information retrieval (ECIR'11), Paul Clough, Colum Foley, Cathal Gurrin, Hyowon Lee, and Gareth J. F. Jones (Eds.). Springer-Verlag, Berlin, Heidelberg, 708-711. Markus Seidl, Matthias Zeppelzauer, Dalibor Mitrović, and Christian Breiteneder. 2011. Gradual transition detection in historic film material—a systematic study. J. Comput. Cult. Herit. 4, 3, Article 10 (December 2011), 18 pages. DOI=10.1145/2069276.2069279 http://doi.acm.org/10.1145/2069276.2069279 Kimiaki Shirahama, Yuta Matsuoka, and Kuniaki Uehara. 2011. Video event retrieval from a small number of examples using rough set theory. In Proceedings of the 17th international conference on Advances in multimedia modeling - Volume Part I (MMM'11), Kuo-Tien Lee, Jun-Wei Hsieh, Wen-Hsiang Tsai, Hong-Yuan Mark Liao, and Tsuhan Chen (Eds.), Vol. Part I. Springer-Verlag, Berlin, Heidelberg, 96-106. Shirahama, K.; Uehara, K.; , "Query by Virtual Example: Video Retrieval Using Example Shots Created by Virtual Reality Techniques," Image and Graphics (ICIG), 2011 Sixth International Conference on , vol., no., pp.829-834, 12-15 Aug. 2011 doi: 10.1109/ICIG.2011.158 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6005957&isnumber=6005527 Shirahama, K.; Uehara, K.; , "Utilizing Video Ontology for Fast and Accurate Query-by-Example Retrieval," Semantic Computing (ICSC), 2011 Fifth IEEE International Conference on , vol., no., pp.395-402, 18-21 Sept. 2011 doi: 10.1109/ICSC.2011.88 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6061440&isnumber=6061289 Kimiaki Shirahama and Kuniaki Uehara. 2011. Effectiveness of video ontology in query by example approach. In Proceedings of the 7th international conference on Active media technology (AMT'11), Ning Zhong, Vic Callaghan, Ali A. Ghorbani, and Bin Hu (Eds.). Springer-Verlag, Berlin, Heidelberg, 49-58. 
Shivakumara, P.; Trung Quy Phan; Shijian Lu; Chew Lim Tan; , "Video Character Recognition through Hierarchical Classification," Document Analysis and Recognition (ICDAR), 2011 International Conference on , vol., no., pp.131-135, 18-21 Sept. 2011 doi: 10.1109/ICDAR.2011.35 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6065290&isnumber=6065247 Shivakumara, P.; Bhowmick, S.; Bolan Su; Tan, C.L.; Pal, U.; , "A New Gradient Based Character Segmentation Method for Video Text Recognition," Document Analysis and Recognition (ICDAR), 2011 International Conference on , vol., no., pp.126-130, 18-21 Sept. 2011 doi: 10.1109/ICDAR.2011.34 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6065289&isnumber=6065247 Sidiropoulos, P.; Mezaris, V.; Kompatsiaris, I.; Meinedo, H.; Bugalho, M.; Trancoso, I.; , "Temporal Video Segmentation to Scenes Using High-Level Audiovisual Features," Circuits and Systems for Video Technology, IEEE Transactions on , vol.21, no.8, pp.1163-1177, Aug. 2011 doi: 10.1109/TCSVT.2011.2138830 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5742987&isnumber=5970267 Mats Sjöberg and Jorma Laaksonen. 2011. Analysing the structure of semantic concepts in visual databases. In Proceedings of the 8th international conference on Advances in self-organizing maps (WSOM'11), Jorma Laaksonen and Timo Honkela (Eds.). Springer-Verlag, Berlin, Heidelberg, 338-347. 
Takahashi, M.; Naemura, M.; Fujii, M.; Satoh, S.; , "Human action recognition in crowded surveillance video sequences by using features taken from key-point trajectories," Computer Vision and Pattern Recognition Workshops (CVPRW), 2011 IEEE Computer Society Conference on , vol., no., pp.9-16, 20-25 June 2011 doi: 10.1109/CVPRW.2011.5981713 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5981713&isnumber=5981671 Tapu, R.; Mocanu, B.; Raducanu, M.; Petrescu, T.; , "Multiresolution median filtering based video temporal segmentation," Signals, Circuits and Systems (ISSCS), 2011 10th International Symposium on , vol., no., pp.1-4, June 30 2011-July 1 2011 doi: 10.1109/ISSCS.2011.5978651 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5978651&isnumber=5978636 Yonghong Tian; Menglin Jiang; Luntian Mou; Xiaoyu Fang; Tiejun Huang; , "A multimodal video copy detection approach with sequential pyramid matching," Image Processing (ICIP), 2011 18th IEEE International Conference on , vol., no., pp.3629-3632, 11-14 Sept. 2011 doi: 10.1109/ICIP.2011.6116504 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6116504&isnumber=6115588 Ioannis Tsampoulatidis, Nikolaos Gkalelis, Anastasios Dimou, Vasileios Mezaris, and Ioannis Kompatsiaris. 2011. High-level event detection system based on discriminant visual concepts. In Proceedings of the 1st ACM International Conference on Multimedia Retrieval (ICMR '11). ACM, New York, NY, USA, Article 68, 2 pages. DOI=10.1145/1991996.1992064 http://doi.acm.org/10.1145/1991996.1992064 Yusuke Uchida, Motilal Agrawal, and Shigeyuki Sakazawa, "Accurate Content-Based Video Copy Detection with Efficient Feature Indexing," Proceedings of the 1st ACM International Conference on Multimedia Retrieval, Trento, Italy, 2011.
http://dl.acm.org/citation.cfm?id=1992015 Vahdat, A.; Bo Gao; Ranjbar, M.; Mori, G.; , "A discriminative key pose sequence model for recognizing human interactions," Computer Vision Workshops (ICCV Workshops), 2011 IEEE International Conference on , vol., no., pp.1729-1736, 6-13 Nov. 2011 doi: 10.1109/ICCVW.2011.6130458 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6130458&isnumber=6130192 Koen E. A. van de Sande, Theo Gevers, and Cees G. M. Snoek, "Empowering Visual Categorization with the GPU," IEEE Transactions on Multimedia, vol. 13, iss. 1, pp. 60-70, 2011. Hung Thanh Vu; Thanh Duc Ngo; Thao Ngoc Nguyen; Duy-Dinh Le; Satoh, S.; Bac Hoai Le; Duc Anh Duong; , "Fast face sequence matching in large-scale video databases," Image Processing (ICIP), 2011 18th IEEE International Conference on , vol., no., pp.2549-2552, 11-14 Sept. 2011 doi: 10.1109/ICIP.2011.6116183 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6116183&isnumber=6115588 Kong-Wah Wan, Yan-Tao Zheng, and Lekha Chaisorn. 2011. Known-item video search via query-to-modality mapping. In Proceedings of the 19th ACM international conference on Multimedia (MM '11). ACM, New York, NY, USA, 1133-1136. DOI=10.1145/2072298.2071957 http://doi.acm.org/10.1145/2072298.2071957 Jingdong Wang, Yinghai Zhao, Xiuqing Wu, and Xian-Sheng Hua. 2011. A transductive multi-label learning approach for video concept detection. Pattern Recogn. 44, 10-11 (October 2011), 2274-2286. DOI=10.1016/j.patcog.2010.07.015 http://dx.doi.org/10.1016/j.patcog.2010.07.015 Lei Wang, Dawei Song, and Eyad Elyan. 2011. Words-of-interest selection based on temporal motion coherence for video retrieval. In Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval (SIGIR '11). ACM, New York, NY, USA, 1197-1198. 
DOI=10.1145/2009916.2010117 http://doi.acm.org/10.1145/2009916.2010117 Lezi Wang; Yuan Dong; Hongliang Bai; Wei Liu; Kun Tao; , "A word-based approach for duplicate picture in picture sequence detection," Broadband Network and Multimedia Technology (IC-BNMT), 2011 4th IEEE International Conference on , vol., no., pp.286-290, 28-30 Oct. 2011 doi: 10.1109/ICBNMT.2011.6155942 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6155942&isnumber=6155882 Xiangyu Wang, Yong Rui, and Mohan S. Kankanhalli. 2011. Up-fusion: an evolving multimedia decision fusion method. In Proceedings of the 19th ACM international conference on Multimedia (MM '11). ACM, New York, NY, USA, 1089-1092. DOI=10.1145/2072298.2071945 http://doi.acm.org/10.1145/2072298.2071945 Xiao-Yong Wei and Zhen-Qun Yang. 2011. Coached active learning for interactive video search. In Proceedings of the 19th ACM international conference on Multimedia (MM '11). ACM, New York, NY, USA, 443-452. DOI=10.1145/2072298.2072356 http://doi.acm.org/10.1145/2072298.2072356 Xin-Shun Xu, Yuan Jiang, Liang Peng, Xiangyang Xue, and Zhi-Hua Zhou. 2011. Ensemble approach based on conditional random field for multi-label image and video annotation. In Proceedings of the 19th ACM international conference on Multimedia (MM '11). ACM, New York, NY, USA, 1377-1380. DOI=10.1145/2072298.2072019 http://doi.acm.org/10.1145/2072298.2072019 Xin-Shun Xu, Xiangyang Xue, and Zhi-Hua Zhou. 2011. Ensemble multi-instance multi-label learning approach for video annotation task. In Proceedings of the 19th ACM international conference on Multimedia (MM '11). ACM, New York, NY, USA, 1153-1156. DOI=10.1145/2072298.2071962 http://doi.acm.org/10.1145/2072298.2071962 Jian Yi, Yuxin Peng, and Jianguo Xiao. 2011. Mining concept relationship in temporal context for effective video annotation. In Proceedings of the 19th ACM international conference on Multimedia (MM '11). ACM, New York, NY, USA, 1053-1056. 
DOI=10.1145/2072298.2071936 http://doi.acm.org/10.1145/2072298.2071936 Jin Yuan, Zheng-Jun Zha, Yan-Tao Zheng, Meng Wang, Xiangdong Zhou, and Tat-Seng Chua. 2011. Learning concept bundles for video search with complex queries. In Proceedings of the 19th ACM international conference on Multimedia (MM '11). ACM, New York, NY, USA, 453-462. DOI=10.1145/2072298.2072357 http://doi.acm.org/10.1145/2072298.2072357 Jin Yuan; Zheng-Jun Zha; Yan-Tao Zheng; Meng Wang; Xiangdong Zhou; Tat-Seng Chua; , "Utilizing Related Samples to Enhance Interactive Concept-Based Video Search," Multimedia, IEEE Transactions on , vol.13, no.6, pp.1343-1355, Dec. 2011 doi: 10.1109/TMM.2011.2168813 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6022804&isnumber=6069890 Lu Zhang, Tao Mei, Yuan Liu, Dacheng Tao, and He-Qin Zhou. 2011. Visual search reranking via adaptive particle swarm optimization. Pattern Recogn. 44, 8 (August 2011), 1811-1820. DOI=10.1016/j.patcog.2011.01.016 http://dx.doi.org/10.1016/j.patcog.2011.01.016 Qiusha Zhu; Lin Lin; Mei-Ling Shyu; Shu-Ching Chen; , "Effective supervised discretization for classification based on correlation maximization," Information Reuse and Integration (IRI), 2011 IEEE International Conference on , vol., no., pp.390-395, 3-5 Aug. 2011 doi: 10.1109/IRI.2011.6009579 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6009579&isnumber=6009494 --------------------------------------------------------------------- 2010 (72) --------------------------------------------------------------------- Aly, Robin and Doherty, Aiden and Hiemstra, Djoerd and Smeaton, A. 2010. Beyond Shot Retrieval: Searching for Broadcast News Items Using Language Models of Concepts. In ECIR '10: Proceedings of the 32nd European Conference on IR Research on Advances in Information Retrieval, Lecture Notes in Computer Science, Vol 5993, pp. 241-252, Springer Verlag.
Amiri, A.; Fathy, M.; Naseri, A.; , "Key-frame extraction and video summarization using QR-Decomposition," Digital Content, Multimedia Technology and its Applications (IDC), 2010 6th International Conference on , vol., no., pp.134-139, 16-18 Aug. 2010 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5568717&isnumber=5568515 Asaidi, H.; Aarab, A.; , "Visual video retrieval using Multivariate GARCH models," I/V Communications and Mobile Network (ISVC), 2010 5th International Symposium on , vol., no., pp.1-4, Sept. 30 2010-Oct. 2 2010 doi: 10.1109/ISVC.2010.5656176 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5656176&isnumber=5654712 Ates, T.K.; Esen, E.; Saracoglu, A.; Soysal, M.; Turgut, Y.; Oktay, O.; Alatan, A.A.; , "Content based video copy detection with local descriptors," Signal Processing and Communications Applications Conference (SIU), 2010 IEEE 18th , vol., no., pp.49-52, 22-24 April 2010 doi: 10.1109/SIU.2010.5654395 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5654395&isnumber=5648807 Stephane Ayache, Georges Quenot, Andy Tseng. 2010. The LIGVID system for video retrieval and concept annotation. March 2010 MIR '10: Proceedings of the international conference on Multimedia information retrieval Werner Bailer. Evaluating Detection of Near Duplicate Video Segments. Proceedings of the ACM International Conference on Image and Video Retrieval, Xian, China, July 2010. Werner Bailer, Wolfgang Weiss, Gert Kienast, Georg Thallinger and Werner Haas. A Video Browsing Tool for Content Management in Post-production. International Journal of Digital Multimedia Broadcasting, Mar. 2010.
Chantamunee, S.; Gotoh, Y.; , "Nearly-repetitive video synchronisation using nonlinear manifold embedding," Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on , vol., no., pp.2282-2285, 14-19 March 2010 doi: 10.1109/ICASSP.2010.5495925 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5495925&isnumber=5494886 Chayanurak, R.; Cooharojananone, N.; Satoh, S.; Lipikorn, R.; , "Carried object detection using star skeleton with adaptive centroid and time series graph," Signal Processing (ICSP), 2010 IEEE 10th International Conference on , vol., no., pp.736-739, 24-28 Oct. 2010 doi: 10.1109/ICOSP.2010.5655765 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5655765&isnumber=5654687 Juan Chen; , "Detection of video copies based on robust descriptors," Apperceiving Computing and Intelligence Analysis (ICACIA), 2010 International Conference on , vol., no., pp.303-306, 17-19 Dec. 2010 doi: 10.1109/ICACIA.2010.5709906 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5709906&isnumber=5709837 Shi Chen; Jinqiao Wang; Yi Ouyang; Bo Wang; Qi Tian; Hanqing Lu; , "Multi-level trajectory modeling for video copy detection," Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on , vol., no., pp.2378-2381, 14-19 March 2010 doi: 10.1109/ICASSP.2010.5496165 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5496165&isnumber=5494886 Xiaolin Chen; Xiaokang Yang; Rui Zhang; Anwen Liu; Shibao Zheng; , "Edge region color autocorrelogram: A new low-level feature applied in CBIR," Broadband Multimedia Systems and Broadcasting (BMSB), 2010 IEEE International Symposium on , vol., no., pp.1-4, 24-26 March 2010 doi: 10.1109/ISBMSB.2010.5463087 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5463087&isnumber=5463070 Xiangang Cheng, Liang-Tien Chia. 2010. Stratification-based keyframe cliques for removal of near-duplicates in video search results. 
March 2010 MIR '10: Proceedings of the international conference on Multimedia information retrieval Cirakman, O.; Gunsel, B.; Sengor, N.S.; Gursoy, O.; , "Key-frame based video fingerprinting by NMF," Image Processing (ICIP), 2010 17th IEEE International Conference on , vol., no., pp.2373-2376, 26-29 Sept. 2010 doi: 10.1109/ICIP.2010.5652649 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5652649&isnumber=5648792 de Rooij, O.; Worring, M.; , "Browsing Video Along Multiple Threads," Multimedia, IEEE Transactions on , vol.12, no.2, pp.121-130, Feb. 2010 doi: 10.1109/TMM.2009.2037388 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5340554&isnumber=5379168 Ding, G.; Qin, K.; , "Semantic classifier based on compressed sensing for image and video annotation," Electronics Letters , vol.46, no.6, pp.417-419, March 18 2010 doi: 10.1049/el.2010.2295 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5434620&isnumber=5434595 Diou, C.; Stephanopoulos, G.; Panagiotopoulos, P.; Papachristou, C.; Dimitriou, N.; Delopoulos, A.; , "Large-Scale Concept Detection in Multimedia Data Using Small Training Sets and Cross-Domain Concept Fusion," Circuits and Systems for Video Technology, IEEE Transactions on , vol.20, no.12, pp.1808-1821, Dec. 
2010 doi: 10.1109/TCSVT.2010.2087814 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5604666&isnumber=5704816 Douze, M.; Jegou, H.; Schmid, C.; , "An Image-Based Approach to Video Copy Detection With Spatio-Temporal Post-Filtering," Multimedia, IEEE Transactions on , vol.12, no.4, pp.257-266, June 2010 doi: 10.1109/TMM.2010.2046265 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5437235&isnumber=5463236 Feng, Y.; Ren, J.; Jiang, J.; , "Mixed ranking scheme for video retrieval," Electronics Letters , vol.46, no.24, pp.1600-1601, November 25 2010 doi: 10.1049/el.2010.8621 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5659664&isnumber=5659647 Ke Gao, Yongdong Zhang, Wei Zhang, Shouxun Lin. 2010. Affine Stable Characteristic based sample expansion for object detection. July 2010 CIVR '10: Proceedings of the ACM International Conference on Image and Video Retrieval Bo Geng; Linjun Yang; Chao Xu; Xian-Sheng Hua; , "Content-aware Ranking for visual search," Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on , vol., no., pp.3400-3407, 13-18 June 2010 doi: 10.1109/CVPR.2010.5540003 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5540003&isnumber=5539770 Gupta, V.; Boulianne, G.; Cardinal, P.; , "Content-based audio copy detection using nearest-neighbor mapping," Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on , vol., no., pp.261-264, 14-19 March 2010 doi: 10.1109/ICASSP.2010.5495963 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5495963&isnumber=5494886 Gupta, V.; Boulianne, G.; Cardinal, P.; , "Crim's content-based audio copy detection system for TRECVID 2009," Content-Based Multimedia Indexing (CBMI), 2010 International Workshop on , vol., no., pp.1-6, 23-25 June 2010 doi: 10.1109/CBMI.2010.5529908 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5529908&isnumber=5529836 Gürsoy, O.; Kutluk, S.; Günsel, B.; Şengör, N.; , "Negatif 
olmayan matris ayrıştirma ile ikili video kiyimlama binary video hashing by non-negative matrix factorization," Signal Processing and Communications Applications Conference (SIU), 2010 IEEE 18th , vol., no., pp.894-897, 22-24 April 2010 doi: 10.1109/SIU.2010.5651268 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5651268&isnumber=5648807 Hongzhong Tang; Huixian Huang; Songhao Zhu; , "Video concept detection based on spatio-temporal correlation," Computer Application and System Modeling (ICCASM), 2010 International Conference on , vol.8, no., pp.V8-638-V8-642, 22-24 Oct. 2010 doi: 10.1109/ICCASM.2010.5620186 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5620186&isnumber=5619026 Wolfgang Huerst, Cees G. M. Snoek, Willem-Jan Spoel, and Mate Tomin, "Keep Moving! Revisiting Thumbnails for Mobile Video Retrieval," in Proceedings of the ACM International Conference on Multimedia, Firenze, Italy, 2010. Bouke Huurnink, Cees G. M. Snoek, Maarten de Rijke, and Arnold W. M. Smeulders, "Today's and Tomorrow's Retrieval Practice in the Audiovisual Archive," in Proceedings of the ACM International Conference on Image and Video Retrieval, Xi'an, China, 2010, pp. 18-25. Nakamasa Inoue, Tatsuhiko Saito, Koichi Shinoda and Sadaoki Furui, "High-Level Feature Extraction Using SIFT GMMs and Audio Models", In Proceedings of the International Conference on Pattern Recognition, pp. 3220-3223, Istanbul, Turkey, August 2010. Yu-Gang Jiang; Jun Yang; Chong-Wah Ngo; Hauptmann, A.G.; , "Representations of Keypoint-Based Semantic Concept Detection: A Comprehensive Study," Multimedia, IEEE Transactions on , vol.12, no.1, pp.42-53, Jan. 
2010 doi: 10.1109/TMM.2009.2036235 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5332300&isnumber=5353832 Huan Li; Yuan Shi; Mingyu Chen; Hauptmann, A.; Zhang Xiong; , "Joint-AL: Joint Discriminative and Generative Active Learning for Cross-Domain Semantic Concept Classification," Semantic Computing (ICSC), 2010 IEEE Fourth International Conference on , vol., no., pp.60-66, 22-24 Sept. 2010 doi: 10.1109/ICSC.2010.86 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5628856&isnumber=5628562 LI Li; Weiming Hu; Bing Li; Chunfeng Yuan; Pengfei Zhu; Wanqing Li; , "Event Recognition Based on Top-Down Motion Attention," Pattern Recognition (ICPR), 2010 20th International Conference on , vol., no., pp.3561-3564, 23-26 Aug. 2010 doi: 10.1109/ICPR.2010.869 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5597831&isnumber=5595735 Yuanning Li, Yonghong Tian, Jingjing Yang, Ling-Yu Duan, Wen Gao. 2010. Video retargeting with multi-scale trajectory optimization. March 2010 MIR '10: Proceedings of the international conference on Multimedia information retrieval Yuanning Li; Yonghong Tian; Ling-Yu Duan; Jingjing Yang; Tiejun Huang; Wen Gao; , "Sequence Multi-Labeling: A Unified Video Annotation Scheme With Spatial and Temporal Context," Multimedia, IEEE Transactions on , vol.12, no.8, pp.814-828, Dec. 2010 doi: 10.1109/TMM.2010.2066960 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5549915&isnumber=5623225 Ke-yan Liu; Tong Zhang; Lei Wang; , "A new parallel video understanding and retrieval system," Multimedia and Expo (ICME), 2010 IEEE International Conference on , vol., no., pp.679-684, 19-23 July 2010 doi: 10.1109/ICME.2010.5583873 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5583873&isnumber=5582530 Zhu Liu, Tao Liu, David C. Gibbon, Behzad Shahraray. 2010. 
Effective and scalable video copy detection. March 2010 MIR '10: Proceedings of the international conference on Multimedia information retrieval Yang Liu, Wan-Lei Zhao, Chong-Wah Ngo, Chang-Sheng Xu, Han-Qing Lu. 2010. Coherent bag-of audio words model for efficient large-scale video copy detection. July 2010 CIVR '10: Proceedings of the ACM International Conference on Image and Video Retrieval Shiyang Lu; Zhiyong Wang; Meng Wang; Ott, M.; Dagan Feng; , "Adaptive reference frame selection for near-duplicate video shot detection," Image Processing (ICIP), 2010 17th IEEE International Conference on , vol., no., pp.2341-2344, 26-29 Sept. 2010 doi: 10.1109/ICIP.2010.5649254 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5649254&isnumber=5648792 Mezaris, V.; Sidiropoulos, P.; Dimou, A.; Kompatsiaris, I.; , "On the Use of Visual Soft Semantics for Video Temporal Decomposition to Scenes," Semantic Computing (ICSC), 2010 IEEE Fourth International Conference on , vol., no., pp.141-148, 22-24 Sept. 2010 doi: 10.1109/ICSC.2010.23 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5628943&isnumber=5628562 Dianting Liu; Mei-Ling Shyu; Chao Chen; Shu-Ching Chen; , "Integration of global and local information in videos for key frame extraction," Information Reuse and Integration (IRI), 2010 IEEE International Conference on , vol., no., pp.171-176, 4-6 Aug. 2010 doi: 10.1109/IRI.2010.5558944 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5558944&isnumber=5558895 Natsev, A.; Hill, M.; Smith, J.R.; , "Design and evaluation of an effective and efficient video copy detection system," Multimedia and Expo (ICME), 2010 IEEE International Conference on , vol., no., pp.1353-1358, 19-23 July 2010 doi: 10.1109/ICME.2010.5583216 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5583216&isnumber=5582530 Thao Ngoc Nguyen, Thanh Duc Ngo, Duy-Dinh Le, Shin'ichi Satoh, Bac Hoai Le, Duc Anh Duong. 2010.
An efficient method for face retrieval from large video datasets. July 2010 CIVR '10: Proceedings of the ACM International Conference on Image and Video Retrieval Lin Pang, Juan Cao, Yongdong Zhang, Shouxun Lin. 2010. Hierarchical feedback algorithm based on visual community discovery for interactive video retrieval. July 2010 CIVR '10: Proceedings of the ACM International Conference on Image and Video Retrieval Pinheiro, A.M.G.; , "Performance analysis of the Edge Pixel Orientations Histogram," Image Analysis for Multimedia Interactive Services (WIAMIS), 2010 11th International Workshop on , vol., no., pp.1-4, 12-14 April 2010 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5617641&isnumber=5617638 Yu Qiu; Genliang Guan; Zhiyong Wang; Dagan Feng; , "Improving News Video Annotation with Semantic Context," Digital Image Computing: Techniques and Applications (DICTA), 2010 International Conference on , vol., no., pp.214-219, 1-3 Dec. 2010 doi: 10.1109/DICTA.2010.47 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5692567&isnumber=5692215 Ranathunga, L.; Zainuddin, R.; Abdullah, N.A.; , "Semantic visual search with feature space reduction," Information and Automation for Sustainability (ICIAFs), 2010 5th International Conference on , vol., no., pp.463-468, 17-19 Dec. 2010 doi: 10.1109/ICIAFS.2010.5715706 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5715706&isnumber=5715624 Roth, G.; Laganière, R.; Lambert, P.; Lakhmiri, I.; Janati, T.; , "A Simple but Effective Approach to Video Copy Detection," Computer and Robot Vision (CRV), 2010 Canadian Conference on , vol., no., pp.63-70, May 31 2010-June 2 2010 doi: 10.1109/CRV.2010.15 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5479485&isnumber=5479157 Stevan Rudinac, Martha Larson, Alan Hanjalic. 2010. Visual concept-based selection of query expansions for spoken content retrieval.
July 2010 SIGIR '10: Proceeding of the 33rd international ACM SIGIR conference on Research and development in information retrieval Safadi, B.; Quenot, G.; , "Active learning with multiple classifiers for multimedia indexing," Content-Based Multimedia Indexing (CBMI), 2010 International Workshop on , vol., no., pp.1-6, 23-25 June 2010 doi: 10.1109/CBMI.2010.5529910 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5529910&isnumber=5529836 Saracoğlu, A.; Tekin, M.; Esen, E.; Soysal, M.; Loğoğlu, K.B.; Ateş, T.K.; Sevinç, A.M.; Sevimli, H.; Acar, B.O.; Zubari, U.; Ozan, E.C.; Alatan, A.A.; , "Generalized visual concept detection," Signal Processing and Communications Applications Conference (SIU), 2010 IEEE 18th , vol., no., pp.621-624, 22-24 April 2010 doi: 10.1109/SIU.2010.5650360 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5650360&isnumber=5648807 Klaus Schoeffmann, Frank Hopfgartner, Oge Marques, Laszlo Boeszoermenyi, and Joemon M. Jose. 2010. Video browsing interfaces and applications: a review. SPIE Reviews 1, 018004 (2010), DOI:10.1117/6.0000005 Markus Seidl, Matthias Zeppelzauer, and Christian Breiteneder. 2010. A study of gradual transition detection in historic film material. In Proceedings of the second workshop on eHeritage and digital art preservation (eHeritage '10). ACM, New York, NY, USA, 13-18. DOI=10.1145/1877922.1877929 http://doi.acm.org/10.1145/1877922.1877929 Shirahama, Kimiaki; Lin Yanpeng; Matsuoka, Yuta; Uehara, Kuniaki; , "Query by example for large-scale video data by parallelizing rough set theory based on MapReduce," Science and Social Research (CSSR), 2010 International Conference on , vol., no., pp.390-395, 5-7 Dec. 2010 doi: 10.1109/CSSR.2010.5773806 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5773806&isnumber=5773667 Cees G. M. Snoek and Arnold W. M. Smeulders, "Visual-Concept Search Solved?," IEEE Computer, vol. 43, iss. 6, pp. 76-78, 2010.
Tahayna, B.; Belkhatir, M.; Alhashmi, S.M.; O'Daniel, T.; , "Human action detection and classification using optimal bag-of-words representation," Digital Content, Multimedia Technology and its Applications (IDC), 2010 6th International Conference on , vol., no., pp.75-80, 16-18 Aug. 2010 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5568597&isnumber=5568515 Tuan Hue Thi; Jian Zhang; Li Cheng; Li Wang; Satoh, S.; , "Human Action Recognition and Localization in Video Using Structured Learning of Local Space-Time Features," Advanced Video and Signal Based Surveillance (AVSS), 2010 Seventh IEEE International Conference on , vol., no., pp.204-211, Aug. 29 2010-Sept. 1 2010 doi: 10.1109/AVSS.2010.76 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5597147&isnumber=5597063 Jeff Ubois, Jamie Davidson, Marko Grobelnik, Paul Over, Hans Westerhof. 2010. Video search: are algorithms all we need? April 2010 WWW '10: Proceedings of the 19th international conference on World wide web Uijlings, J.R.R.; Smeulders, A.W.M.; Scha, R.J.H.; , "Real-Time Visual Concept Classification," Multimedia, IEEE Transactions on , vol.12, no.7, pp.665-681, Nov. 2010 doi: 10.1109/TMM.2010.2052027 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5482156&isnumber=5601986 Adrian Ulges, Christian Schulze, Markus Koch, and Thomas M. Breuel. 2010. Learning automatic concept detectors from online video. Computer Vision and Image Understanding Volume 114, Issue 4, April 2010, Pages 429-438 David Vallet, Ivan Cantador, Joemon M. Jose. 2010. Exploiting external knowledge to improve video retrieval. March 2010 MIR '10: Proceedings of the international conference on Multimedia information retrieval David Vallet, Frank Hopfgartner, Joemon M. Jose, and Pablo Castells. 2011. Effects of Usage-Based Feedback on Video Retrieval: A Simulation-Based Study. ACM Trans. Inf. Syst. 29, 2, Article 11 (April 2011), 32 pages. 
DOI=10.1145/1961209.1961214 http://doi.acm.org/10.1145/1961209.1961214 Koen E. A. van de Sande, Theo Gevers, and Cees G. M. Snoek, "Evaluating Color Descriptors for Object and Scene Recognition," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 32, iss. 9, pp. 1582-1596, 2010. Koen E. A. van de Sande, Theo Gevers, and Cees G. M. Snoek, "Accelerating Visual Categorization with the GPU," in ECCV Workshop on Computer Vision on GPU, Crete, Greece, 2010. Jan C. van Gemert, Cees G. M. Snoek, Cor J. Veenman, Arnold W. M. Smeulders, and Jan-Mark Geusebroek, "Comparing Compact Codebooks for Visual Categorization," Computer Vision and Image Understanding, vol. 114, iss. 4, pp. 450-462, 2010. Stefanos Vrochidis, Ioannis Kompatsiaris, Ioannis Patras. 2010. Optimizing visual search with implicit user feedback in interactive video retrieval. July 2010 CIVR '10: Proceedings of the ACM International Conference on Image and Video Retrieval Kong-Wah Wan; Ah-Hwee Tan; Joo-Hwee Lim; Liang-Tien Chia; , "Faceted topic retrieval of news video using joint topic modeling of visual features and speech transcripts," Multimedia and Expo (ICME), 2010 IEEE International Conference on , vol., no., pp.843-848, 19-23 July 2010 doi: 10.1109/ICME.2010.5583061 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5583061&isnumber=5582530 Yaowei Wang; Yonghong Tian; Lingyu Duan; Zhipeng Hu; Guochen Jia; , "ESUR: A system for Events detection in SURveillance video," Image Processing (ICIP), 2010 17th IEEE International Conference on , vol., no., pp.2317-2320, 26-29 Sept. 2010 doi: 10.1109/ICIP.2010.5654246 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5654246&isnumber=5648792 Peter Wilkins, Alan F. Smeaton, Paul Ferguson. 2010. Properties of optimally weighted data fusion in CBMIR.
July 2010 SIGIR '10: Proceeding of the 33rd international ACM SIGIR conference on Research and development in information retrieval Yu Xiang; Xiangdong Zhou; Zuotao Liu; Tat-Seng Chua; Chong-Wah Ngo; , "Semantic context modeling with maximal margin Conditional Random Fields for automatic image annotation," Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on , vol., no., pp.3368-3375, 13-18 June 2010 doi: 10.1109/CVPR.2010.5540015 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5540015&isnumber=5539770 Xinxing Xu; Dong Xu; Tsang, I.W.; , "Video Concept Detection Using Support Vector Machine with Augmented Features," Image and Video Technology (PSIVT), 2010 Fourth Pacific-Rim Symposium on , vol., no., pp.381-385, 14-17 Nov. 2010 doi: 10.1109/PSIVT.2010.70 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5673969&isnumber=5673675 Jin Yuan, Zheng-Jun Zha, Zhengdong Zhao, Xiangdong Zhou, Tat-Seng Chua. 2010. Utilizing related samples to learn complex queries in interactive concept-based video search. July 2010 CIVR '10: Proceedings of the ACM International Conference on Image and Video Retrieval Hui Zhang; Zhicheng Zhao; Anni Cai; Xiaohui Xie; , "A novel framework for content-based video copy detection," Network Infrastructure and Digital Content, 2010 2nd IEEE International Conference on , vol., no., pp.753-757, 24-26 Sept. 
2010 doi: 10.1109/ICNIDC.2010.5657881 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5657881&isnumber=5657774 Zhicheng Zhao; Xiaodan Liu; , "A segment-based advertisement search method from TV stream," Future Computer and Communication (ICFCC), 2010 2nd International Conference on , vol.2, no., pp.V2-690-V2-693, 21-24 May 2010 doi: 10.1109/ICFCC.2010.5497581 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5497581&isnumber=5497284 Qiusha Zhu; Lin Lin; Mei-Ling Shyu; Shu-Ching Chen; , "Feature Selection Using Correlation and Reliability Based Scoring Metric for Video Semantic Detection," Semantic Computing (ICSC), 2010 IEEE Fourth International Conference on , vol., no., pp.462-469, 22-24 Sept. 2010 doi: 10.1109/ICSC.2010.65 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5629038&isnumber=5628562 --------------------------------------------------------------------- 2009 (39) --------------------------------------------------------------------- Aly, Robin and Hiemstra, Djoerd. 2009. Concept detectors: how good is good enough? in MM '09: Proceedings of the Seventeenth ACM International Conference on Multimedia, Beijing, China, pp. 233 - 242, New York, NY, USA. ACM, doi:doi.acm.org/10.1145/1631272.1631306, isbn:978-1-60558-608-3 Robin Aly, Djoerd Hiemstra, Arjen P. de Vries. 2009. Reusing Annotation Labor for Concept Selection. in CIVR '09: Proceedings of the International Conference on Content-Based Image and Video Retrieval 2009, Santorini Ioannis Arapakis, Ioannis Konstas, Joemon M. Jose, Ioannis Kompatsiaris. 2009. Modeling facial expressions and peripheral physiological signals to predict topical relevance. July 2009 SIGIR '09: Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval Stéphane Ayache, Georges Quénot, Laurent Besacier. 2009. The LIG multi-criteria system for video retrieval. 
July 2009 CIVR '09: Proceeding of the ACM International Conference on Image and Video Retrieval Bailer W., Lee F. and Thallinger G. A Distance Measure for Repeated Takes of One Scene. The Visual Computer, 25(1):53-68, Jan. 2009. Bailer W. and Rehatschek H. Comparing Fact Finding Tasks and User Survey for Evaluating a Video Browsing Tool. Proceedings of ACM Multimedia, Beijing, CN, Oct. 2009. Bailer W. and Thallinger G. Summarizing Raw Video Material Using Hidden Markov Models. Proceedings of 10th International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS), London, UK, May 2009, pp. 53-56. Juan Cao, HongFang Jing, Chong-Wah Ngo, YongDong Zhang. 2009. Distribution-based concept selection for concept-based video retrieval. October 2009 MM '09: Proceedings of the seventeen ACM international conference on Multimedia Juan Cao, Yong-Dong Zhang, Jun-Bo Guo, Lei Bao, Jin-Tao Li. 2009. VideoMap: an interactive video retrieval system of MCG-ICT-CAS. July 2009 CIVR '09: Proceeding of the ACM International Conference on Image and Video Retrieval Lixin Duan, Ivor W. Tsang, Dong Xu, Tat-Seng Chua. 2009. Domain adaptation from multiple sources via auxiliary classifiers. June 2009 ICML '09: Proceedings of the 26th Annual International Conference on Machine Learning Martin Halvey, Joemon M. Jose. 2009. The role of expertise in aiding video search. July 2009 CIVR '09: Proceeding of the ACM International Conference on Image and Video Retrieval Martin Halvey, David Vallet, David Hannah, Joemon M. Jose. 2009. ViGOR: a grouping oriented interface for search and retrieval in video libraries. June 2009 JCDL '09: Proceedings of the 9th ACM/IEEE-CS joint conference on Digital libraries Benoit Huet, Jinhui Tang, Alex Hauptmann. 2009. ACM SIGMM the first workshop on web-scale multimedia corpus (WSMC09). October 2009 MM '09: Proceedings of the seventeen ACM international conference on Multimedia Yu-Gang Jiang, Chong-Wah Ngo, Shih-Fu Chang. 2009.
Semantic context transfer across heterogeneous sources for domain adaptive video search. October 2009 MM '09: Proceedings of the seventeen ACM international conference on Multimedia Philip Kelly, Ciarán Ó Conaire, Noel E. O'Connor. 2009. Exploiting contextual data for event retrieval in surveillance video. July 2009 CIVR '09: Proceeding of the ACM International Conference on Image and Video Retrieval Duy-Dinh Le, Shin'ichi Satoh. 2009. Efficient concept detection by fusing simple visual features. March 2009 SAC '09: Proceedings of the 2009 ACM symposium on Applied Computing Wei-Hao Lin, Alexander Hauptmann. 2009. Identifying news videos' ideological perspectives using emphatic patterns of visual concepts. October 2009 MM '09: Proceedings of the seventeen ACM international conference on Multimedia Yuan Liu, Tao Mei, Xian-Sheng Hua. 2009. CrowdReranking: exploring multiple search engines for visual search reranking. July 2009 SIGIR '09: Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval Paul Over, George Awad, Alan F. Smeaton, Colum Foley, James Lanagan. 2009. Creating a web-scale video collection for research. October 2009 WSMC '09: Proceedings of the 1st workshop on Web-scale multimedia corpus Yuxin Peng, Zhiwu Lu, Jianguo Xiao. 2009. Semantic concept annotation based on audio PLSA model. October 2009 MM '09: Proceedings of the seventeen ACM international conference on Multimedia Sébastien Poullot, Michel Crucianu, Shin'Ichi Satoh. 2009. Indexing local configurations of features for scalable content-based video copy detection. October 2009 LS-MMRM '09: Proceedings of the First ACM workshop on Large-scale multimedia retrieval and mining P. Punitha, Joemon M. Jose, Anuj Goyal. 2009. Topic prerogative feature selection using multiple query examples for automatic video retrieval.
July 2009 SIGIR '09: Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval Arjan T. Setz and Cees G. M. Snoek, "Can Social Tagged Images Aid Concept-Based Video Search?," in Proceedings of the IEEE International Conference on Multimedia & Expo, 2009. Kimiaki Shirahama, Chieri Sugihara, Yuta Matsuoka, Kuniaki Uehara. 2009. Query-based video event definition using rough set theory. October 2009 EiMM '09: Proceedings of the 1st ACM international workshop on Events in multimedia Alan F. Smeaton, Paul Over, Aiden R. Doherty. Video shot boundary detection: Seven years of TRECVid activity. To appear in the IEEE Computer Vision and Image Understanding. Online at http://dx.doi.org/10.1016/j.cviu.2009.03.011 Cees G. M. Snoek and Marcel Worring, "Concept-Based Video Retrieval," Foundations and Trends in Information Retrieval, vol. 4, iss. 2, pp. 215-322, 2009. Lin-Xie Tang, Tao Mei, Xian-Sheng Hua. 2009. Near-lossless video summarization. October 2009 MM '09: Proceedings of the seventeen ACM international conference on Multimedia Pablo Toharia, Oscar D. Robles, Alan F. Smeaton and Angel Rodriguez: Measuring the influence of Concept Detection on Video Retrieval. Proceedings of the 13th International Conference on Computer Analysis of Images and Patterns, pp. 581--589. Münster, Germany. Sept. 2009 ISBN: 978-3-6425-03766-5 Pablo Toharia, Alberto Sánchez, José Luis Bosque and Oscar D. Robles: GCViR: Grid Content-Based Video Retrieval with work allocation brokering Concurrency and Computation, Practices and Experience, John Wiley & Sons. ISSN: 1532-0626. DOI: 10.1002/cpe.1492 Pablo Toharia, Alberto Sánchez, José Luis Bosque and Oscar D. Robles: Efficient Grid-Based Video Storage and Retrieval. International Symposium on Grid computing, high-performance and Distributed Applications (GADA '08) Proceedings of the GADA 08. Lecture Notes in Computer Science, Vol. 5331. Springer-Verlag Berlin Heidelberg, pp.
833 -- 851 ISBN: 978-3-540-88870-3. Monterrey, Mexico. Nov. 2008 Thierry Urruty, Frank Hopfgartner, David Hannah, Desmond Elliott, Joemon M. Jose. 2009. Supporting aspect-based video browsing: analysis of a user study. July 2009 CIVR '09: Proceeding of the ACM International Conference on Image and Video Retrieval Stefanos Vrochidis, Paul King, Lambros Makris, Anastasia Moumtzidou, Spiros Nikolopoulos, Anastasios Dimou, Vasileios Mezaris, Ioannis Kompatsiaris. 2009. MKLab interactive video retrieval system. July 2009 CIVR '09: Proceeding of the ACM International Conference on Image and Video Retrieval Wang, D., Wang, Z., Li, J., Zhang, B., and Li, X. 2009. Query representation by structured concept threads with application to interactive video retrieval. J. Vis. Comun. Image Represent. 20, 2 (Feb. 2009), 104-116. DOI= http://dx.doi.org/10.1016/j.jvcir.2008.12.001 Xiao-Yong Wei, Yu-Gang Jiang, Chong-Wah Ngo. 2009. Exploring inter-concept relationship with context space for semantic video indexing. July 2009 CIVR '09: Proceeding of the ACM International Conference on Image and Video Retrieval Peter Wilkins, Raphaël Troncy, Martin Halvey, Daragh Byrne, Alia Amin, P. Punitha, Alan F. Smeaton, Robert Villa. 2009. User variance and its impact on video retrieval benchmarking July 2009 CIVR '09: Proceeding of the ACM International Conference on Image and Video Retrieval Zhipeng Wu, Shuqiang Jiang, Qingming Huang. 2009. Near-duplicate video matching with transformation recognition. October 2009 MM '09: Proceedings of the seventeen ACM international conference on Multimedia Rong Yan, Marc-Olivier Fleury, Michele Merler, Apostol Natsev, John R. Smith. 2009. Large-scale multimedia semantic concept modeling using robust subspace bagging and MapReduce. October 2009 LS-MMRM '09: Proceedings of the First ACM workshop on Large-scale multimedia retrieval and mining Ming Yang, Fengjun Lv, Wei Xu, Kai Yu, Yihong Gong. Human action detection by boosting efficient motion features. 
IEEE Workshop on Video-oriented Object and Event Classification in Conjunction with ICCV, Kyoto, Japan, Sept.28, 2009, (VOEC'2009). Guangyu Zhu, Ming Yang, Kai Yu, Wei Xu, Yihong Gong. Detecting video events based on action recognition in complex scenes using spatio-temporal descriptor. ACM International Conference on Multimedia, Beijing, China, Oct.19-23, 2009, full paper, (ACM MM'2009). --------------------------------------------------------------------- 2008 (31) --------------------------------------------------------------------- Aly, R.B.N. and Hiemstra, D. and de Vries, A.P. and de Jong, F.M.G. (2008) Probabilistic Ranking Framework using Unobservable Binary Events for Video Search. <http://eprints.eemcs.utwente.nl/12167/> In: Proceedings of the 7th ACM International Conference on Content-based Image and Video Retrieval, 7-9 July 2008, Niagara Falls. pp. 349-358. ACM. ISBN 978-1-60558-070-8 Ioannis Arapakis, Ioannis Konstas, Joemon M. Jose. 2009. Using facial expressions and peripheral physiological signals as implicit indicators of topical relevance. October 2009 MM '09: Proceedings of the seventeen ACM international conference on Multimedia Bailer W. A Comparison of Distance Measures for Clustering Video Sequences. Proceedings of 1st Workshop on Automated Information Extraction in Media Production, Turin, IT, Sept. 2008, pp. 595-599. Bailer W., Dumont E., Essid S., and Mérialdo B. A collaborative approach to automatic rushes video summarization. Proceedings of First ICIP Workshop on Multimedia Information Retrieval, San Diego, CA, USA, Oct. 2008. Werner Bailer, Felix Lee and Georg Thallinger. Detecting and Clustering Multiple Takes of One Scene. Proceedings of the 14th International Multimedia Modeling Conference, Kyoto, Japan, 9-11 January 2008. Bredin H, Byrne D, Lee H, O'Connor N and Jones G. Dublin City University at the TRECVid 2008 BBC Rushes Summarisation Task.
TVS 2008 - TRECVID BBC Rushes Summarization Workshop, ACM Multimedia 2008, Vancouver, Canada, 31 October 2008. Byrne D, Doherty A, Snoek C.G.M, Jones G and Smeaton A.F. Validating the Detection of Everyday Concepts in Visual Lifelogs. SAMT 2008 - 3rd International Conference on Semantic and Digital Media Technologies, Koblenz, Germany, 3-5 December 2008. Daragh Byrne, Aiden R. Doherty, Cees G. M. Snoek, Gareth J. F. Jones, and Alan F. Smeaton, "Everyday Concept Detection in Visual Lifelogs: Validation, Relationships and Trends," Multimedia Tools and Applications, vol. 49, iss. 1, pp. 119-144, 2010. Byrne D, Wilkins P, Jones G, Smeaton A.F and O'Connor N. Measuring the Impact of Temporal Context on Video Retrieval. CIVR 2008 - ACM International Conference on Image and Video Retrieval, Niagara Falls, Canada, 7-9 July 2008. Doherty A, Byrne D, Smeaton A.F, Jones G, and Hughes M. Investigating Keyframe Selection Methods in the Novel Domain of Passively Captured Visual Lifelogs. CIVR 2008 - ACM International Conference on Image and Video Retrieval, Niagara Falls, Canada, 7-9 July 2008. Doherty A, O Conaire C, Blighe M, Smeaton A.F and O'Connor N. Combining Image Descriptors to Effectively Retrieve Events from Visual Lifelogs. MIR 2008 - ACM International Conference on Multimedia Information Retrieval 2008, Vancouver, Canada, 30-31 October 2008 Doherty A and Smeaton A.F. Automatically Segmenting Lifelog Data Into Events. WIAMIS 2008 - 9th International Workshop on Image Analysis for Multimedia Interactive Services, Klagenfurt, Austria, 7-9 May 2008. Dumont E, Merialdo B, Essid S, Bailer W, Byrne D, Bredin H, O'Connor N, Jones G, Haller M, Krutz A, Sikora T and Platrik T. A Collaborative Approach to Video Summarization. SAMT 2008 - 3rd International Conference on Semantic and Digital Media Technologies, Koblenz, Germany, 3-5 December 2008. Dumont E, Merialdo B, Essid S, Bailer W, Rehatschek H, Byrne D, Bredin H, O'Connor N, Jones G, Smeaton A.F, Haller M and Piatrick T. 
Video Rushes Summarization Using a Collaborative Approach. TVS 2008 - TRECVID BBC Rushes Summarization Workshop, ACM Multimedia 2008, Vancouver, Canada, 31 October 2008. Zhiwei Gu, Tao Mei, Jinhui Tang, Xiuqing Wu, Xian-Sheng Hua. "MILC^2: A Multi-Layer Multi-Instance Learning Approach to Video Concept Detection," International Conference on Multi-Media Modeling (MMM), Kyoto, Japan, Jan. 2008. Gurrin C. Content-based Video Retrieval. Encyclopedia of Database Systems, Springer, 2008. Haubold, A. and Natsev, A. 2008. Web-based information content and its application to concept-based video retrieval. In Proceedings of the 2008 international Conference on Content-Based Image and Video Retrieval (Niagara Falls, Canada, July 07 - 09, 2008). CIVR '08. ACM, New York, NY, 437-446. DOI= http://doi.acm.org/10.1145/1386352.1386408 Koen E. A. van de Sande, Theo Gevers, and Cees G. M. Snoek, "A Comparison of Color Features for Visual Concept Classification," in Proceedings of the ACM International Conference on Image and Video Retrieval, Niagara Falls, Canada, 2008, pp. 141-149. Koen E. A. van de Sande, Theo Gevers, and Cees G. M. Snoek, "Evaluation of Color Descriptors for Object and Scene Recognition," in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Anchorage, Alaska, 2008. Ken-Hao Liu, Ming-Fang Weng, Chi-Yao Tseng, Yung-Yu Chuang and Ming-Syan Chen. Association and Temporal Rule Mining for Post-Processing of Semantic Concept Detection in Video. In IEEE Transactions on Multimedia, special issue on Multimedia Data Mining, volume 10, issue 2, page 240-251, February 2008. Lee F. and Bailer W. Organizing Rushes Video by Visually Similar Setting. Proceedings of ACM International Conference on Image and Video Retrieval, Niagara Falls, CA, Jul. 2008, pp. 279-287. Lee H, Gurrin C, Jones G and Smeaton A.F. Interaction Design for Personal Photo Management on a Mobile Device.
Handbook of Research on User Interface Design and Evaluation for Mobile Technology, 2008. (pp69-85) IGI Publishing, ISBN: 978-1-59904-871-0. Ork de Rooij, Cees G. M. Snoek, and Marcel Worring, "Balancing Thread Based Navigation for Targeted Video Search," in Proceedings of the ACM International Conference on Image and Video Retrieval, Niagara Falls, Canada, 2008, pp. 485-494. Ork de Rooij, Cees G. M. Snoek, and Marcel Worring, "MediaMill: Fast and Effective Video Search using the ForkBrowser," in Proceedings of the ACM International Conference on Image and Video Retrieval, Niagara Falls, Canada, 2008, pp. 561-561. Jeremy Pickens, Gene Golovchinsky, Chirag Shah, Pernilla Qvarfordt, and Maribeth Back. Algorithmic Mediation for Collaborative Exploratory Search. SIGIR 2008. (Singapore, Singapore, July 20 - 24, 2008). ACM, New York, NY, 315-322., July 22, 2008 Smeaton A.F, Foley C, Byrne D and Jones G. iBingo Mobile Collaborative Search. CIVR 2008 - ACM International Conference on Image and Video Retrieval. VideOlympics @ CIVR, Niagara Falls, Canada, 7-9 July 2008. Smeaton A.F, Foley C, Byrne D and Jones G. Mobile, Ubiquitous Information Seeking, as a Group:The iBingo Collaborative Video Retrieval System. MobiQuitous 2008 - The 5th Annual International Conference on Mobile and Ubiquitous Systems: Computing, Networking and Services, Dublin, Ireland, 21-25 July 2008. Smeaton A.F, Over P and Kraaij W. High-level Feature Detection from Video in TRECVid: a 5-Year Retrospective of Achievements. Multimedia Content Analysis:Theory and Applications (in press), 2008. Smeaton A.F, Wilkins P, Worring N, de Rooij O, Chua T-S and Luan H. Content-Based Video Retrieval: Three Example Systems from TRECVid. International Journal of Imaging Systems and Technology, Special Issue on Multimedia Information Retrieval (in press), 2008. Cees G. M. Snoek, Marcel Worring, Ork de Rooij, Koen E. A. van de Sande, Rong Yan, and Alexander G. 
Hauptmann, "VideOlympics: Real-Time Evaluation of Multimedia Retrieval Systems," IEEE Multimedia, vol. 15, iss. 1, 2008. Jinhui Tang, Xian-Sheng Hua, Yan Song, Tao Mei, Xiuqing Wu. "Optimizing Training Set Construction for Video Semantic Classification," EURASIP Journal on Advances in Signal Processing, 2008. Wilkins P, Smeaton A.F, O'Connor N and Byrne D. K-Space Interactive Search. CIVR 2008 - ACM International Conference on Image and Video Retrieval. VideOlympics @ CIVR, Niagara Falls, Canada, 7-9 July 2008. Yan-Tao Zheng, Shi-Yong Neo, Tat-Seng Chua, Qi Tian, "Object-based Image Retrieval Beyond Visual Appearances", MMM 2008, Kyoto, Japan, Jan 2008 --------------------------------------------------------------------- 2007 (59) --------------------------------------------------------------------- Werner Bailer and Georg Thallinger. A Framework for Multimedia Content Abstraction and its Application to Rushes Exploration. CIVR 2007 - ACM International Conference on Image and Video Retrieval, Amsterdam, The Netherlands, 9-11 July 2007. Bosch, A., Zisserman, A. and Munoz, X. Representing shape with a spatial pyramid kernel Proceedings of the International Conference on Image and Video Retrieval (2007) Byrne D, Kehoe P, Lee H, O Conaire C, Smeaton A.F, O'Connor N and Jones G. A User-Centered Approach to Rushes Summarisation Via Highlight-Detected Keyframes. TVS 2007 - TRECVID BBC Rushes Summarization Workshop, ACM Multimedia 2007, Augsburg, Germany, 24-29 September 2007. (pp35-39) Christel, M. G. 2007. Establishing the utility of non-text search for news video retrieval with real world users. In Proceedings of the 15th international Conference on Multimedia (Augsburg, Germany, September 25 - 29, 2007). MULTIMEDIA '07. ACM, New York, NY, 707-716. DOI= http://doi.acm.org/10.1145/1291233.1291395 Christel, M. Examining User Interactions with Video Retrieval Systems. Proc. of SPIE Vol. 6506 Multimedia Content Access: Algorithms and Systems (San Jose, CA, Feb. 2007).
Chum, O., Philbin, J., Isard, M. and Zisserman, A. Scalable Near Identical Image and Shot Detection Proceedings of the International Conference on Image and Video Retrieval (2007) Dayong Ding, Bo Zhang. Probabilistic Model Supported Rank Aggregation for the Semantic Concept Detection in Video. in Intl. Conference of Image and Video Retrieval (CIVR), Amsterdam, 2007. Zhiwei Gu, Tao Mei, Xian-Sheng Hua, Jinhui Tang, Xiuqing Wu. "Multi-Layer Multi-Instance Kernel for Video Concept Detection," Accepted by ACM International Conference on Multimedia (ACM MM), Augsburg, Germany, Sept. 2007 Steven C.H. Hoi and Michael R Lyu. "A Multi-Modal and Multi-Level Ranking Framework for Content-Based Video Retrieval," In the 32nd IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP2007), Special Session on "Web Image/Video Search Technologies", Hawaii, USA, 15-20 April, 2007. Winston H. Hsu, Lyndon Kennedy, Shih-Fu Chang. Reranking Methods for Visual Search. IEEE Multimedia Magazine, 13(3), 2007. Winston H. Hsu, Lyndon Kennedy, Shih-Fu Chang. Video Search Reranking through Random Walk over Document-Level Context Graph. In ACM Multimedia, Augsburg, Germany, September 2007. Wei Jiang, Shih-Fu Chang, Alexander C. Loui. Kernel Sharing With Joint Boosting For Multi-Class Concept Detection. In IEEE CVPR Workshop on Semantic Learning Application in Multimedia, Minneapolis, Minnesota, June 2007. Wei Jiang, Shih-Fu Chang, Alexander C. Loui. Context-Based Concept Fusion with Boosted Conditional Random Fields. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Hawaii, USA, April 2007. Yu-Gang Jiang, Chong-Wah Ngo, Jun Yang, Towards Optimal Bag-of-Features for Object Categorization and Semantic Video Retrieval, ACM International Conference on Image and Video Retrieval (CIVR), 2007 Kehoe P and Smeaton A.F. Using Graphics Processor Units (GPUs) for Automatic Video Structuring.
Proceedings of the WIAMIS 2007 - International Workshop on Image Analysis for Multimedia Interactive Services, Santorini, Greece, 6-8 June 2007. Lyndon Kennedy, Shih-Fu Chang. A Reranking Approach for Context-based Concept Fusion in Video Indexing and Retrieval. In ACM International Conference on Image and Video Retrieval, Amsterdam, Netherlands, July 2007. Koskela M and Smeaton A.F. Measuring Concept Similarities in Multimedia Ontologies: Analysis and Evaluations. IEEE Transactions on Multimedia, 2007. Koskela M and Smeaton A.F. An Empirical Study of Inter-Concept Similarities in Multimedia Ontologies. CIVR 2007 - ACM International Conference on Image and Video Retrieval, Amsterdam, The Netherlands, 9-11 July 2007. Xirong Li, Dong Wang, Jianmin Li and Bo Zhang, Video Search in Concept Subspace: A Text-Like Paradigm, in Intl. Conference of Image and Video Retrieval (CIVR), Amsterdam, 2007 Jingjing Liu, Wei Lai, Xian-Sheng Hua, Yalou Huang, Shipeng Li. Video Search Re-Ranking via Multi-Graph Propagation. ACM International Conference on Multimedia (ACM MM), Augsburg, Germany, Sept. 2007. Xiaobing Liu, Dong Wang, Jianmin Li, Bo Zhang, The Feature and Spatial Covariant Kernel: Adding Implicit Spatial Constraints to Histogram, in Proc. ACM International Conference on Image and Video Retrieval (CIVR), Amsterdam, Netherlands, July, 2007 Huan-Bo Luan, Shi-Yong Neo, Hai-Kiat Goh, Yong-Dong Zhang, Shou-Xun Lin, Tat-Seng Chua. Segregated Feedback with Performance-based Adaptive Sampling for Interactive News Video Retrieval, ACM MM 2007, Augsburg, Germany, 23-29 Sep 2007. Shi-Yong Neo, Yuanyuan Ran, Hai-Kiat Goh, Yantao Zheng, Tat-Seng Chua, Jintao Li, "The Use of Topic Evolution to help Users Browse and Find Answers in News Video Corpus," ACM MM 2007, Augsburg, Germany, 23-29 Sep 2007. Shi-Yong Neo, Yantao Zheng, Hai-Kiat Goh, Tat-Seng Chua, Sheng Tang, "News Video Retrieval Using Implicit Event Semantics," ICME 2007, Beijing, China, 2-5 Jul 2007.
Chen-Ming Pan, Yung-Yu Chuang, Winston H. Hsu. "NTU TRECVID-2007 Fast Rushes Summarization System," ACM Multimedia TRECVID BBC Rushes Summarization Workshop (TVS 2007), Augsburg, Germany, September 23-29, 2007. Ork de Rooij, Cees G.M. Snoek, and Marcel Worring. MediaMill: Semantic Video Browsing using the RotorBrowser. In ACM CIVR 2007 - International Conference on Image and Video Retrieval, Amsterdam, The Netherlands, July 2007. Ork de Rooij, Cees G.M. Snoek, and Marcel Worring. MediaMill: Video Query on demand using the RotorBrowser. In Proceedings of the IEEE International Conference on Multimedia & Expo, Beijing, China, July 2007. Over P, Smeaton A.F and Kelly P. The TRECVID 2007 BBC Rushes Summarization Evaluation Pilot. TVS 2007 - TRECVID BBC Rushes Summarization Workshop, ACM Multimedia 2007, Augsburg, Germany, 24-29 September 2007. (pp1-15) Guo-Jun Qi, Xian-Sheng Hua, Yong Rui, Jinhui Tang, Tao Mei, Hong-Jiang Zhang. "Correlative Multi-Label Video Annotation," ACM International Conference on Multimedia (ACM MM), Augsburg, Germany, Sept. 2007. (Best paper award) Frank J. Seinstra, Jan-Mark Geusebroek, Dennis Koelma, Cees G.M. Snoek, Marcel Worring, and Arnold W.M. Smeulders. High-Performance Distributed Image and Video Content Analysis with Parallel-Horus. IEEE Multimedia. 2007. In press. Smeaton A.F. Techniques Used and Open Challenges to the Analysis, Indexing and Retrieval of Digital Video. Information Systems Journal, Vol. 32, No. 4, 2007. (pp545-559) Smeaton A.F. TRECVid - Video Evaluation. ASIST Bulletin, 2007. Smeaton A.F. Video Summarisation: A new Challenge. Proceedings of the MAR 2007 - Research Challenges in Multimedia Analysis and Retrieval, Glasgow, Scotland, 20 July 2007. Cees G.M. Snoek, Bouke Huurnink, Laura Hollink, Maarten de Rijke, Guus Schreiber, and Marcel Worring. Adding Semantics to Detectors for Video Retrieval. IEEE Transactions on Multimedia, August, 2007. In press. Cees G.M. Snoek and Marcel Worring.
Are Concept Detector Lexicons Effective for Video Search? In Proceedings of the IEEE International Conference on Multimedia & Expo, Beijing, China, July 2007. Cees G.M. Snoek, Marcel Worring, Dennis C. Koelma, and Arnold W.M. Smeulders. A Learned Lexicon-Driven Paradigm for Interactive Video Retrieval. IEEE Transactions on Multimedia, 9(2):280-292, February 2007. Jinhui Tang, Xian-Sheng Hua, Guo-Jun Qi, Meng Wang, Tao Mei, Xiuqing Wu. "Structure-Sensitive Manifold Ranking for Video Concept Detection," ACM International Conference on Multimedia (ACM MM), Augsburg, Germany, Sept. 2007. Tesic, J., Natsev, A., and Smith, J. R. 2007. Cluster-based data modeling for semantic video search. In Proceedings of the 6th ACM international Conference on Image and Video Retrieval (Amsterdam, The Netherlands, July 09 - 11, 2007). CIVR '07. ACM, New York, NY, 595-602. DOI= http://doi.acm.org/10.1145/1282280.1282365 Dong Wang, Jianmin Li, and Bo Zhang. The Importance of Query-Concept-Mapping for Automatic Video Retrieval, ACM Multimedia 2007 Dong Wang, Xiaobing Liu, Linjie Luo, Jianmin Li and Bo Zhang. Video Diver: Generic Video Indexing with Diverse Features. MIR workshop at ACM Multimedia 2007 Dong Wang, Zhikun Wang, Xirong Li, Xiaobing Liu, Jianmin Li and Bo Zhang. Mapping Query to Semantic Concepts: Leveraging Semantic Indices for Automatic and Interactive Video Retrieval, Invited paper in special session of "Closing the semantic gap: concept-based video mining and retrieval" at Intl. Conference Semantic Computing (ICSC) 2007 Feng Wang, Chong-Wah Ngo, Rushes Video Summarization by Object and Event Understanding, TRECVID BBC Rushes Summarization Workshop at ACM Multimedia (TVS'07), Augsburg, Germany, Sep. 2007. Meng Wang, Xian-Sheng Hua, Xun Yuan, Yan Song, Li-Rong Dai. Optimizing Multi-Graph Learning: Towards A Unified Video Annotation Scheme. ACM International Conference on Multimedia (ACM MM), Augsburg, Germany, Sept. 2007. 
Meng Wang, Xian-Sheng Hua, Yan Song, Li-Rong Dai, Ren-Hua Wang. An Interactive Video Annotation Framework With Multiple Modalities. International Conference on Acoustic, Speech, and Signal Processing (ICASSP), April, 2007, Honolulu, Hawaii, USA. Meng Wang, Xian-Sheng Hua, Yan Song, Jinhui Tang, Li-Rong Dai, "Multi-Concept Multi-Modality Active Learning for Interactive Video Annotation", to appear in First IEEE International Conference on Semantic Computing (ICSC), Irvine, California, USA, September, 2007. Xiao-Yong Wei, Chong-Wah Ngo, Ontology-Enriched Semantic Space for Video Search, ACM Multimedia (MM'07), Augsburg, Germany, Sep. 2007. Wilkins P, Adamek T, Smeaton A.F. and O'Connor N. Inexpensive Fusion Methods for Enhancing Feature Detection. CBMI 2007 - 5th International Workshop on Content-Based Multimedia Indexing, Bordeaux, France, 25-27 June 2007. Wilkins P, Adamek T, O'Connor N and Smeaton A.F. Inexpensive Fusion Methods for Enhancing Feature Detection. Signal Processing: Image Communication, Special Issue on Content-Based Multimedia Indexing and Retrieval, Vol. 22, No. 7-8, 2007. (pp 635-650) Marcel Worring, Cees G.M. Snoek, Ork de Rooij, Giang P. Nguyen, and Arnold W.M. Smeulders. The MediaMill Semantic Video Search Engine. In Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, Honolulu, Hawaii, USA, April 2007. Xiao Wu, Alexander G. Hauptmann, Chong-Wah Ngo, Novelty Detection for Cross-Lingual News Stories with Visual Duplicates and Speech Transcripts, ACM Multimedia (MM'07), Augsburg, Germany, Sep. 2007. Xiao Wu, Wan-Lei Zhao, Chong-Wah Ngo, Efficient Near-Duplicate Keyframe Retrieval with Visual Language Models, International Conference on Multimedia and Expo (ICME), 2007 Xiao Wu, Wan-Lei Zhao, Chong-Wah Ngo, Near-Duplicate Keyframe Retrieval with Visual Keywords and Semantic Context, ACM International Conference on Image and Video Retrieval (CIVR), 2007. Yan, R. and Hauptmann, A. G. 2007.
A review of text and image retrieval approaches for broadcast news video. Inf. Retr. 10, 4-5 (Oct. 2007), 445-484. DOI= http://dx.doi.org/10.1007/s10791-007-9031-y Jun Yang, Yu-Gang Jiang, Alexander G. Hauptmann, Chong-Wah Ngo, Evaluating Bag-of-Visual-Words Representations in Scene Classification, ACM SIGMM Int'l Workshop on Multimedia Information Retrieval (MIR'07), Augsburg, Germany, Sep. 2007. Jinhui Yuan, Huiyi Wang, Lan Xiao, Wujie Zheng, Jianmin Li, Fuzong Lin, Bo Zhang: A Formal Study of Shot Boundary Detection. IEEE Trans. Circuits Syst. Video Techn. 17(2): 168-186 (2007) Jinhui Yuan, Jianmin Li, Bo Zhang. Gradual transitions detection with conditional random fields. Proc. of ACM Multimedia. Augsburg, Germany. ACM Press, September, 2007. pages 277-280. Eric Zavesky, Zhu Liu, David Gibbon, Behzad Shahraray. Searching Visual Semantic Spaces with Concept Filters. In IEEE International Conference on Semantic Computing, Irvine, California, September 2007. Zhengjun Zha, Tao Mei, Zengfu Wang, Xian-Sheng Hua. "Building a Comprehensive Ontology to Refine Video Concept Detection," ACM SIGMM International Conference Workshop on Multimedia Information Retrieval (ACM MIR), In conjunction with ACM Multimedia, Augsburg, Germany, Sept. 2007. Wan-Lei Zhao, Chong-Wah Ngo, Hung-Khoon Tan, Xiao Wu, Near-Duplicate Keyframe Identification with Interest Point Matching and Pattern Learning, IEEE Trans. on Multimedia, vol. 9, pp. 1037-1048, Aug 2007. --------------------------------------------------------------------- 2006 (54) --------------------------------------------------------------------- Christel, M. Evaluation and User Studies with Respect to Video Summarization and Browsing. Proc. of SPIE Vol. 6073 Multimedia Content Analysis, Management and Retrieval (San Jose, CA, Jan. 2006). Christel, M., and Conescu, R. Mining Novice User Activity with TRECVID Interactive Retrieval Tasks. In CIVR 2006 - International Conference on Image and Video Retrieval, H. Sundaram et al. 
(Eds.), LNCS 4071, pp. 21-30, Tempe, USA, 13-15 July 2006. Springer-Verlag. Shahram Ebadollahi, Lexing Xie, Shih-Fu Chang, John R. Smith. Visual Event Detection Using Multi-Dimensional Concept Dynamics. In IEEE International Conference on Multimedia and Expo (ICME 06), Toronto, Canada, 2006. Ewerth, Ralph and Freisleben, Bernd: Self-Supervised Learning for Robust Video Indexing. In: Proceedings of the IEEE International Conference on Multimedia & Expo, Toronto, Canada, 2006, pp. 1749-1752. Garnaud E, Smeaton A.F and Koskela M. Evaluation of a Video Annotation Tool Based on the LSCOM Ontology. SAMT 2006 - Proceedings of The First International Conference on Semantics And Digital Media Technology, Athens, Greece, 6-8 December 2006. Gurrin C, Johansen D and Smeaton A.F. Supporting Relevance Feedback in Video Search. ECIR 2006 - European Conference on Information Retrieval. Lalmas M et al. (Eds.): Lecture Notes in Computer Science (LNCS Series 3936), London, U.K., 10-12 April 2006. Hoashi, K., Sugano, M., Naito, M., Matsumoto, K., Sugaya, F. (2006) Video story segmentation based on generic low-level features, Trans of IEICE on Information and Systems, Vol. J89-D, No. 10, pp. 2305-2314, 2006. (In Japanese) Winston H. Hsu, Lyndon Kennedy, and Shih-Fu Chang. (2006) "Video Search Reranking via Information Bottleneck Principle," ACM Multimedia 2006 (full paper), Santa Barbara, CA, October 22-27. Winston H. Hsu and Shih-Fu Chang.(2006) "Topic Tracking across Broadcast News Videos with Visual Duplicates and Semantic Concepts," The International Conference on Image Processing (ICIP), Atlanta, GA, October. Wei Jiang, Shih-Fu Chang, Alexander C. Loui. Active Context-based concept fusion with partial user labels. In IEEE International Conference on Image Processing (ICIP 06), Atlanta, GA, USA, 2006. Lyndon Kennedy, Shih-Fu Chang, Igor Kozintsev. To Search or To Label?: Predicting the Performance of Search-Based Automatic Image Classifiers. 
In Multimedia Information Retrieval Workshop (MIR), Santa Barbara, CA, USA, 2006. Koskela M and Smeaton A.F. Clustering-Based Analysis of Semantic Concept Models for Video Shots. ICME 2006 - IEEE International Conference on Multimedia and Expo, Toronto, Canada, 9-12 July 2006. Koskela M, Smeaton A.F and Gaughan G. Semantic Analysis of Concept Models for News Videos. VCIMS - Workshop on Visual Categorisation and Image Management Systems, Sunderland, U.K., 28 June 2006. Wei Lai, Xian-Sheng Hua, Wei-Ying Ma. Towards Content-Based Relevance Ranking for Video Search. ACM Multimedia (ACM MM), Santa Barbara, CA, USA, Oct 23-27 2006. Matsumoto, K., Hoashi, K., Naito, M., Shishibori, M., Kita, K. (2006) Report on TRECVID2005, Proc. of 12th Korea-Japan Joint Workshop on Frontiers of Computer Vision(FCV2006), pp.65-70, Feb. 2006. Matsumoto, K., Naito, M., Hoashi, K., Sugaya, F. (2006) SVM-based Shot Boundary Detection with a Novel Feature. In Proceedings of the IEEE International Conference on Multimedia & Expo (ICME) 2006, pp. 1837-1840, Toronto, Ontario, Canada, 9-12 July, 2006. Naito, M., Matsumoto, K., Hoashi, K., Sugaya, F. (2006) Camera Motion Detection using Video Mosaicing. In Proceedings of the IEEE International Conference on Multimedia & Expo (ICME) 2006, pp. 1741-1744, Toronto, Ontario, Canada, 9-12 July, 2006. Milind Naphade, John R. Smith, Jelena Tesic, Shih-Fu Chang, Winston Hsu, Lyndon Kennedy, Alexander Hauptmann, Jon Curtis. Large-Scale Concept Ontology for Multimedia. IEEE Multimedia Magazine, 13(3), 2006. Shi-Yong Neo, Yantao Zheng, Tat-Seng Chua, Qi Tian "News Video Search with Fuzzy Event Clustering using High-level Features" In ACM MM 2006, Santa Barbara, USA, 23-27 October 2006. Shi-Yong Neo, Jin Zhao, Min-Yan Kan, Tat-Seng Chua "Video Retrieval Using High-level features: Exploiting Query-matching and Confidence-based Weighting" In CIVR 2006, Arizona, USA, 13-15 July 2006.
O'Connor N, Lee H, Smeaton A.F, Jones G, Cooke E, Le Borgne H and Gurrin C. Físchlár-TRECVid2004: Combined Text- and Image-Based Searching of Video Archives. ISCAS 2006 - IEEE International Symposium on Circuits and Systems, Kos, Greece, 21-24 May 2006. Over P, Smeaton A.F and Docef A. Eval-ware: Digital Video Retrieval. IEEE Signal Processing Magazine, 2006. Sav S, Jones G, Lee H, O'Connor N and Smeaton A.F. Interactive Experiments in Object-Based Retrieval. CIVR2006 - 5th International Conference on Image and Video Retrieval. Springer Lecture Notes in Computer Science Vol. 4071, Tempe, AZ, 13-15 July 2006. Shishibori, M., Minamimoto, T., Matsumoto, K., Hoashi, K., Naito, M., Kita, K. (2006) Estimation of The Camera Motion based on Movement of Interest Points between Images, Proc. of 12th Korea-Japan Joint Workshop on Frontiers of Computer Vision (FCV2006), pp. 145-150, Feb. 2006. Smeaton A.F. TrecVid. CLEAR '06 (Classification of Events, Activities and Relationships) Evaluation Workshop, Southampton, U.K., 6-7 April 2006. Smeaton A.F, Foley C, Gurrin C, Lee H and Mc Givney S. (2006) Collaborative Searching for Video Using the Fishlar System and a DiamondTouch Table. TableTop2006 - The 1st IEEE International Workshop on Horizontal Interactive Human-Computer Systems, Adelaide, Australia, 5-7 January 2006. Smeaton A.F, Gurrin C and Lee H. Interactive Searching and Browsing of Video Archives: Using Text and Using Image Matching. In: Hammoud, Riad (Ed.), Interactive Video: Algorithms and Technologies, 2006, XVI, 250 p. 109 illus., Hardcover, ISBN: 3-540-33214-6 , 2006. Smeaton A.F, Jones G, Lee H and O'Connor N and Sav S. Object-Based Access to TV Rushes Video. ECIR 2006 - European Conference on Information Retrieval. Lalmas M et al. (Eds.): Lecture Notes in Computer Science (LNCS Series 3936), pp. 476-479., London, U.K., 10-12 April 2006. Smeaton A.F, Lee H, Foley C, Mc Givney S and Gurrin C. (2006) Fishlar-DiamondTouch: Collaborative Video Searching on a Table.
SPIE Electronic Imaging - Multimedia Content Analysis, Management, and Retrieval, San Jose, CA, 15-19 January 2006. Smeaton A.F, Lee H, Foley C and Mc Givney S. Collaborative Video Searching on a Tabletop. Multimedia Systems Journal, Vol. 12, No. 4-5, 2006. Smeaton A.F, Over P and Kraaij W. Evaluation Campaigns and TRECVid. MIR 2006 - 8th ACM SIGMM International Workshop on Multimedia Information Retrieval, Santa Barbara, CA, 26-27 October 2006. Arnold W.M. Smeulders, Jan C. van Gemert, Jan-Mark Geusebroek, Cees G.M. Snoek, and Marcel Worring Browsing for the National Dutch Video Archive In Proceedings of the 2nd IEEE-EURASIP International Symposium on Communications, Control and Signal Processing, Marrakech, Morocco, March 2006. Cees G.M. Snoek, Marcel Worring, Jan-Mark Geusebroek, Dennis C. Koelma, Frank J. Seinstra, and Arnold W.M. Smeulders The Semantic Pathfinder: Using an Authoring Metaphor for Generic Multimedia Indexing IEEE Transactions on Pattern Analysis and Machine Intelligence, 28(10), October 2006. Cees G.M. Snoek, Marcel Worring, Jan-Mark Geusebroek, Dennis C. Koelma, Frank J. Seinstra, and Arnold W.M. Smeulders The Semantic Pathfinder for Generic News Video Indexing In Proceedings of the IEEE International Conference on Multimedia & Expo, pp. 1469-1472, Toronto, Canada, July 2006. Cees G.M. Snoek, Marcel Worring, and Alexander G. Hauptmann Learning Rich Semantics from News Video Archives by Style Analysis ACM Transactions on Multimedia Computing, Communications and Applications, 2(2):91-108, May 2006. Cees G.M. Snoek, Marcel Worring, Dennis C. Koelma, and Arnold W.M. Smeulders Learned Lexicon-driven Interactive Video Retrieval In CIVR 2006 - International Conference on Image and Video Retrieval, H. Sundaram et al. (Eds.), LNCS 4071, pp. 11-20, Tempe, USA, 13-15 July 2006. Springer-Verlag. Cees G.M. Snoek, Marcel Worring, Bouke Huurnink, Jan C. van Gemert, Koen E.A. van de Sande, Dennis C. Koelma, and Ork de Rooij. 
MediaMill: Video Search using a Thesaurus of 500 Machine Learned Concepts. In Proceedings of the 1st International Conference on Semantic and Digital Media Technologies, Athens, Greece, December 2006. Cees G.M. Snoek, Marcel Worring, Jan C. van Gemert, Jan-Mark Geusebroek, and Arnold W.M. Smeulders The Challenge Problem for Automated Detection of 101 Semantic Concepts in Multimedia In Proceedings of ACM Multimedia, Santa Barbara, USA, October 2006. Pablo Toharia, Oscar Robles, Jose Luis Bosque and A. Rodriguez. "Towards a Parallel Video Segmentation on a Shared Memory Architecture". Workshop 2006 on Computation Intensive Methods for Computer Vision, held with ECCV 2006. Graz, Austria, May 2006. P. Toharia, O. D. Robles, J. L. Bosque, A. Rodriguez. "Video shot extraction on parallel architectures". In proceedings of the 2006 International Symposium on Parallel and Distributed Processing and Applications (ISPA 2006). Sorrento, Italy, December 2006. Lecture Notes in Computer Science, Vol. 4330, pp. 869-883. Springer Verlag. ISBN: 978-3-540-68067-3. Jan C. van Gemert, Jan-Mark Geusebroek, Cor J. Veenman, Cees G.M. Snoek, and Arnold W.M. Smeulders Robust Scene Categorization by Learning Image Statistics in Context In CVPR Workshop on Semantic Learning Applications in Multimedia, New York, USA, June 2006. Jan C. van Gemert, Cees G.M. Snoek, Cor Veenman, and Arnold W.M. Smeulders The Influence of Cross-Validation on Video Classification Performance In Proceedings of ACM Multimedia, Santa Barbara, USA, October 2006. Volkmer, Timo and Natsev, Apostol (Paul). (2006) Exploring Automatic Query Refinement for Text-Based Video Retrieval. In Proceedings of the IEEE International Conference on Multimedia & Expo (ICME) 2006, Toronto, Ontario, Canada, 9-12 July, 2006. Dong Wang and Jianmin Li and Bo Zhang. (2006) "Relay Boost Fusion for Learning Rare Concepts in Multimedia" in Proceedings of the Conference on Image and Video Retrieval (CIVR 2006). 
Meng Wang, Xian-Sheng Hua, Yan Song, Xun Yuan, Shipeng Li, and Hong-Jiang Zhang. Automatic Video Annotation by Semi-supervised Learning with Kernel Density Estimation. ACM Multimedia (ACM MM), Santa Barbara, CA, USA, Oct 23-27 2006. Wilkins P, Ferguson P, Gurrin C and Smeaton A.F. Automatic Determination of Feature Weights for Multi-Feature CBIR. ECIR 2006 - European Conference on Information Retrieval. Lalmas M et al. (Eds.): Lecture Notes in Computer Science (LNCS Series 3936), London, U.K., 10-12 April 2006. Wilkins P, Ferguson P and Smeaton A.F. Using Score Distributions for Querytime Fusion in Multimedia Retrieval. MIR 2006 - 8th ACM SIGMM International Workshop on Multimedia Information Retrieval, Santa Barbara, CA, 26-27 October 2006. Marcel Worring, Cees G.M. Snoek, Ork de Rooij, Giang P. Nguyen, and Dennis C. Koelma Lexicon based browsers for searching in news video archives In Proceedings of the International Conference on Pattern Recognition, Hong Kong, China, August 2006. Marcel Worring, Cees G.M. Snoek, Bouke Huurnink, Jan van Gemert, Dennis Koelma, and Ork de Rooij The MediaMill Large-lexicon Concept Suggestion Engine In Proceedings of ACM Multimedia, Santa Barbara, USA, October 2006. Marcel Worring, Cees G.M. Snoek, Ork de Rooij, Giang P. Nguyen, Richard van Balen and Dennis C. Koelma MediaMill: Advanced Browsing in News Video Archives In CIVR 2006 - International Conference on Image and Video Retrieval, H. Sundaram et al. (Eds.), LNCS 4071, pp. 533-536, Tempe, USA, 13-15 July 2006. Springer-Verlag. Xiao Wu, Chong-Wah Ngo, and Qing Li. (2006). Threading and Autodocumenting News Videos. IEEE Signal Processing Magazine, volume 23, issue 2, pp. 59-68, March 2006. Lexing Xie, Dong Xu, Shahram Ebadollahi, Katya Scheinberg, Shih-Fu Chang, John R. Smith. Detecting Generic Visual Events with Temporal Cues. In Proc. 40th Asilomar Conference on Signals, Systems, and Computers, Pacific Grove, CA, October 2006. Lexing Xie, Shih-Fu Chang. 
Pattern Mining In Visual Concept Streams. In IEEE International Conference on Multimedia and Expo (ICME 06), Toronto, Canada, 2006. Akira Yanagawa, Winston Hsu, Shih-Fu Chang. Brief Descriptions of Visual Features for Baseline TRECVID Concept Detectors. ADVENT Technical Report #219-2006-5 Columbia University, July 2006. Ming Zhao, Shi-Yong Neo, Hai-Kiat Goh, Tat-Seng Chua. "Multi-Faceted Contextual Model for Person Identification in News Video". In Multimedia Modeling (MMM), Beijing, China 4-6 Jan, 2006. Wujie Zheng and Jianmin Li and Zhangzhang Si and Fuzong Lin and Bo Zhang. "Using High-level Semantic Features in Video Retrieval" in the Proceedings of the Conference on Image and Video Retrieval (CIVR 2006). --------------------------------------------------------------------- 2005 (42) --------------------------------------------------------------------- John Adcock, Matthew Cooper, Andreas Girgensohn, and Lynn Wilcox. (2005) Interactive Video Search Using Multilevel Indexing International Conference on Image and Video Retrieval (CIVR) 2005 pp. 205-14. Amir, A., Berg, M., and Permuter, H. 2005. Mutual relevance feedback for multimodal query formulation in video retrieval. In Proceedings of the 7th ACM SIGMM international Workshop on Multimedia information Retrieval (Hilton, Singapore, November 10 - 11, 2005). MIR '05. ACM, New York, NY, 17-24. DOI= http://doi.acm.org/10.1145/1101826.1101832 Chen, M.-Y., Christel, M., Hauptmann, A., and Wactlar, H. Putting Active Learning into Multimedia Applications: Dynamic Definition and Refinement of Concept Classifiers. Proc. ACM Multimedia '05 (Singapore, November 2005), pp. 902-911. Christel, M., and Conescu, R. (2005). Addressing the Challenge of Visual Information Access from Digital Image and Video Libraries. Proc. ACM/IEEE-CS Joint Conference on Digital Libraries (Denver, CO, June 2005), 69-78. Christel, M., and Hauptmann, A. (2005). The Use and Utility of High-Level Semantic Features. Proc. 
International Conference on Image and Video Retrieval (CIVR) (Singapore, July 2005), in Lecture Notes in Computer Science 3568, 134-144. Gaughan G and Smeaton A.F. (2005) Finding New News: Novelty Detection in Broadcast News. AIRS 2005 - Second Asia Information Retrieval Symposium, Jeju Island, Korea, 13-15 October 2005. Demir Gokalp and Selim Aksoy. (2005) "Finding Faces in News Videos," in 4th International Workshop on Content-Based Multimedia Indexing, Riga, Latvia, June 21-23, 2005. Andreas Girgensohn, John Adcock, Matthew Cooper, and Lynn Wilcox. (2005) A Synergistic Approach to Efficient Interactive Video Retrieval INTERACT 2005, LNCS 3585, pp. 781-794. Andreas Girgensohn, John Adcock, Matthew Cooper, and Lynn Wilcox. (2005) Interactive Search in Large Video Collections CHI 2005 Extended Abstracts, ACM Press, pp. 1395-1398. D Heesch and S Rüger. (2005) Image Browsing: Semantic Analysis of NNk Networks. Int'l Conf on Image and Video Retrieval (CIVR, Singapore, Jul 2005), pp 609--618, Springer LNCS 3568. Hoashi, K., Sugano, M., Naito, M., Matsumoto, K., Sugaya, F. (2005) Video Story Segmentation and its Application to Personal Video Recorders, Proc. of International Conference on Image and Video Retrieval 2005, LNCS3568, pp. 39-48, Jul 2005. P Howarth and S Rüger. (2005) Trading Precision for Speed: Localised Similarity Functions. Int'l Conf on Image and Video Retrieval (CIVR, Singapore, Jul 2005), pp 415--424, Springer LNCS 3568, 2005. P Howarth and S Rüger. (2005) Fractional Distance Measures for Content-Based Image Retrieval. 27th European Conference on Information Retrieval (ECIR, Santiago de Compostela, Spain, Mar 2005), pp 447-456, Springer LNCS 3408, 2005. Winston Hsu, Shih-Fu Chang. (2005) "Visual Cue Cluster Construction via Information Bottleneck Principle and Kernel Density Estimation," In International Conference on Content-Based Image and Video Retrieval (CIVR), Singapore, 2005. Nazli Ikizler and Pinar Duygulu. (2005) Person Search Made Easy. 
In Proceedings of The Fourth International Conference on Image and Video Retrieval (CIVR 2005), Singapore, July 20-22, 2005. Jaffre, G., and Joly, P. (2005). Improvement of a Temporal Video Index Produced by an Object Detector. In Proceedings of the 11th International Conference on Computer Analysis of Images and Patterns (CAIP), Rocquencourt, France, September 2005. Malobabic J, Le Borgne H, Murphy N and O'Connor N. (2005) Detecting The Presence of Large Buildings in Natural Images. CBMI 2005 - 4th International Workshop on Content-Based Multimedia Indexing, Riga, Latvia, 21-23 June 2005. Mc Donald K and Smeaton A.F. (2005) A Comparison of Score, Rank and Probability-based Fusion Methods for Video Shot Retrieval. CIVR 2005 - International Conference on Image and Video Retrieval, W-K Leow et al. (Eds.), LNCS 3569, pp. 61-70, Singapore, 20-22 July 2005. LNCS Series 3569, (c) Springer-Verlag 2005. Natsev, A., Naphade, M. R., and Tešić, J. 2005. Learning the semantics of multimedia queries and concepts from a small number of examples. In Proceedings of the 13th Annual ACM international Conference on Multimedia (Hilton, Singapore, November 06 - 11, 2005). MULTIMEDIA '05. ACM, New York, NY, 598-607. DOI= http://doi.acm.org/10.1145/1101149.1101288 O'Connor N, Cooke E, Le Borgne H, Blighe M and Adamek T. (2005) The AceToolbox: Low-Level Audiovisual Feature Extraction for Retrieval and Classification. 2nd IEE European Workshop on the Integration of Knowledge, Semantic and Digital Media Technologies, London, U.K., 30 November-1 December 2005. Rautiainen M & Seppänen T (2005) Comparison of visual features and fusion techniques in automatic detection of concepts from news video. Proc. 2005 IEEE International Conference on Multimedia & Expo, Amsterdam, The Netherlands. Rautiainen M, Ojala T, Seppänen T (2005) Content-based browsing in large news video databases. Proc. 
5th IASTED International Conference on Visualization, Imaging and Image Processing, Benidorm, Spain. Sav S, Lee H, Smeaton A.F, O'Connor N and Murphy N. (2005) Using Video Objects and Relevance Feedback in Video Retrieval. In Multimedia Systems and Applications VIII, edited by Anthony Vetro, Chang Wen Chen, C.-C. J. Kuo, Tong Zhang, Qi Tian and John R. Smith. Proceedings of SPIE (SPIE, Bellingham, Wa) Vol. 6015, 601512 (2005), Boston, MA, USA, 23-26 October 2005. Sav S, Lee H, O'Connor N and Smeaton A.F. (2005) Interactive Object-based Retrieval Using Relevance Feedback. Acivs 2005 - Advanced Concepts for Intelligent Vision Systems, Antwerp, Belgium, 20-23 September 2005. Sav S, Lee H, Smeaton A.F. and O'Connor N. (2005) Using Segmented Objects in Ostensive Video Shot Retrieval. AMR 2005 - 3rd International Workshop on Adaptive Multimedia Retrieval, Glasgow, U.K., 28-29 July 2005. Sav S, O'Connor N, Smeaton A.F and Murphy N. (2005) Associating Low-level Features with Semantic Concepts using Video Objects and Relevance Feedback. WIAMIS 2005 - 6th International Workshop on Image Analysis for Multimedia Interactive Services, Montreux, Switzerland, 13-15 April 2005. F.J. Seinstra, C.G.M. Snoek, D. Koelma, J.M. Geusebroek, and M. Worring. (2005) User Transparent Parallel Processing of the 2004 NIST TRECVID Data Set. In International Parallel and Distributed Processing Symposium, Denver, USA, April 2005. Smeaton, A.F. (2005) TRECVid Evaluation and Related Work at Dublin City University. Smeaton A.F. VACE 18-Month Workshop, Baltimore, Maryland, 26-28 April 2005. Smeaton A.F. Large Scale Evaluations of Multimedia Information Retrieval: The TRECVid Experience. (2005) CIVR 2005 - International Conference on Image and Video Retrieval, W-K Leow et al. (Eds.), LNCS 3569, pp. 11-17, Singapore, 20-22 July 2005. LNCS Series 3569, (c) Springer-Verlag 2005. C.G.M. Snoek (2005) The Authoring Metaphor to Machine Understanding of Multimedia Ph.D. 
Thesis, University of Amsterdam, October 2005. C.G.M. Snoek et al. (2005) Multimodal Video Indexing: Past, Present, and Future, Workshop on Digital Media Monitoring and Management, Fraunhofer Institute for Computer Graphics, Darmstadt, October 17-18, 2005. (Invited talk) C.G.M. Snoek, M. Worring, J.M. Geusebroek, D.C. Koelma, and F.J. Seinstra (2005) On the Surplus Value of Semantic Video Analysis Beyond the Key Frame In Proceedings of the IEEE International Conference on Multimedia & Expo (ICME), Amsterdam, The Netherlands, July 2005. C.G.M. Snoek, M. Worring, and A.W.M. Smeulders (2005) Early versus Late Fusion in Semantic Video Analysis In Proceedings of ACM Multimedia, Singapore, November 2005. (To appear) S.M.M. Tahaghoghi, Hugh E. Williams, James A. Thom, and Timo Volkmer. (2005) Video Cut Detection using Frame Windows. In Proceedings of the 28th Australasian Computer Science Conference (ACSC2005), 193-200, Newcastle, Australia, 31 January - 3 February 2005. ISBN: 1 920 68220 1. Paola Virga and Pinar Duygulu (2005) Systematic Evaluation of Machine Translation Methods for Image and Video Annotation, In Proceedings of The Fourth International Conference on Image and Video Retrieval (CIVR 2005), Singapore, July 20-22, 2005. Timo Volkmer, John R. Smith, Apostol (Paul) Natsev, Murray Campbell, Milind Naphade. (2005) "A web-based system for collaborative annotation of large image and video collections", In Proceedings of the 13th ACM international conference on Multimedia, Singapore, 6-11 November, 2005. Xiao Wu, Chong-Wah Ngo, and Qing Li (2005). Co-clustering of Time-evolving News Story with Transcript and Keyframe. Proceedings of IEEE International Conference on Multimedia & Expo (ICME'05), Netherlands, Jul. 2005. Z. Yu and G. Herman. (2005) "On the Earth Mover's Distance as a Histogram Similarity Metric for Image Retrieval," IEEE International Conference on Multimedia & Expo (ICME), Jul 2005. Jinhui Yuan, Jianmin Li, Fuzong Lin and Bo Zhang. 
(2005) A Unified Shot Boundary Detection Framework Based on Graph Partition Model ACM Multimedia 2005, Singapore (to appear). Zhai, Y. and Shah, M. (2005) "Tracking News Stories Across Different Sources", 13th ACMMM Multimedia Conference, Singapore, 2005. Zhai, Y., Yilmaz, A. and Shah, M. (2005) "Story Segmentation in News Videos Using Visual and Textual Cues", 4th International Conference on Image and Video Retrieval, Singapore, 2005. Zhai, Y. and Shah, M. (2005) "A Multi-Level Framework for Video Shot Structuring", International Conference on Image Analysis and Recognition, Toronto, Canada, 2005. --------------------------------------------------------------------- 2004 (52) --------------------------------------------------------------------- Amir A., Iyengar G., Lin C.-Y., Naphade M., Natsev A., Neti C., Nock H.J., Smith J.R., Tseng B. (2004). Multimodal video search techniques: late fusion of speech-based retrieval and visual content-based retrieval. 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing Vol. III Pgs. 1048-51. Liudmila Boldareva and Djoerd Hiemstra. (2004). Interactive Content-Based Retrieval Using Pre-computed Object-Object Similarities. in P. Enser et al. (Eds.): CIVR 2004, LNCS 3115, pp.308-316. Ming-yu Chen, Alexander Hauptmann. (2004). Multi-modal classification in digital news libraries. International Conference on Digital Libraries archive Proceedings of the 2004 joint ACM/IEEE conference on Digital libraries. Tucson, AZ. 2004. Pages: 212-213. Christel, M., Huang, C., Moraveji, N., and Papernick, N. (2004). Exploiting Multiple Modalities for Interactive Video Retrieval. Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (Montreal, Canada, May 2004), Vol. III, pp. 1032-1035. Christel, M., and Moraveji, N. (2004). Finding the Right Shots: Assessing Usability and Performance of a Digital Video Library Interface. 
In Proceedings of ACM MM'04, October 10-16, 2004, New York, NY, USA., pp. 732-739. Christel, M., Moraveji, N., and Huang, C. (2004). Evaluating Content-Based Filters for Image and Video Retrieval. Proc. ACM SIGIR '04 (Sheffield, South Yorkshire, UK, July 2004), pp. 590-591. T.S. Chua, L. Chaisorn. (2004). Story Boundary Detection in Large Broadcast News Video Archives - Techniques, Experience, Trends. In Proceedings of ACM MM'04, October 10-16, 2004, New York, NY, USA., pp. 656-659. Cooper, M. (2004). Video Segmentation Combining Similarity Analysis and Classification. In Proceedings of the ACM MM'04, October 10-16, 2004, New York, NY, USA. Pages 252-255. de Vries A.P., Westerveld T., Ianeva T.I. (2004). Combining multiple representations on the TRECVID search task [video retrieval system]. 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing Vol. III Pgs. 1052-5. Pinar Duygulu and Alexander Hauptmann. (2004). What's News, What's Not? Associating News Videos with Words. in P. Enser et al. (Eds.): CIVR 2004, LNCS 3115, pp.132-140. P. Duygulu, J.-Y. Pan, D.A. Forsyth. (2004). Towards Auto-Documentary: Tracking the Evolution of News Stories. In Proceedings of the ACM MM'04, October 10-16, 2004, New York, NY, USA., pp.820-827. Gurrin C. (2004). Video Retrieval within the TREC Framework. 2004. Dagstuhl Seminars (04021) on Content-Based Retrieval, Schloss Dagstuhl, Germany, 4-9 January 2004. C. Gurrin, H. Lee, A. F. Smeaton. (2004). Físchlár @ TRECVID2003: System Description (Paper and Accompanying Video). In Proceedings of the ACM MM'04, October 10-16, 2004, New York, NY, USA. pp. 938-939. Hauptmann, A., and Christel, M. Successful Approaches in the TREC Video Retrieval Evaluations. (2004). Proceedings of ACM Multimedia '04 (New York, NY, October 2004), pp. 668-675. Daniel Heesch and Stefan Rueger. (2004). Three Interfaces for Content-Based Access to Image Collections. in P. Enser et al. (Eds.): CIVR 2004, LNCS 3115, pp. 491-499. L. 
Hollink, G.P. Nguyen, D.C. Koelma, A.T. Schreiber, M. Worring. (2004). User Strategies in Video Retrieval: A Case Study. in P. Enser et al. (Eds.): CIVR 2004, LNCS 3115, pp.6-14. Peter Howarth and Stefan Rueger. (2004). Evaluation of Texture Features for Content-Based Image Retrieval. in P. Enser et al. (Eds.): CIVR 2004, LNCS 3115, pp.326-334. Hsu W., Kennedy L., Huang C.-W., Chang S.-F., Lin C.-Y., Iyengar G. (2004). News video story segmentation using fusion of multi-level multi-modal features in TRECVID 2003. 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing Vol. III Pgs.645-8. Jarina R, O'Connor N, Murphy N and Marlow S. (2004) An Experiment in Audio Classification from Compressed Data. Proc. of Int. Workshop on Systems, Signals and Image Processing IWSSIP'04, Poznan, Poland, 13-15 September 2004. Jones, GJF. (2004). Adaptive systems for multimedia information retrieval. ADAPTIVE MULTIMEDIA RETRIEVAL 3094: 1-18. Kyperountas M., Cernekova Z., Kotropoulos C., Gavrielides M., Pitas L. (2004). Audio PCA in a novel multimedia scheme for scene change detection. 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing: Vol. iv. Pgs. 353-6. Lavrenko V., Feng S.L., Manmatha R. (2004). Statistical models for automatic video annotation and retrieval. 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing Vol. III Pgs.1044-7. Malobabic J, O'Connor N, Murphy N, and Marlow S. (2004). Automatic Detection and Extraction of Artificial Text in Video. WIAMIS 2004 - 5th International Workshop on Image Analysis for Multimedia Interactive Services, Lisbon, Portugal, 21-23 April 2004. M.R. Naphade, J.R. Smith. (2004). On the Detection of Semantic Concepts at TRECVID. In Proceedings of ACM MM'04, October 10-16, 2004, New York, NY, USA., pp. 660-667. Natsev, A., Naphade, M. R., and Smith, J. R. 2004. Semantic representation: search and mining of multimedia content. 
In Proceedings of the Tenth ACM SIGKDD international Conference on Knowledge Discovery and Data Mining (Seattle, WA, USA, August 22 - 25, 2004). KDD '04. ACM, New York, NY, 641-646. DOI= http://doi.acm.org/10.1145/1014052.1014133 O'Hare N, Smeaton A, Czirjek C, O'Connor N, and Murphy N. (2004). A generic news story segmentation system and its evaluation. ICASSP 2004 - IEEE International Conference on Acoustics, Speech, and Signal Processing, Montreal, Quebec, Canada, 17-21 May 2004. Rautiainen M, Ojala T & Seppänen T (2004) Cluster-temporal browsing of large news video databases. Proc. 2004 IEEE International Conference on Multimedia and Expo, Taipei, Taiwan, 2:751-754. Rautiainen M, Ojala T & Seppänen T (2004) Analysing the performance of visual, concept and text features in content-based video retrieval. Proc. 6th ACM SIGMM International Workshop on Multimedia Information Retrieval, New York, NY, 197-205. Oscar David Robles Sanchez. Tecnicas de Recuperacion por Contenido para Imagen y Video en Arquitecturas Paralelas [Techniques for Content-based Image and Video Retrieval on Parallel Architectures]. Universidad Politecnica de Madrid. Doctoral thesis. December 2004. Oscar D. Robles, Pablo Toharia, Angel Rodriguez and Luis Pastor. (2004) Towards a Content-Based Video Retrieval System using Wavelet-Based Signatures. In proceedings of IASTED CGIM 2004, Kauai, Hawaii, USA, Aug. 2004, pp. 344-349. ISBN: 0-88986-418-7. Oscar D. Robles, Pablo Toharia, Angel Rodriguez and Luis Pastor. (2004) XML Specification for AVI Files in a Content-based Video Retrieval System. In proceedings of IASTED VIIP 2004. Marbella, Spain, Sep. 2004, pp. 374-378. ISBN: 0-88986-454-3. Smeaton A. (2004). Access to Archives of Digital Video Information. The 9th Search Engine Meeting, The Hague, The Netherlands, 19-20 April 2004. Smeaton A, Lee H and Mc Donald K. (2004) Experiences of Creating Four Video Library Collections with the Físchlár System. 
Journal of Digital Libraries: Special Issue on Digital Libraries as Experienced by the Editors of the Journal, Vol. 4, No. 1, pp 42-44, 2004. A. F. Smeaton, P. Over and W. Kraaij. (2004). TRECVID: Evaluating the Effectiveness of Information Retrieval Tasks on Digital Video. In Proceedings of the ACM MM'04, October 10-16, 2004, New York, NY, USA. Pages 652-655. Alan F. Smeaton, Wessel Kraaij, and Paul Over. (2004). The TREC Video Retrieval Evaluation (TRECVID): A Case Study and Status Report. in RIAO 2004 Conference Proceedings, Avignon, France. 26-28 April 2004. Pgs 25-37. Smith J.R., Over P., Leung C., Ip H., Grubinger M. (2004). Multimedia retrieval benchmarks. IEEE Multimedia vol.11, no.2: 80-4. C.G.M. Snoek, M. Worring, and A.G. Hauptmann. Detection of TV news monologues by style analysis. In International Conference on Multimedia and Expo, Taipei, Taiwan, June 2004. Fabrice Souvannavong, Bernard Merialdo, Benoit Huet. (2004). Improved Video Content Indexing by Multiple Latent Semantic Analysis. in P. Enser et al. (Eds.): CIVR 2004, LNCS 3115, pp.483-490. T. Ianeva, A.P. de Vries, and T. Westerveld (2004) A Dynamic Probabilistic Multimedia Retrieval Model. 2004 IEEE International Conference on Multimedia & Expo (ICME 2004), Taipei, Taiwan, June, 2004. http://www.uv.es/%7Etzveta/icme04.pdf T. Volkmer, S.M.M. Tahaghoghi and H.E. Williams.(2004) Gradual Transition Detection Using Average Frame Similarity. In Sadiye Guler, Alexander G. Hauptmann and Andreas Henrich editors, Proceedings of the Fourth International Workshop on Multimedia Data and Document Engineering (MDDE-04), in conjunction with the 2004 Computer Vision Pattern Recognition Conference (CVPR-04), Washington D.C., USA, 2nd July 2004, IEEE Computer Society. [also published as: Proceedings of the 2004 Conference on Computer Vision and Pattern Recognition Workshop (CVPRW'04), Volume 9, 27 June - 2 July 2004.] Thijs Westerveld and Arjen P. de Vries. (2004). Multimedia Retrieval Using Multiple Examples. 
in P. Enser et al. (Eds.): CIVR 2004, LNCS 3115, pp.344-352 Westerveld, Thijs. (2004). Using generative probabilistic models for multimedia retrieval (Doctoral dissertation, Twente University, 2004). M. Worring, G.P. Nguyen, L. Hollink, J.C. van Gemert, and D.C. Koelma. Accessing video archives using interactive search. In International Conference on Multimedia and Expo, Taipei, Taiwan, June 2004. Y. Wu, E.Y. Chang, K.C-C. Chang, J.R. Smith. (2004). Optimal Multimodal Fusion for Multimedia Data Analysis. In Proceedings of ACM MM'04, October 10-16, 2004, New York, NY, USA., pp. Rong Yan, Alexander G. Hauptmann. (2004). Co-retrieval: A Boosted Reranking Approach for Video Retrieval. in P. Enser et al. (Eds.): CIVR 2004, LNCS 3115, pp.60-69.. Yan, R., Yang, J. Hauptmann, A. (2004). Learning Query-Class Dependent Weights in Automatic Video Retrieval. In Proceedings of ACM MM'04, October 10-16, 2004, New York, NY, USA., pp. 548-555. Yang, J., Hauptmann, A. (2004). Naming Every Individual in News Video Monologues. In Proceedings of ACM MM'04, October 10-16, 2004, New York, NY, USA., pp. 580-587. Jun Yang, Ming-yu Chen, Alex Hauptmann. (2004). Finding Person X: Correlating Names with Visual Appearances. in P. Enser et al. (Eds.): CIVR 2004, LNCS 3115, pp.270-278. M.Yang, B.M. Wildemuth, G.Marchionini. (2004). The Relative Effectiveness of Concept-based Versus Content-based Video Retrieval. In Proceedings of ACM MM'04, October 10-16, 2004, New York, NY, USA., pp. 368-371. Yavlinsky A., Pickering M.J., Heesch D., Ruger S. (2004). A comparative study of evidence combination strategies. 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing Vol. III Pgs.1040-3. Ye J. and Smeaton A. Poster (2004) Aggregated Feature Retrieval for MPEG-7 via Clustering. presented at: SIGIR 2004 - the 27th Annual International ACM SIGIR Conference, pp514-515, Sheffield, UK, 25-29 July 2004. D.Q. Zhang, S.-F. Chang. (2004). 
Detecting Image Near-Duplicate by Stochastic Attributed Relational Graph Matching with Learning. In Proceedings of ACM MM'04, October 10-16, 2004, New York, NY, USA., pp. 877-884. --------------------------------------------------------------------- 2003 --------------------------------------------------------------------- Christel, M.G., and Huang, C. Enhanced Access to Digital Video through Visually Rich Interfaces. (2003). Proceedings of the IEEE International Conference on Multimedia and Expo (ICME) (Baltimore, MD, July 2003), pp. III-21 - III-24. Georgina Gaughan, Alan F. Smeaton, Cathal Gurrin, Hyowon Lee, Kieran McDonald. (2003). Video retrieval: Design, implementation and testing of an interactive video retrieval system. Proceedings of the 5th ACM SIGMM international workshop on Multimedia information retrieval. Berkeley, California. 2003. Pages: 23-30. Hauptmann A.G., Rong Jin, Ng T.D. (2003). Video retrieval using speech and image information. Proceedings of the SPIE - The International Society for Optical Engineering vol.5021: 148-59. G. Iyengar, H. J. Nock. (2003). Discriminative model fusion for semantic concept detection and annotation in video. Proceedings of the eleventh ACM international conference on Multimedia. Berkeley, CA. November 2003. Pages: 255-258. C-Y. Lin, M. Naphade, A. Natsev, C. Neti, J. R. Smith, B. Tseng, H. J. Nock, W. Adams. (2003). User-trainable video annotation using multimodal cues. Proceedings of the 26th annual international ACM SIGIR conference on Research and development in information retrieval. Toronto, Canada. July 2003. Pages: 403-404. Naphade, MR; Smith, JR. (2003). A hybrid framework for detecting the semantics of concepts and context. IMAGE AND VIDEO RETRIEVAL, PROCEEDINGS 2728: 196-205. Naphade M.R., Smith J.R. (2003). Role of classifiers in multimedia content management. Proceedings of the SPIE - The International Society for Optical Engineering vol.5021: 89-99. Natsev A., Naphade M.R., Smith J.R. (2003). 
Exploring semantic dependencies for scalable concept detection. Proceedings 2003 International Conference on Image Processing Vol. III Pgs. 625-8. Rautiainen M, Ojala T & Seppänen T (2003) Cluster-temporal video browsing with semantic filtering. Proc. Advanced Concepts for Intelligent Vision Systems, Ghent, Belgium, 116 - 123. Rautiainen M, Seppänen T, Penttilä J & Peltola J (2003) Detecting semantic concepts from video using temporal gradients and audio classification. Proc. International Conference on Image and Video Retrieval, Urbana, IL, 260 - 270. Rong Yan, Alexander G. Hauptmann, Rong Jin. (2003). Negative pseudo-relevance feedback in content-based video retrieval. Proceedings of the eleventh ACM international conference on Multimedia. Berkeley, CA. November 2003. Pages: 343-346. A. Smeaton. (2003). Information Access to Digital Video Archives: A Review of TREC, and the Físchlár System. Invited speech at: MIR2003 - Workshop: Multimedia Information Retrieval in Business Applications, Fraunhofer Institute for Computer Graphics (IGD), Darmstadt, Germany, 30-31 January 2003. Smeaton A, Lee H, O'Connor N, Marlow S and Murphy N. (2003). TV News Story Segmentation, Personalisation and Recommendation. AAAI 2003 Spring Symposium on Intelligent Multimedia Knowledge Management, Stanford University, Palo Alto, CA, 24-26 March 2003. Smeaton, AF; Over, P. (2003). TRECVID: Benchmarking the effectiveness of information retrieval tasks on digital video. IMAGE AND VIDEO RETRIEVAL, PROCEEDINGS 2728: 19-27. T. Ianeva, A. P. de Vries, and H. Röhrig (2003) Detecting cartoons: a case study in automatic video-genre classification. In Proceedings of the IEEE International Conference on Multimedia & Expo (ICME), pp. 1149-1452, Baltimore, MD, US, July 2003. http://www.uv.es/%7Etzveta/icme03.pdf Thijs Westerveld, Arjen P. de Vries. (2003). 
Multimedia information retrieval: Experimental result analysis for a generative probabilistic image retrieval model Proceedings of the 26th annual international ACM SIGIR conference on Research and development in information retrieval. Toronto, Canada. July 2003. Pages: 135-142. Westerveld, T; de Vries, AP; van Ballegooij, A; de Jong, F; Hiemstra, D. (2003). A Probabilistic multimedia retrieval model and its evaluation. EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING 2003 (2): 186-198. Yanjun Qi, Hauptmann A., Ting Liu. (2003). Supervised classification for video shot segmentation. Proceedings 2003 International Conference on Multimedia and Expo. Vol. II Pgs.689-92. --------------------------------------------------------------------- 2002 --------------------------------------------------------------------- Basu S., Naphade M., Smith J.R. (2002). A statistical modeling approach to content based retrieval. 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings: IV 4080-3. Hauptmann A.G., Christel M.G., Papernick N.D. (2002). Video retrieval with multiple image search strategies. JCDL 2002. Proceedings of the Second ACM/IEEE-CS Joint Conference on Digital Libraries: 376 edited by Marchionini G., Hersh W. Hauptmann A.G., Jin R., Ng T.D. (2002). Multimodal information retrieval from broadcast video using OCR and speech recognition. JCDL 2002. Proceedings of the Second ACM/IEEE-CS Joint Conference on Digital Libraries: 160-1 edited by Marchionini G., Hersh W. Hauptmann A.G., Papernick N.D. (2002). Video-Cuebik: adapting image search to video shots. JCDL 2002. Proceedings of the Second ACM/IEEE-CS Joint Conference on Digital Libraries: 156-7 edited by Marchionini G., Hersh W. Naphade M.R., Basu S., Smith J.R., Ching-Yung Lin, Tseng B. (2002). Modeling semantic concepts to support query by keywords in video. Proceedings 2002 International Conference on Image Processing Vol. I Pgs. 145-8. 
Naphade M.R., Basu S., Smith J.R., Ching-Yung Lin, Tseng B. (2002). A statistical modeling approach to content based video retrieval. Proceedings 16th International Conference on Pattern Recognition: Pgs. 953-6 edited by Kasturi R., Laurendeau D., Suen C. H. J. Nock, G. Iyengar, C. Neti. (2002). Assessing face and speech consistency for monologue detection in video. Proceedings of the tenth ACM international conference on Multimedia. Juan-les-Pins, France. December 2002. Pages: 303-306. Rautiainen M., Doermann D. (2002). Temporal color correlograms for video retrieval. Proceedings 16th International Conference on Pattern Recognition: 267-70 edited by Kasturi R., Laurendeau D., Suen C. Rautiainen M & Ojala T (2002) Color correlograms in image and video retrieval. Proc. STeP 2002, The 10th Finnish Artificial Intelligence Conference, Oulu, Finland, 203 - 212. Smeaton A.F., Over P., Costello C.J., de Vries A.P., Doermann D., Hauptmann A., Rorvig M.E., Smith J.R., Wu L. (2002). The TREC2001 video track: information retrieval on digital video information. Research and Advanced Technology for Digital Libraries. 6th European Conference, ECDL 2002. Proceedings (Lecture Notes in Computer Science Vol. 2458) Pgs. 266-75 edited by Agosti M., Thanos C. --------------------------------------------------------------------- 2001 --------------------------------------------------------------------- Smeaton, A. (2001). Content-based access to digital video: the Físchlár system and the TREC Video track. MMCBIR 2001 - Multimedia Content-based Indexing and Retrieval, INRIA, Rocquencourt, France, 24-25 September 2001.