Accepted Papers

Full Papers Accepted for Oral Presentation

23 M3LH: Multi-Modal Multi-Label Hashing for Large Scale Data Search. Yang, Guan-Qun; Xu, Xin-Shun; Guo, Shanqing; Wang, Xiao-Lin. Shandong University, China, People’s Republic of.

30 On the Exploration of Convolutional Fusion Networks for Visual Recognition. Liu, Yu; Guo, Yanming; S. Lew, Michael. Leiden University, Netherlands, The.

36 Describing Geographical Characteristics with Social Images. ZHENG, Huangjie; YAO, Jiangchao; ZHANG, Ya. Shanghai Jiao Tong University, China, People’s Republic of.

40 A Comparison of Approaches for Automated Text Extraction from Scholarly Figures. Böschen, Falk (1); Scherp, Ansgar (1,2). 1: Christian-Albrechts-Universität Kiel, Germany; 2: ZBW – Leibniz Information Centre for Economics, Germany.

50 3D Sound Field Reproduction at Non Central Point for NHK 22.2 System. Wang, Song (1,2); Hu, Ruimin (1,2); Chen, Shihong (1,2); Wang, Xiaochen (1,2); Yang, Yuhong (1,2); Tu, Weiping (1,2); Peng, Bo (3). 1: State Key Laboratory of Software Engineering, School of Computer Science, Wuhan University, Wuhan, 430072, China; 2: National Engineering Research Center for Multimedia Software, School of Computer Science, Wuhan University, Wuhan, 430072, China; 3: Military Economy Academy, Wuhan, 430072, China.

51 Multi-Attribute based Fire Detection in Diverse Surveillance Videos. Li, Shuangqun; Liu, Wu; Ma, Huadong; Fu, Huiyuan. Beijing University of Posts and Telecommunications, China, People’s Republic of.

52 Fully convolutional network with superpixel parsing for fashion Web image segmentation. YANG, Lixuan (1,2); RODRIGUEZ, Helena (2); CRUCIANU, Michel (1); FERECATU, Marin (1). 1: Conservatoire National des Arts et Metiers, Paris, France; 2: Shopedia SAS.

63 Real-Time 3D Visual Singing Synthesis: From Appearance to Internal Articulators. Yu, Jun. University of Science and Technology of China, China, People’s Republic of.

67 Modeling User Performance for Moving Target Selection with a Delayed Mouse. Claypool, Mark (1); Eg, Ragnhild (2); Raaen, Kjetil (2). 1: Worcester Polytechnic Institute, Worcester, MA, United States of America; 2: Westerdals, Oslo, Norway.

71 Spatio-temporal VLAD Encoding for Human Action Recognition in Videos. Duta, Ionut Cosmin (1); Ionescu, Bogdan (2); Aizawa, Kiyoharu (3); Sebe, Nicu (1). 1: University of Trento, Italy; 2: University Politehnica of Bucharest, Romania; 3: University of Tokyo, Japan.

72 A Convolutional Neural Network Approach for Post-Processing in HEVC Intra Coding. Dai, Yuanying; Liu, Dong; Wu, Feng. Univ Sci Tech China, China, People’s Republic of.

79 Near-Duplicate Video Retrieval by Aggregating Intermediate CNN Layers. Kordopatis-Zilos, Giorgos (1); Papadopoulos, Symeon (1); Patras, Ioannis (2); Kompatsiaris, Yiannis (1). 1: Centre for Research and Technology Hellas, Greece; 2: Queen Mary University of London.

81 A Structural Coupled-layer Tracking Method Based on Correlation Filters. Chen, Sheng (1); Liu, Bin (1); Chen, Chang Wen (2). 1: CAS Key Laboratory of Electromagnetic Space Information,University of Science and Technology of China, Hefei, China; 2: University at Buffalo, State University of New York, Dept. of Computer Science & Engineering, USA.

82 Supervised Class Graph Preserving Hashing for Image Retrieval and Classification. Feng, Lu; Xu, Xin-Shun; Guo, Shanqing; Wang, Xiao-Lin. Shandong University, China, People’s Republic of.

93 Phase Fourier Reconstruction for Anomaly Detection on Metal Surface Using Salient Irregularity. Hung, Tzu-Yi (1); Vaikundam, Sriram (1); Natarajan, Vidhya (1); Chia, Liang Tien (2). 1: Rolls-Royce@NTU Corporate Lab, Singapore; 2: School of Computer Science and Engineering, Nanyang Technological University (NTU), Singapore.

96 Comparison of Fine-tuning and Extension Strategies for Deep Convolutional Neural Networks. Pittaras, Nikiforos (1); Markatopoulou, Foteini (1,2); Mezaris, Vasileios (1); Patras, Ioannis (2). 1: Centre for Research and Technology Hellas, Information Technologies Institute (CERTH-ITI); 2: Queen Mary University of London.

98 Visual robotic object grasping through combining RGB-D data and 3D mesh. Zhou, Yiyang (1); Wang, Wenhai (1); Guan, Wenjie (1); Wu, Yirui (1); Lai, Heng (1); Lu, Tong (1); Cai, Min (2). 1: National Key Lab for Novel Software Technology, Nanjing University, Nanjing, China; 2: Riseauto Intelligent Tech., Beijing, China.

103 What Convnets Make for Image Captioning?. Liu, Yu; Guo, Yanming; S. Lew, Michael. Leiden University, Netherlands, The.

105 Structure-aware Image Resizing for Chinese Characters. Liu, Chengdong; Lian, Zhouhui; Tang, Yingmin; Xiao, Jianguo. Institute of Computer Science and Technology, Peking University, No.128, Zhongguancun Street, Haidian District, Beijing, China.

106 Graph-Based Multimodal Music Mood Classification in Discriminative Latent Space. Su, Feng; Xue, Hao. Nanjing University, China, People’s Republic of.

107 Model-Based 3D Scene Reconstruction Using a Moving RGB-D Camera. Cheng, Shyi-Chyi (1); Su, Jui-Yuan (2); Chen, Ching-Min (1); Hsieh, Jun-Wei (1). 1: National Taiwan Ocean UNiversity, Taiwan, Republic of China; 2: Ming Chuan University, Taiwan.

112 Joint Face Detection and Initialization for Face Alignment. Wang, Zhiwei; Yang, Xin. Huazhong University of Science and Technology, China, People’s Republic of.

124 ReMagicMirror: Action Learning Using Human Reenactment with the Mirror Metaphor. Dayrit, Fabian Lorenzo Baytion (1); Kimura, Ryosuke (3); Nakashima, Yuta (1); Blanco, Ambrosio (2); Kawasaki, Hiroshi (3); Ikeuchi, Katsushi (2); Sato, Tomokazu (1); Yokoya, Naokazu (1). 1: Nara Institute of Science and Technology, Japan; 2: Microsoft Research Asia, China; 3: Kagoshima University, Japan.

133 Robust Image Classification via Low-Rank Double Dictionary Learning. Rong, Yi (1,2); Xiong, Shengwu (1); Gao, Yongsheng (2). 1: Wuhan University of Technology, China; 2: Griffith University, Australia.

149 Single Image Super-resolution with a Parameter Economic Residual-like Convolutional Neural Network. YANG, ZE (1); Zhang, Kai (2); Liang, Yudong (1); Wang, Jinjun (1). 1: Xi’an Jiaotong University, China, People’s Republic of; 2: Harbin Institute of Technology,, China, People’s Republic of.

151 Learning Features Robust to Image Variations with Siamese Networks for Facial Expression Recognition. Baddar, Wissam; kim, Daehoe; Ro, Yong Man. KAIST, Korea, Republic of (South Korea).

154 Augmented Telemedicine Platform for Real-time Remote Medical Consultation. Anton, David; Kurillo, Gregorij; Yang, Allen Y.; Bajcsy, Ruzena. University of California at Berkeley, Berkeley, CA, USA.

162 Robust Scene Text Detection for Multi-script Languages. Liu, Ruo-Ze (1); Sun, Xin (1); Xu, Hailiang (1); Shivakumara, Palaiahnakote (2); Su, Feng (1); Lu, Tong (1); Yang, Ruoyu (1). 1: National Key Lab for Novel Software Technology, Nanjing University, Nanjing, China; 2: Faculty of Computer Science and Information Technology, University of Malaya, Kuala Lumpur, Malaysia.

168 Fine-Grained Image Recognition from Click-Through Logs Using Deep Siamese Network. Feng, Wu; Liu, Dong. Univ Sci Tech China, China, People’s Republic of.

180 What are Good Design Gestures? -Towards user- and machine-friendly interface-. Kawahata, Ryo; Shimada, Atsushi; Taniguchi, Rin-ichiro. Kyushu University, Japan.

187 Robust Visual Tracking based on Multi-channel Compressive Features. xu, Jianqiang; lu, yao. bit, China, People’s Republic of.

194 A Framework of Privacy-Preserving Image Recognition for Image-Based Information Services. Fujii, Kojiro; Nakamura, Kazuaki; Nitta, Naoko; Babaguchi, Noboru. Osaka University.

200 Large-Scale Product Classification via Spatial Attention based CNN Learning and Multi-Class Regression. Ai, Shanshan (1); Jia, Caiyan (1); Chen, Zhineng (2). 1: School of Computer and Information Technology, Beijing Jiaotong University; 2: Institute of Automation, Chinese Academy of Sciences.

206 No-Reference Image Quality Assessment based on Internal Generative Mechanism. Qian, Xinchun; Zhou, Wengang; Li, Houqiang. University of Science and Technology of China, China, People’s Republic of.

207 Color Consistency for Photo Collections without Gamut Problems. Tian, Qi-Chong; Cohen, Laurent D.. Univ. Paris-Dauphine, PSL Research University.

Full Papers Accepted for Poster Presentation

3 M-SBIR: An Improved Sketch-based Image Retrieval Method using Visual Word Mapping. Niu, Jianwei (1); Ma, Jun (1); Lu, Jie (1); Liu, Xuefeng (2); Zhu, Zeyu (3). 1: State Key Laboratory of Virtual Reality Technology and Systems, Beihang University, Beijing 100191, China; 2: hong kong polytechnic university; 3: School of Electronics and Information, xi’an Jiaotong university, xi’an 710049, China..

16 Discovering User Interests from Social Images. Yao, Jiangchao (1); Zhang, Ya (1); Tsang, Ivor (2); Sun, Jun (1). 1: Shanghai Jiao Tong University, China, People’s Republic of; 2: University of Technology Sydney, Austrilia.

17 Frame-independent and Parallel Method for 3D Audio Real-time Rendering on Mobile Devices. Song, Yucheng (1,2); Wang, Xiaochen (2,3); Yang, Cheng (2,4); Gao, Ge (2); Chen, Wei (1,2); Tu, Weiping (2). 1: State Key Laboratory of Software Engineering, Wuhan University, China; 2: National Engineering Research Center for Multimedia Software, Computer School of Wuhan University, China; 3: Hubei Provincial Key Laboratory of Multimedia and Network Communication Engineering, Wuhan University, China; 4: School of Physics and Electronic Science, Guizhou Normal University, Guiyang, China.

31 Color-Introduced Frame-to-Model Registration for 3D Reconstruction. Li, Fei; Du, Yunfan; Liu, Rujie. Fujitsu Research & Development Center Co., Ltd., China, People’s Republic of.

34 Scale-Relation Feature for Moving Cast Shadow Detection. Lin, Chih-Wei. Fujian Agriculture and Forestry University, China, People’s Republic of.

46 Improving the discriminative power of Bags of Visual Words Model. Ouni, Achref (1); Urruty, Thierry (1); Visani, Muriel (2). 1: XLIM UMR CNRS 7252, University of Poitiers, France; 2: L3I, University of La Rochelle, France.

48 Recognizing Emotions Based on Human Actions in Videos. Wang, Guolong; Xu, Kaiping. Tsinghua University, China, People’s Republic of.

53 A Novel Affective Visualization System for Videos based on Acoustic and Visual Features. Niu, Jianwei (1); Su, Yiming (1); Mo, Shasha (1); Zhu, Zeyu (2). 1: State Key Laboratory of Virtual Reality Technology and Systems, Beihang University, Beijing 100191, China; 2: School of Electronics and Information, xi’an Jiaotong university, xi’an 710049, China.

66 Rocchio-based Relevance Feedback in Video Event Retrieval. Pingen, Geert (1); de Boer, Maaike (2); Aly, Robin (1). 1: University of Twente, Netherlands, The; 2: Netherlands Organization for Applied Scientific Research (TNO).

77 A Unified Framework for Monocular Video-Based Facial Motion Tracking and Expression Recognition. Yu, Jun. University of Science and Technology of China, China, People’s Republic of.

78 A Scalable Video Conferencing System Using Cached Facial Expressions. Shih, Fang-Yu; Fan, Ching-Ling; Wang, Pin-Chun; Hsu, Cheng-Hsin. National Tsing Hua University, Taiwan, Republic of China.

84 Exploiting multimodality in video hyperlinking to improve target diversity. Bois, Remi (1); Vukotic, Vedran (2); Simon, Anca (3); Sicre, Ronan (4); Raymond, Christian (2); Sébillot, Pascale (2); Gravier, Guillaume (1). 1: CNRS, IRISA and INRIA, France; 2: INSA, IRISA and INRIA, France; 3: Université de Rennes 1, France; 4: INRIA, IRISA and INRIA, France.

90 A Novel Two-step Integer-pixel Motion Estimation Algorithm for HEVC Encoding on a GPU. Chen, Keji (1); Sun, Jun (1); Guo, Zongming (1); Zhao, Dachuan (2). 1: Peking University, China, People’s Republic of; 2: Advanced Micro Devices Co., Ltd., China, People’s Republic of.

100 An Evaluation of Video Browsing on Tablets with the ThumbBrowser. Hudelist, Marco; Schoeffmann, Klaus. Klagenfurt University, Austria.

102 Illumination-Preserving Embroidery Simulation for Non-photorealistic Rendering. Shen, Qiqi; Cui, Dele; Sheng, Yun; Zhang, Guixu. East China Normal University, China, People’s Republic of.

104 Spatial Verification via Compact Words for Mobile Instance Search. Wang, Bo; Shao, Jie; He, Chengkun; Hu, Gang; Xu, Xing. University of Electronic Science and Technology of China, China, People’s Republic of.

115 Adaptive and optimal combination of local features for image retrieval. Bhowmik, Neelanjan (1,2); Gouet-Brunet, Valérie (1); Wei, Lijun (1); Bloch, Gabriel (2). 1: University Paris-Est, IGN/SR, France; 2: Nicéphore Cité, Chalon sur Saône, France.

116 Deep Convolutional Neural Network for Bidirectional Image Sentence Mapping. Yu, Tianyuan; Bai, Liang; Guo, Jinlin; Yang, Zheng; Xie, Yuxiang. National University of Defense Technology, China, People’s Republic of.

118 Online User Modeling for Interactive Streaming Image Classification. Hu, Jiagao; Sun, Zhengxing; Li, Bo; Yang, Kewei; Li, Dongyang. State Key Laboratory for Novel Software Technology, Nanjing University, P R China.

119 Unsupervised Multiple Object Cosegmentation via Ensemble MIML Learning. Yang, Weichen; Sun, Zhengxing; Li, Bo; Hu, Jiagao; Yang, Kewei. State Key Laboratory for Novel Software Technology, Nanjing University, P R China.

130 Discovering Geographic Regions in the City Using Social Multimedia and Open Data. Rudinac, Stevan; Zahálka, Jan; Worring, Marcel. University of Amsterdam, Netherlands, The.

141 Facial Expression Recognition by Fusing Gabor and Local Binary Pattern Features. Sun, Yuechuan (1); Yu, Jun (1,2). 1: University of Science and Technology of China, China, People’s Republic of; 2: Nanjing University, China, People’s Republic of.

143 Stochastic Decorrelation Constraint Regularized Auto-Encoder for Visual Recognition. Mao, Fengling; Xiong, Wei; Du, Bo; Zhang, Lefei. Computer School of Wuhan University, China.

147 The Perceptual Lossless Spatial Parameter Quantization of 3D Audio Signals. Li, Gang (1,2,3); Wang, Xiaochen (2,3); Gao, Li (2,3); Hu, Ruimin (1,2,3); Li, Dengshi (2,3). 1: State Key Laboratory of Software Engineering, Wuhan University, China; 2: National Engineering Research Center for Multimedia Software School of Computer, Wuhan University, Wuhan, China; 3: Hubei Provincial Key Laboratory of Multimedia and Network Communication Engineering, Wuhan University, China.

152 A Comparative Study For Known Item Visual Search Using Position Color Feature Signatures. Lokoc, Jakub; Kubon, David; Blazek, Adam. Charles University, Czech Republic.

157 Smart loudspeaker arrays for self-coordination and user tracking. Jee, Jungju; CHOI, JUNG WOO. School of Electrical Engineering, KAIST, Korea, Republic of (South Korea).

165 Video Search via Ranking Network With Very Few Query Exemplars. Cheng, De (1,2); Jiang, Lu (2); Gong, Yihong (1); Zheng, Nanning (1); Hauptmann, Alexander (2). 1: Xi’an Jiaotong University, China, People’s Republic of; 2: Carnegie Mellon University.

169 Using Object Detection, NLP, and Knowledge Bases to Understand the Message of Images. Weiland, Lydia (1); Hulpus, Ioana (2); Ponzetto, Simone Paolo (3); Dietz, Laura (4). 1: University of Mannheim, Germany; 2: University of Mannheim, Germany; 3: University of Mannheim, Germany; 4: CEPS – College of Engineering and Physical Sciences, Department for Computer Science, University of New Hampshire, Durham, New Hampshire, USA.

181 Exploring Large Movie Collections: Comparing Visual Berrypicking and Traditional Browsing. Low, Thomas (1); Hentschel, Christian (2); Stober, Sebastian (3); Sack, Harald (2); Nürnberger, Andreas (1). 1: Otto von Guericke University Magdeburg, Germany; 2: Hasso Plattner Institute for Software Systems Engineering, Potsdam, Germany; 3: University of Potsdam, Germany.

185 Binaural Sound Source Distance Reproduction Based on Distance Variation Function and Artificial Reverberation. Xu, Jiawang (1,2); Wang, Xiaochen (2,3); Zhang, Maosheng (2); Yang, Cheng (2,4); Gao, Ge (2). 1: State Key Laboratory of Software Engineering, Wuhan University, China; 2: National Engineering Research Center for Multimedia Software, Computer School of Wuhan University, China; 3: Hubei Provincial Key Laboratory of Multimedia and Network Communication Engineering, Wuhan University, China; 4: School of Physics and Electronic Science, Guizhou Normal University,China.

186 Compressing Visual Descriptors of Image Sequences. Bailer, Werner; Onsori-Wechtitsch, Stefanie; Thaler, Marcus. JOANNEUM RESEARCH, Austria.

196 Effect of Junk Images on Inter-Concept Distance Measurement: Positive or Negative?. Nagasawa, Yusuke; Nakamura, Kazuaki; Nitta, Naoko; Babaguchi, Noboru. Osaka University, Japan.

197 A Virtual Reality Framework for Multimodal Imagery for Vessels in Polar Regions. Sorensen, Scott (1); Kolagunda, Abhishek (1); Mahoney, Andrew (2); Zitterbart, Daniel (3); Kambhamettu, Chandra (1). 1: University of Delaware, United States of America; 2: University of Alaska Fairbanks, United States of America; 3: Alfred Wegener Institute, Germany.

202 Multimodal Video-to-Video Linking: Turning to the Crowd for Insight and Evaluation. Eskevich, Maria (1); Larson, Martha (1); Sabetghadam, Serwah (2); Aly, Robin (3); Jones, Gareth J.F. (4); Ordelman, Roeland (3); Huet, Benoit (5). 1: Radboud University, Nijmegen, Netherlands, The; 2: Vienna University of Technology, Austria; 3: University of Twente, Netherlands, The; 4: Dublin City University, Ireland; 5: EURECOM, France.

214 Movie Recommendation via BLSTM. tang, song; wu, zhiyong; chen, kang. Tsinghua University, China, People’s Republic of.

Special Session 1:
Social Media Retrieval and Recommendation

161 Uyghur Language Text Detection in Complex Background Images Using Enhanced MSERs. Liu, Shun; Xie, Hongtao; Zhou, Chuan; Mao, Zhendong. Chinese Academy of Sciences, China, People’s Republic of.

171 Collaborative Dictionary Learning and Soft Assignment for Sparse Coding of Image Features. Liu, Jie (1); Tang, Sheng (2); Li, Yu (2). 1: College of Information and Engineering, Capital Normal University, Beijing 100048, P.R.China; 2: Institute of Computing Technology, Chinese Academy of Sciences, China.

175 Multi-Task Multi-modal Semantic Hashing for Web Image Retrieval with Limited Supervision. Xie, Liang (1); Zhu, Lei (2); Chen, Zhiyong (3). 1: Wuhan University of Technology, China, People’s Republic of; 2: School of Information Technology and Electrical Engineering, The University of Queensland, Australia; 3: School of Computing, National University of Singapore.

189 Object-based Aggregation of Deep Features for Image Retrieval. Bao, Yu; Li, Haojie. Dalian University of Technology, China, People’s Republic of.

209 Linguistic-Aware Sentiment Analysis for Social Media Messages. Su, Yu-Ting; Wang, Hui-Jing. Tianjin Univ, China, People’s Republic of.

Special Session 2:
Modeling Multimedia Behaviors

22 Demographic Attribute Inference from Social Multimedia Behaviors: a Cross-OSN Approach. Xiang, Liancheng (1,2); Sang, Jitao (1,2); Xu, Changsheng (1,2). 1: National Lab of Pattern Recognition, Institute of Automation, CAS, China, People’s Republic of; 2: University of Chinese Academy of Sciences, Beijing 100049, China.

39 Utilizing Locality-Sensitive Hash Learning for Cross-Media Retrieval. Jia, Yuhua (1); Bai, Liang (1); Wang, Peng (2); Guo, Jinlin (1); Xie, Yuxiang (1); Yu, Tianyuan (1). 1: College of Information System and Management, National University of Defense Technology; 2: Tsinghua University.

83 CELoF: WiFi Dwell Time Estimation in Free Environment. Yan, Chen; Wang, Peng; Pang, Haitian; Sun, Lifeng; Yang, Shiqiang. Tsinghua University.

129 Understanding Performance of Edge Prefetching. Pang, Zhengyuan (1); Sun, Lifeng (1); Wang, Zhi (1); Xie, Yuan (2); Yang, Shiqiang (1). 1: Tsinghua University, China, People’s Republic of; 2: Indiana University, USA.

164 User Identification by Observing Interactions with GUIs. Hinbarji, Zaher (1); Albatal, Rami (2); Gurrin, Cathal (1). 1: Dublin City University, Ireland; 2: Heystaks Technologies Ltd..

Special Session 3:
Multimedia Computing for Intelligent Life

29 A Sensor-based Official Basketball Referee Signals Recognition System Using Deep Belief Networks. Yeh, Chung-Wei; Pan, Tse-Yu; Hu, Min-Chun. National Cheng Kung University, Taiwan, Republic of China.

45 Cross-modal Recipe Retrieval: How to Cook This Dish?. Chen, Jingjing; Pang, Lei; Ngo, Chong-wah. City university of HongKong, Hong Kong S.A.R. (China).

73 Deep Learning based Intelligent Basketball Arena with Energy Image. Liu, Wu (1); Liu, Jiangyu (2); Gu, Xiaoyan (3); Liu, Kun (1); Dai, Xiaowei (2); Ma, Huadong (1). 1: Beijing University of Posts and Telecommunications; 2: Zepp Labs, Inc.; 3: Information of Information Engineering, Chinese Acadmic of Science.

92 Human Pose Tracking using Online Latent Structured Support Vector Machine. Hua, Kai-Lung (2); Sari, Irawati Nurmala (2); Yeh, Mei-Chen (1). 1: National Taiwan Normal University; 2: National Taiwan University of Science and Technology.

111 Micro-expression Recognition using Feature Fusion. Zhang, Shiyu; Feng, Bailan. Institute of Automation, China, People’s Republic of.

135 egoPotray: Visual Exploration of the Mobile Communication Signature from Egocentric Network Perspective. Wang, Qing (1); Pu, Jiansu (2); Guo, Yuanfang (3); Hu, Zheng (1); Tian, Hui (1). 1: State Key Laboratory of Networking and Switching Technology, School of Information and Communication Engineering, Beijing University of Posts and Telecommunications, Beijing 100876, China; 2: CompleX Lab, Web Sciences Center, Big Data Research Center, University of Electronic Science and Technology of China, Chengdu 611731, China; 3: State Key Laboratory of Information Security, Institute of Information Engineering, Chinese Academy of Sciences, Beijing 100093, China.

137 Compact CNN Based Video Representation for Efficient Video Copy Detection. Wang, Ling; Bao, Yu; Li, Haojie; Fan, Xin; Luo, Zhongxuan. Dalian University of Technology, China.

176 Personalized Cloth Recommender System. Sanchez-Riera, Jordi (1); Lin, Jun-Ming (2); Hua, Kai-Lung (2); Cheng, Wen-Huang (1); Tsui, Arvin Wen (3). 1: Academia Sinica, Taiwan, Republic of China; 2: Dept. of CSIE, National Taiwan University of Science and Technology; 3: Industrial Technology Research Institute.

210 Efficient multi-scale plane extraction based RGBD video segmentation. Liu, Hong; Wang, Jun; Wang, Xiangdong; Qian, Yueliang. Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China, China, People’s Republic of.

Special Session 4:
Multimedia and Multimodal Interaction for Health and Basic Care Applications

8 Deep Learning of Shot Classification in Gynecologic Surgery Videos. Petscharnig, Stefan; Schöffmann, Klaus. Alpen Adria Universität Klagenfurt, Austria.

24 Description Logics and Rules for Multimodal Situational Awareness in Healthcare. Meditskos, Georgios; Vrochidis, Stefanos; Kompatsiaris, Ioannis. Information Technologies Institute, CERTH, Greece.

43 Classification of sMRI for AD diagnosis with Deep Neuronal Networks techniques: a pilot 2D + e study. Aderghal, Karim (1,3); Boissenin, Manuel (4); Benois-Pineau, Jenny (1); Catheline, Gwenaëlle (2); Afdel, Karim (3). 1: University Bordeaux/LABRI, France; 2: CNRS UMR 5287 – INCIA; 3: University Ibn Zohr; 4: ENSEIRB/LaBRI.

74 Speech Synchronized Tongue Animation by Combining Physiology Modeling and X-ray Image Fitting. Yu, Jun. University of Science and Technology of China, China, People’s Republic of.

95 Boredom Recognition based on Users’ Spontaneous Behaviors in Multiparty Human-Robot Interactions. Shibasaki, Yasuhiro (1); Funakoshi, Kotaro (2); Shinoda, Koichi (1). 1: Tokyo Institute of Technology; 2: Honda Research Institute Japan Co., Ltd., Japan.

Demonstrations

113 V-Head: Face Detection and Alignment for Facial Augmented Reality Applications. Wang, Zhiwei; Yang, Xin. Huazhong University of Science and Technology, China, People’s Republic of.

128 A demo for Image-based Personality Test. Zhang, Huaiwen (1,2); Zhang, Jiaming (3); Sang, Jitao (1); Xu, Changsheng (1). 1: Institute of Automation, Chinese Academy of Sciences; 2: University of Chinese Academy of Sciences; 3: Shandong University of Technology.

198 A web-based service for disturbing image detection. Zampoglou, Markos (1); Papadopoulos, Symeon (1); Kompatsiaris, Yiannis (1); Spangenberg, Jochen (2). 1: Centre for Research and Technology Hellas, Greece; 2: Deutsche Welle, Berlin, Germany.

213 DeepStyleCam: A Real-time Style Transfer App on iOS. Tanno, Ryosuke; Matsuo, Shin; Shimoda, Wataru; Yanai, Keiji. The University of Electro-Communications, Tokyo, Japan.

215 An Annotation System for Egocentric Image Media. Duane, Aaron; Zhou, Jiang; Little, Suzanne; Gurrin, Cathal; Smeaton, Alan F.. Insight Centre for Data Analytics, Dublin City University, Ireland.

Video Browser Showdown

216 Video Hunter at VBS 2017. Blazek, Adam; Lokoc, Jakub; Kubon, David. Charles University, Czech Republic.

217 VERGE IN VBS 2017. Moumtzidou, Anastasia (1); Mironidis, Theodoros (1); Markatopoulou, Fotini (1,2); Andreadis, Stelios (1); Gialampoukidis, Ilias (1); Galanopoulos, Damianos (1); Ioannidou, Anastasia (1); Vrochidis, Stefanos (1); Mezaris, Vasileios (1); Kompatsiaris, Ioannis (1); Patras, Ioannis (2). 1: ITI-CERTH, Greece; 2: School of Electronic Engineering and Computer Science, QMUL, UK.

218 Storyboard-based Video Browsing Using Color and Concept Indices. Hürst, Wolfgang (1); Ip Vai Ching, Algernon (1); Schoeffmann, Klaus (2); Primus, Manfred J. (2). 1: Utrecht University, Netherlands, The; 2: Klagenfurt University, Austria.

219 Enhanced Retrieval and Browsing in the IMOTION System. Rossetto, Luca (1); Giangreco, Ivan (1); Tanase, Claudiu (1); Schuldt, Heiko (1); Dupont, Stéphane (2); Seddati, Omar (2). 1: University of Basel, Switzerland; 2: Université de Mons, Belgium.

220 Concept-Based Interactive Search System. Lu, Yi-Jie; Nguyen, Phuong Anh; Zhang, Hao; Ngo, Chong-Wah. City University of Hong Kong, Hong Kong S.A.R. (China).

221 Semantic Extraction and Object Proposal for Video Search. Nguyen, Vinh-Tiep (1); Ngo, Thanh Duc (2); Le, Duy-Dinh (2); Tran, Minh-Triet (1); Duong, Duc Anh (2); Satoh, Shin’ichi (3). 1: University of Science, VNU-HCM, Vietnam; 2: University of Information Technology, VNU-HCM, Vietnam; 3: National Institute of Informatics, Tokyo, Japan.

222 Collaborative Feature Maps for Interactive Video Search. Schoeffmann, Klaus (1); Primus, Manfred Juergen (1); Muenzer, Bernd (1); Petscharnig, Stefan (1); Karisch, Christof (1); Xu, Qing (2); Huerst, Wolfgang (3). 1: Klagenfurt University, Austria; 2: Tianjin University, China; 3: Utrecht University, The Netherlands.