Download presentation
Presentation is loading. Please wait.
1
Visiwords John Tait Chief Scientific Officer
2
Warning A few half formed ideas from the world of image and video indexing which may be of interest to MT people Not original ideas (apart from I think the link) In fact a line of work which derives originally from MT
3
The nub Unsupervised Clustering of bundles of features –Colour, texture from image segments –Words, phrases from sentences or paragraphs ? Associate these bundles with “translations” by supervised machine learning –Categorised images –Parallel texts
4
Origins “Matching Words and Pictures”: Barnard, Duygulu, Forsyth, de Freitas, Blei, Jordan. Journal of Machine Learning Research 3 (2003) 1107-1135 “Image Classification Using Hybrid Neural Networks” Tsai, McGarry and Tait. Proceedings of the 26 th ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2003), Toronto, July, 2003. pp 431-432.
5
More or less general clusters General Concepts Specific Concepts
6
Visiwords Derived from Visiterms –These feature cluster nodes –A notion of an area of the “semantic field” –Remember these are colour, texture etc. for an area of an image …. No relation to language … or at least a very deep one
7
Matching L1 General Concepts L1 Specific Concepts L2 General Concepts L2 Specific Concepts Examples
8
Fast Forward to 2009 Better statistical models tuned to the data Much Bigger vocabularies of words categories … and lots of other advances
9
A question Is there anything like this current MT research ?
10
Concluding remarks I’m surprised this worked at all –Why should image data be coherent and cohesive ? But text is !!! Is this a better way to deal with unknown and changing vocabulary
11
Some other references A Correlation Approach for Automatic Image Annotation Hardoon, D., Saunders, C., Szedmak, S. and Shawe-Taylor, J. (2006) A Correlation Approach for Automatic Image Annotation. In: Second International Conference on Advanced Data Mining and Applications, ADMA 2006, August, Xi'an, China. Kucuktunc, O., Sevil, S. G., Tosun, A. B., Zitouni, H., Duygulu, P., and Can, F. 2008. Tag Suggestr: Automatic Photo Tag Expansion Using Visual Information for Photo Sharing Websites. In Proceedings of the 3rd international Conference on Semantic and Digital Media Technologies: Semantic Multimedia (Koblenz, Germany, December 03 - 05, 2008). D. Duke, L. Hardman, A. Hauptmann, D. Paulus, and S. Staab, Eds. Lecture Notes In Computer Science, vol. 5392. Springer-Verlag, Berlin, Heidelberg, 61-73. DOI= http://dx.doi.org/10.1007/978-3-540-92235-3_7
Similar presentations
© 2024 SlidePlayer.com Inc.
All rights reserved.