<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>dariusz grabka &#187; cbir</title>
	<atom:link href="http://grabka.org/internet/tag/cbir/feed/" rel="self" type="application/rss+xml" />
	<link>http://grabka.org/internet</link>
	<description>sharing is caring.</description>
	<lastBuildDate>Mon, 26 Jul 2010 18:56:25 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.1</generator>
		<item>
		<title>How Google is advancing the state of image search</title>
		<link>http://grabka.org/internet/2008/04/how-google-is-advancing-the-state-of-image-search/</link>
		<comments>http://grabka.org/internet/2008/04/how-google-is-advancing-the-state-of-image-search/#comments</comments>
		<pubDate>Thu, 01 May 2008 03:11:27 +0000</pubDate>
		<dc:creator>dariusz</dc:creator>
				<category><![CDATA[User Experience]]></category>
		<category><![CDATA[cbir]]></category>
		<category><![CDATA[google]]></category>
		<category><![CDATA[image search]]></category>

		<guid isPermaLink="false">http://grabka.org/internet/2008/45/how-google-is-advancing-the-state-of-image-search/</guid>
		<description><![CDATA[If you&#8217;re interested in the state of the art of large domain (internet) image search, then undoubtedly Google comes up over and over again. Google Image Search, with its simple interface and reasonable results, is the de-facto consumer-grade image search engine. Offerings from competitors are actually a little more feature rich, especially the MSN Live [...]]]></description>
			<content:encoded><![CDATA[<p>If you&#8217;re interested in the state of the art of large domain (internet) image search, then undoubtedly Google comes up over and over again.</p>
<p><a href="http://images.google.com/">Google Image Search</a>, with its simple interface and reasonable results, is the de-facto consumer-grade image search engine. Offerings from competitors are actually a little more feature rich, especially the <a href="http://www.live.com/?searchonly=true&amp;mkt=en-ca&amp;scope=images">MSN Live Image Search</a>, but don&#8217;t resonate as loudly in academia or popular usage.</p>
<p>As an example, compare search results for &#8220;red corvette&#8221; from the big three:  <a href="http://images.google.ca/images?hl=en&amp;client=firefox&amp;rls=FlockInc.:en-US:official&amp;hs=z1i&amp;q=red%20corvette&amp;um=1&amp;ie=UTF-8&amp;sa=N&amp;tab=wi">Google</a>, <a href="http://search.live.com/images/results.aspx?q=red+corvette&amp;go=Search+Images&amp;mkt=en-ca&amp;scope=&amp;FORM=LIVSOP">MSN</a>, <a href="http://images.search.yahoo.com/search/images;_ylt=A0WTefiIJxlI5IAAoaGLuLkF?ei=utf-8&amp;fr=sfp&amp;p=red+corvette&amp;iscqry=">Yahoo</a>.  MSN nails the exploratory task: no-refresh scrolling, quick access to filters such as &#8220;photos&#8221;, &#8220;black and white&#8221;, and image size options that feel a little more usable and natural than Googles.  Yahoo! attempts some categorising and support for ontologies in their interface; while not perfect, it&#8217;s a direction highly praised in cutting-edge research.</p>
<p><span id="more-45"></span></p>
<h2>Focus on a subset</h2>
<p>Despite the simple Image Search interface, Google has been very active in image search research, particularly in hopes of making their results more relevant.</p>
<p>First off, there&#8217;s the <a href="http://blogoscoped.com/archive/2007-05-28-n84.html">neat little trick that allows you to search for faces</a>.</p>
<p>In general, the goal is to reach a higher level of result-relevancy.  In order to do that, a service need sto analyse images more deeply than by just reading the text that surrounds pictures.  Image analysis that involves a computer &#8220;seeing&#8221; an image and extracting features and content is very expensive in terms of computation.   So in the interest of progress, despite the heavy computing cost, the New York Times reports how Google plans to <a href="http://www.nytimes.com/2008/04/28/technology/28google.html?_r=2&amp;ref=business&amp;oref=slogin&amp;oref=slogin">roll out some of their image analysis features on a subset of internet images</a>, rather than try to analyse all the images on the whole of the internet. Google is focusing on the top several thousand topics of interest based on query popularity, and popularity in online shopping (iPods, Wiis, Air Force Ones, etc.)</p>
<p>This concept of focusing on a subset of internet images related to shopping isn&#8217;t new:  <a href="http://www.like.com/">like.com</a> has been in the process of <a href="http://munjal.typepad.com/recognizing_deven/2008/02/likecom-live-wi.html">launching and relaunching</a> (that&#8217;s a link to the CEO&#8217;s blog) its visual search service since early 2007. Their service helps you shop the way you do in real life &#8211; visually, with visual feedback and comparison of how things actually look compared to similar items in similar price brackets.</p>
<h2>Community Tagging</h2>
<p>While image analysis and retrieval based on image features is the future, in the mean time we have <em>tags</em>.  Tags are human-contributed textual descriptions of images. There are a variety of problems with tags (inconsitencies, incompleteness, subjectivity, etc.), but their presence reduces the image search problem to a far more manageable text search problem.</p>
<p>With this in mind, Google has quietly launched the <a href="http://images.google.com/imagelabeler/">Google Image Labeler</a>, a game in which two randomly matched people tag an image co-operatively, and score points.  Using a games to entice people to contribute annotation information is cool, but it appears to suffer from over-simplification:  a picture of a migrating albatross is far more likely to be tagged as &#8220;bird&#8221; rather than as &#8220;transatlantic migration&#8221; or &#8220;albatross.&#8221;  Though I&#8217;ve read some promising research that focuses on <a href="http://www.springerlink.com/content/76x9j562653k0378/">developing better tools</a> that aid in annotation of images, which will hopefully lead to tags that are more complete, categorised, and community edited.</p>
<h2>Leveraging Recommendations</h2>
<p>Google has done quite a bit of development and <a href="http://www2007.org/paper570.php">research on the topic of recommendations</a>, especially recommendations related to their <a href="http://news.google.ca/">Google News</a> service.</p>
<p>I can&#8217;t help but think of the possibilities of using recommendation algorithms enhance image search.  A service can push new image content to the forefront during queries, inter-mixed with the most relevant results, and allow people searching to annotate the recommended images.  This would quickly collect information on how relevant that new image is to that query.  Users could dismiss the image as irrelevant by never clicking on it, despite how often it appears in the first row of the result set.</p>
<p>That kind of information would provide a feedback loop for the developers of the relevancy algorithms, and potentially also allow for very personal image results: if those recommendations and your responses to them were kept around and associated with your Google account, future results could take advantage of your previous feedback.</p>
<p>There is much progress to be made in image search, and based on how often I come across interesting and relevant Google research in the field, it&#8217;s obvious that Google will continue to be a leader as image search matures.</p>
<div class="flockcredit" style="text-align: right; color: #CCC; font-size: x-small;">Blogged with the <a style="color: #999; font-weight: bold;" title="Flock Browser" href="http://www.flock.com/blogged-with-flock" target="_new">Flock Browser</a></div>
]]></content:encoded>
			<wfw:commentRss>http://grabka.org/internet/2008/04/how-google-is-advancing-the-state-of-image-search/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Settling on a Research Topic</title>
		<link>http://grabka.org/internet/2008/02/settling-on-a-research-topic/</link>
		<comments>http://grabka.org/internet/2008/02/settling-on-a-research-topic/#comments</comments>
		<pubDate>Fri, 29 Feb 2008 20:42:46 +0000</pubDate>
		<dc:creator>dariusz</dc:creator>
				<category><![CDATA[Masters Research]]></category>
		<category><![CDATA[cbir]]></category>
		<category><![CDATA[image search]]></category>
		<category><![CDATA[user interfaces]]></category>

		<guid isPermaLink="false">http://grabka.org/internet/2008/5</guid>
		<description><![CDATA[To be perfectly honest, I was admittied into my Masters program with zero funding. In retrospect, starting a two year+ project with no guaranteed income wasn&#8217;t the greatest idea, for a variety of reasons. First, every semester I hope/pray to get a Graduate Teaching Assistant job, which luckily gets easier and easier as I accumulate [...]]]></description>
			<content:encoded><![CDATA[<p>To be perfectly honest, I was admittied into my Masters program with zero funding.  In retrospect, starting a two year+ project with no guaranteed income  wasn&#8217;t the greatest idea, for a variety of reasons.</p>
<p>First, every semester I hope/pray to get a Graduate Teaching Assistant job, which luckily gets easier and easier as I accumulate &#8220;seniority points.&#8221;</p>
<p>Second, no funding means no specified project, which means freedom to choose any research topic I please, as long as <a href="http://www.cis.uoguelph.ca/user/mwirth" title="Michael Wirth">my (very lenient/forgiving) advisor</a> is OK with it.  Well, it&#8217;s been about eight months since I&#8217;ve come back from India all ready to start researching, and only two days ago did I actually settle on a topic.</p>
<p>Eight months is a long time to pay tuition, and follow dead ends with literature reviews. Also, those months are expensive if you waste your time on partying, girls, video games, Union involvement, student government, keggers, new housemates, motorcycles, trips to Mississippi, Vancouver, Ottawa, and so on.  Well .. maybe it wasn&#8217;t a complete <em>waste</em>, per se :)</p>
<p>Finally, I&#8217;ve settled on a topic that I&#8217;m truly interested in.</p>
<p><span id="more-5"></span></p>
<h2>Usability and Image Search</h2>
<p>The topic:  usability of image retrieval interfaces for systems based on image content, rather than user-provided textual meta-data.</p>
<p>What does this mean?  Image searching (like Google Image, or Flickr search) is actually really, really complicated stuff. The systems on the back-end of the process use a variety of <em>properties</em> of an image to catalogue it in their vast library: the file name, it&#8217;s dimensions, data size, human contributed tags and categories, and a whole slew of other things. That specific information is called the <em>metadata</em>.  Most of major (Yahoo, Google, MSN, etc.) image search engines also scour the web-page content that surrounds  an image to get more metadata that can be searched.</p>
<p>Human contributed metadata is particularly meaningful: tags, categories, and file names point to the content of the image, not just its properties.   The problem with human contributed metadata is that it&#8217;s:</p>
<ul>
<li>very incomplete</li>
<li>very subjective, thus possibly inaccurate</li>
<li>very time consuming if you do want to make it accurate or complete</li>
<li>there are no standard descriptive elements in markup languages like XHTML or HTML for people to  tag their images on the web</li>
</ul>
<p>If you have a photo of a man riding a bicycle in Greece, unless you&#8217;ve tagged that picture with &#8220;man&#8221;, &#8220;bicycle&#8221;, &#8220;greece&#8221; (in four or five of the most common languages in the world), the likelihood of that image being catalogued so it can be found by an interested party is quite low.</p>
<p>Researchers since the mid-1990&#8242;s have spent a lot of time and energy in a field that hopes to automate more and more of that descriptive process, so that images can be searched on more than their properties. The field is called Content Based Image Retrieval (CBIR).  Rather than relying on metadata to describe images, one can have a computer  &#8220;see&#8221; the image and record information about it: colour, texture, salient regions, shapes, layout, etc.  This is the <em>content</em> of the image. Cataloguing all of the useful content of a very large number of images accurately, completely, and in a way that can be easily searched, is the holy grail of this field.  We are decades away from such a system, but <em>major</em> strides in that direction have already taken place.</p>
<h2>Humans + CBIR</h2>
<p>A smaller consideration of all of this is:  how is a person supposed to search for something based on content?  As in, I know a dog is fuzzy, I know that the computer can find fuzzy textures, but how do I describe to a search engine that I want to find all images of a brown, fuzzy dog?  It sounds simple, should be simple, but is not simple in the current state-of-the-art.</p>
<p>Usability is a field that concerns itself with how things are used, especially interfaces of computer programs.  I&#8217;m very interested in usability of these image search systems; image search systems that already exist, and proposed systems that will exist once we have the technology to automatically identify brown fuzzy dogs in your Flickr photos from the cottage.</p>
<p>So that&#8217;s where I&#8217;m at.  I have a general idea of the area I&#8217;m interested in, so I&#8217;m in the process of reading some review papers about the state of CBIR.  Hopefully, a literature review is coming next.</p>
]]></content:encoded>
			<wfw:commentRss>http://grabka.org/internet/2008/02/settling-on-a-research-topic/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
