<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>dariusz grabka &#187; image search</title>
	<atom:link href="http://grabka.org/internet/tag/image-search/feed/" rel="self" type="application/rss+xml" />
	<link>http://grabka.org/internet</link>
	<description>sharing is caring.</description>
	<lastBuildDate>Mon, 26 Jul 2010 18:56:25 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.1</generator>
		<item>
		<title>Starting my tagging study!</title>
		<link>http://grabka.org/internet/2008/08/starting-my-tagging-study/</link>
		<comments>http://grabka.org/internet/2008/08/starting-my-tagging-study/#comments</comments>
		<pubDate>Thu, 21 Aug 2008 05:32:59 +0000</pubDate>
		<dc:creator>dariusz</dc:creator>
				<category><![CDATA[Masters Research]]></category>
		<category><![CDATA[hci]]></category>
		<category><![CDATA[image search]]></category>
		<category><![CDATA[tagging]]></category>

		<guid isPermaLink="false">http://grabka.org/internet/?p=115</guid>
		<description><![CDATA[So I&#8217;m starting my study of tagging, and searching with keywords.  Image search has lots of really interesting nuanced problems, but the ones that interest me involve the language of the image &#8220;tag&#8221;. If you&#8217;re interested in participating, take a look at my little recruitment page: http://grabka.org/internet/flickrsearchtagging/ It&#8217;s a multi-step process that starts with you [...]]]></description>
			<content:encoded><![CDATA[<p>So I&#8217;m starting my study of tagging, and searching with keywords.  Image search has lots of really interesting nuanced problems, but the ones that interest me involve the language of the image &#8220;tag&#8221;.</p>
<p>If you&#8217;re interested in participating, take a look at my little recruitment page:</p>
<p>http://grabka.org/internet/flickrsearchtagging/</p>
<p>It&#8217;s a multi-step process that starts with you installing some software, and sending me an email with your Flickr username.  After that, it&#8217;s all simple :-)</p>
<p>Take a look at the study page if you are interested in participating.  I need about 50 people to do this.</p>
]]></content:encoded>
			<wfw:commentRss>http://grabka.org/internet/2008/08/starting-my-tagging-study/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>How Google is advancing the state of image search</title>
		<link>http://grabka.org/internet/2008/04/how-google-is-advancing-the-state-of-image-search/</link>
		<comments>http://grabka.org/internet/2008/04/how-google-is-advancing-the-state-of-image-search/#comments</comments>
		<pubDate>Thu, 01 May 2008 03:11:27 +0000</pubDate>
		<dc:creator>dariusz</dc:creator>
				<category><![CDATA[User Experience]]></category>
		<category><![CDATA[cbir]]></category>
		<category><![CDATA[google]]></category>
		<category><![CDATA[image search]]></category>

		<guid isPermaLink="false">http://grabka.org/internet/2008/45/how-google-is-advancing-the-state-of-image-search/</guid>
		<description><![CDATA[If you&#8217;re interested in the state of the art of large domain (internet) image search, then undoubtedly Google comes up over and over again. Google Image Search, with its simple interface and reasonable results, is the de-facto consumer-grade image search engine. Offerings from competitors are actually a little more feature rich, especially the MSN Live [...]]]></description>
			<content:encoded><![CDATA[<p>If you&#8217;re interested in the state of the art of large domain (internet) image search, then undoubtedly Google comes up over and over again.</p>
<p><a href="http://images.google.com/">Google Image Search</a>, with its simple interface and reasonable results, is the de-facto consumer-grade image search engine. Offerings from competitors are actually a little more feature rich, especially the <a href="http://www.live.com/?searchonly=true&amp;mkt=en-ca&amp;scope=images">MSN Live Image Search</a>, but don&#8217;t resonate as loudly in academia or popular usage.</p>
<p>As an example, compare search results for &#8220;red corvette&#8221; from the big three:  <a href="http://images.google.ca/images?hl=en&amp;client=firefox&amp;rls=FlockInc.:en-US:official&amp;hs=z1i&amp;q=red%20corvette&amp;um=1&amp;ie=UTF-8&amp;sa=N&amp;tab=wi">Google</a>, <a href="http://search.live.com/images/results.aspx?q=red+corvette&amp;go=Search+Images&amp;mkt=en-ca&amp;scope=&amp;FORM=LIVSOP">MSN</a>, <a href="http://images.search.yahoo.com/search/images;_ylt=A0WTefiIJxlI5IAAoaGLuLkF?ei=utf-8&amp;fr=sfp&amp;p=red+corvette&amp;iscqry=">Yahoo</a>.  MSN nails the exploratory task: no-refresh scrolling, quick access to filters such as &#8220;photos&#8221;, &#8220;black and white&#8221;, and image size options that feel a little more usable and natural than Googles.  Yahoo! attempts some categorising and support for ontologies in their interface; while not perfect, it&#8217;s a direction highly praised in cutting-edge research.</p>
<p><span id="more-45"></span></p>
<h2>Focus on a subset</h2>
<p>Despite the simple Image Search interface, Google has been very active in image search research, particularly in hopes of making their results more relevant.</p>
<p>First off, there&#8217;s the <a href="http://blogoscoped.com/archive/2007-05-28-n84.html">neat little trick that allows you to search for faces</a>.</p>
<p>In general, the goal is to reach a higher level of result-relevancy.  In order to do that, a service need sto analyse images more deeply than by just reading the text that surrounds pictures.  Image analysis that involves a computer &#8220;seeing&#8221; an image and extracting features and content is very expensive in terms of computation.   So in the interest of progress, despite the heavy computing cost, the New York Times reports how Google plans to <a href="http://www.nytimes.com/2008/04/28/technology/28google.html?_r=2&amp;ref=business&amp;oref=slogin&amp;oref=slogin">roll out some of their image analysis features on a subset of internet images</a>, rather than try to analyse all the images on the whole of the internet. Google is focusing on the top several thousand topics of interest based on query popularity, and popularity in online shopping (iPods, Wiis, Air Force Ones, etc.)</p>
<p>This concept of focusing on a subset of internet images related to shopping isn&#8217;t new:  <a href="http://www.like.com/">like.com</a> has been in the process of <a href="http://munjal.typepad.com/recognizing_deven/2008/02/likecom-live-wi.html">launching and relaunching</a> (that&#8217;s a link to the CEO&#8217;s blog) its visual search service since early 2007. Their service helps you shop the way you do in real life &#8211; visually, with visual feedback and comparison of how things actually look compared to similar items in similar price brackets.</p>
<h2>Community Tagging</h2>
<p>While image analysis and retrieval based on image features is the future, in the mean time we have <em>tags</em>.  Tags are human-contributed textual descriptions of images. There are a variety of problems with tags (inconsitencies, incompleteness, subjectivity, etc.), but their presence reduces the image search problem to a far more manageable text search problem.</p>
<p>With this in mind, Google has quietly launched the <a href="http://images.google.com/imagelabeler/">Google Image Labeler</a>, a game in which two randomly matched people tag an image co-operatively, and score points.  Using a games to entice people to contribute annotation information is cool, but it appears to suffer from over-simplification:  a picture of a migrating albatross is far more likely to be tagged as &#8220;bird&#8221; rather than as &#8220;transatlantic migration&#8221; or &#8220;albatross.&#8221;  Though I&#8217;ve read some promising research that focuses on <a href="http://www.springerlink.com/content/76x9j562653k0378/">developing better tools</a> that aid in annotation of images, which will hopefully lead to tags that are more complete, categorised, and community edited.</p>
<h2>Leveraging Recommendations</h2>
<p>Google has done quite a bit of development and <a href="http://www2007.org/paper570.php">research on the topic of recommendations</a>, especially recommendations related to their <a href="http://news.google.ca/">Google News</a> service.</p>
<p>I can&#8217;t help but think of the possibilities of using recommendation algorithms enhance image search.  A service can push new image content to the forefront during queries, inter-mixed with the most relevant results, and allow people searching to annotate the recommended images.  This would quickly collect information on how relevant that new image is to that query.  Users could dismiss the image as irrelevant by never clicking on it, despite how often it appears in the first row of the result set.</p>
<p>That kind of information would provide a feedback loop for the developers of the relevancy algorithms, and potentially also allow for very personal image results: if those recommendations and your responses to them were kept around and associated with your Google account, future results could take advantage of your previous feedback.</p>
<p>There is much progress to be made in image search, and based on how often I come across interesting and relevant Google research in the field, it&#8217;s obvious that Google will continue to be a leader as image search matures.</p>
<div class="flockcredit" style="text-align: right; color: #CCC; font-size: x-small;">Blogged with the <a style="color: #999; font-weight: bold;" title="Flock Browser" href="http://www.flock.com/blogged-with-flock" target="_new">Flock Browser</a></div>
]]></content:encoded>
			<wfw:commentRss>http://grabka.org/internet/2008/04/how-google-is-advancing-the-state-of-image-search/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Defining the human context for search and retrieval.</title>
		<link>http://grabka.org/internet/2008/03/defining-the-human-context-for-search-and-retrieval/</link>
		<comments>http://grabka.org/internet/2008/03/defining-the-human-context-for-search-and-retrieval/#comments</comments>
		<pubDate>Thu, 27 Mar 2008 22:22:42 +0000</pubDate>
		<dc:creator>dariusz</dc:creator>
				<category><![CDATA[Masters Research]]></category>
		<category><![CDATA[image search]]></category>
		<category><![CDATA[taxonomy]]></category>

		<guid isPermaLink="false">http://grabka.org/internet/2008/35/defining-the-human-context-for-search-and-retrieval/</guid>
		<description><![CDATA[I&#8217;m trying to wrap my head around all of the information I&#8217;m reading about image search. Search, retrieval, information, data, all of these terms are loaded, and used differently depending on whether I&#8217;m reading an HCI paper, a text analysis paper, or a blog post about search engine optimisation (SEO). In hopes of simplifying things, [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;m trying to wrap my head around all of the information I&#8217;m reading about image search.  Search, retrieval, information, data, all of these terms are loaded, and used differently depending on whether I&#8217;m reading an HCI paper,  a text analysis paper, or a blog post about search engine optimisation (SEO).</p>
<p>In hopes of simplifying things, I&#8217;ve settled on a human-centred, conceptual definition of search:</p>
<blockquote><p><strong>Search </strong>refers to the process of a user developing a need, defining a query, retrieving information, viewing result(s), providing feedback, and refinements of those steps.</p></blockquote>
<p>The end result does not have to be finding a single result.  Occasionally, other steps in the search process, such as seeing a result set, can satisfy the users need.  For example, if the need was to gain information (&#8220;What does a <a href="http://www.cardomain.com/ride/2241226" title="olds toro fe3, my first car :)">&#8217;87 Oldsmobile Toronado FE3</a> look like?&#8221;) rather than find a specific image (&#8220;I need a picture of a black <a href="http://www.cardomain.com/ride/2802428" title="84 cutlass supreme">&#8217;84 Cutty</a>!&#8221;), viewing the result set may be enough.</p>
<p>Figure 1 is an illustrates the definition of search, in the human and interface context.</p>
<p><span id="more-35"></span></p>
<p align="center"><img src="http://grabka.org/internet/wp-content/uploads/2008/03/searchresized.png" alt="Search (Small)" /><br />
Figure 1 &#8211; Defining <em>Search</em> &#8211; <a href="http://grabka.org/internet/wp-content/uploads/2008/03/search.png" title="Search (Full Size)">View full size (54KB)</a></p>
<p>Search begins when a user develops a need for the search, and visits some image search service.  The user then enters a query that they believe matches their search need (often is not the case).  The form of this query is dramatically affected by the context of the interface:  if the interface is just a keyword search bar, the user will enter keywords.  If the interface offers a sample-image upload, the user may upload a sample image.  If an interface offers domain specific search criteria, it will help the user find relevant images in that domain ex. an automobile search service that allows users to select &#8220;modified&#8221; vs. &#8220;stock&#8221; vs. &#8220;show&#8221; vehicles.</p>
<p>The query is then transformed into something that is useful for the information retrieval system.  Information retrieval can include retrieving images as well as their metadata, their categories, or tags or whatnot.  Here retrieval refers explicitly to the process done by the computers involved, rather than by the user.  The bulk of the challenges that are tackled by researchers involved in CBIR occur around this step (query-to-retrieval-to-result-set).</p>
<p>Once the retrieval process is complete, a result set can be displayed back to the user.  The format, order, and other information offered by the interface has a profound impact on a users ability to find a acceptable result, or otherwise navigate the result set.  The result set can offer enough information for the user to then want to refine their query, or even refine their initial search need.</p>
<p>While interacting with the result set the user can choose to select an individual result for closer inspection, selection, or whatever they want.  The user may offer feedback based on their reaction to either a single result, or the part of the result set they are interacting with.  The interface hopefully allows for some sort of feedback mechanism which feeds into the relevance calculations of the image retrieval system.  The result set then will update based on the feedback.  Examples of this would be finding images &#8220;more like this one&#8221;, excluding small images, offering &#8220;yes, this is perfect!&#8221;</p>
<p>A users act of selecting an image out of a result set is probably important enough to provide at least some relevance feedback.  This idea has been supported by research that claims that the vast majority of image searching work is done by interacting with the result set, well before a single image is selected.</p>
<p>The processing of feedback may not have an impact on the result set, but may improve search results in the future.  An example of this kind of feedback would be tagging an image that the user viewed.  ex. a photo of the Toronto skyline is viewed, and the user adds a tag or comment: &#8220;CN Tower at night&#8221;.  Flickr, Facebook, and other image hosting services heavily leverage this process in their search process.</p>
<p>Anyway, that&#8217;s the taxonomy I will be working with.  Feedback and commentary are most welcome.</p>
<h2>CBIR is missing the point</h2>
<p>As a side note, much of the technology mentioned in CBIR papers doesn&#8217;t give much credence to the human use element (&#8220;Why would people use this?&#8221;), nor the interface element (&#8220;How would a person make use of this, even if it was useful?&#8221;). There are many, many theses worth of papers to be written about:  picking an element used in CBIR and studying the corresponding human and interface artifacts ex. searching with an image texture in mind (&#8220;fuzzy dog&#8221;), and studying what people are expecting to find, and how an interface can facilitate that.</p>
]]></content:encoded>
			<wfw:commentRss>http://grabka.org/internet/2008/03/defining-the-human-context-for-search-and-retrieval/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Settling on a Research Topic</title>
		<link>http://grabka.org/internet/2008/02/settling-on-a-research-topic/</link>
		<comments>http://grabka.org/internet/2008/02/settling-on-a-research-topic/#comments</comments>
		<pubDate>Fri, 29 Feb 2008 20:42:46 +0000</pubDate>
		<dc:creator>dariusz</dc:creator>
				<category><![CDATA[Masters Research]]></category>
		<category><![CDATA[cbir]]></category>
		<category><![CDATA[image search]]></category>
		<category><![CDATA[user interfaces]]></category>

		<guid isPermaLink="false">http://grabka.org/internet/2008/5</guid>
		<description><![CDATA[To be perfectly honest, I was admittied into my Masters program with zero funding. In retrospect, starting a two year+ project with no guaranteed income wasn&#8217;t the greatest idea, for a variety of reasons. First, every semester I hope/pray to get a Graduate Teaching Assistant job, which luckily gets easier and easier as I accumulate [...]]]></description>
			<content:encoded><![CDATA[<p>To be perfectly honest, I was admittied into my Masters program with zero funding.  In retrospect, starting a two year+ project with no guaranteed income  wasn&#8217;t the greatest idea, for a variety of reasons.</p>
<p>First, every semester I hope/pray to get a Graduate Teaching Assistant job, which luckily gets easier and easier as I accumulate &#8220;seniority points.&#8221;</p>
<p>Second, no funding means no specified project, which means freedom to choose any research topic I please, as long as <a href="http://www.cis.uoguelph.ca/user/mwirth" title="Michael Wirth">my (very lenient/forgiving) advisor</a> is OK with it.  Well, it&#8217;s been about eight months since I&#8217;ve come back from India all ready to start researching, and only two days ago did I actually settle on a topic.</p>
<p>Eight months is a long time to pay tuition, and follow dead ends with literature reviews. Also, those months are expensive if you waste your time on partying, girls, video games, Union involvement, student government, keggers, new housemates, motorcycles, trips to Mississippi, Vancouver, Ottawa, and so on.  Well .. maybe it wasn&#8217;t a complete <em>waste</em>, per se :)</p>
<p>Finally, I&#8217;ve settled on a topic that I&#8217;m truly interested in.</p>
<p><span id="more-5"></span></p>
<h2>Usability and Image Search</h2>
<p>The topic:  usability of image retrieval interfaces for systems based on image content, rather than user-provided textual meta-data.</p>
<p>What does this mean?  Image searching (like Google Image, or Flickr search) is actually really, really complicated stuff. The systems on the back-end of the process use a variety of <em>properties</em> of an image to catalogue it in their vast library: the file name, it&#8217;s dimensions, data size, human contributed tags and categories, and a whole slew of other things. That specific information is called the <em>metadata</em>.  Most of major (Yahoo, Google, MSN, etc.) image search engines also scour the web-page content that surrounds  an image to get more metadata that can be searched.</p>
<p>Human contributed metadata is particularly meaningful: tags, categories, and file names point to the content of the image, not just its properties.   The problem with human contributed metadata is that it&#8217;s:</p>
<ul>
<li>very incomplete</li>
<li>very subjective, thus possibly inaccurate</li>
<li>very time consuming if you do want to make it accurate or complete</li>
<li>there are no standard descriptive elements in markup languages like XHTML or HTML for people to  tag their images on the web</li>
</ul>
<p>If you have a photo of a man riding a bicycle in Greece, unless you&#8217;ve tagged that picture with &#8220;man&#8221;, &#8220;bicycle&#8221;, &#8220;greece&#8221; (in four or five of the most common languages in the world), the likelihood of that image being catalogued so it can be found by an interested party is quite low.</p>
<p>Researchers since the mid-1990&#8242;s have spent a lot of time and energy in a field that hopes to automate more and more of that descriptive process, so that images can be searched on more than their properties. The field is called Content Based Image Retrieval (CBIR).  Rather than relying on metadata to describe images, one can have a computer  &#8220;see&#8221; the image and record information about it: colour, texture, salient regions, shapes, layout, etc.  This is the <em>content</em> of the image. Cataloguing all of the useful content of a very large number of images accurately, completely, and in a way that can be easily searched, is the holy grail of this field.  We are decades away from such a system, but <em>major</em> strides in that direction have already taken place.</p>
<h2>Humans + CBIR</h2>
<p>A smaller consideration of all of this is:  how is a person supposed to search for something based on content?  As in, I know a dog is fuzzy, I know that the computer can find fuzzy textures, but how do I describe to a search engine that I want to find all images of a brown, fuzzy dog?  It sounds simple, should be simple, but is not simple in the current state-of-the-art.</p>
<p>Usability is a field that concerns itself with how things are used, especially interfaces of computer programs.  I&#8217;m very interested in usability of these image search systems; image search systems that already exist, and proposed systems that will exist once we have the technology to automatically identify brown fuzzy dogs in your Flickr photos from the cottage.</p>
<p>So that&#8217;s where I&#8217;m at.  I have a general idea of the area I&#8217;m interested in, so I&#8217;m in the process of reading some review papers about the state of CBIR.  Hopefully, a literature review is coming next.</p>
]]></content:encoded>
			<wfw:commentRss>http://grabka.org/internet/2008/02/settling-on-a-research-topic/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
