logo Welcome, Guest. Please Login or Register.
2024-05-08 23:58:15 CoV Wiki
Learn more about the Church of Virus
Home Help Search Login Register
News: Everyone into the pool! Now online... the VirusWiki.

  Church of Virus BBS
  General
  Science & Technology

  Similarity measure
« previous next »
Pages: [1] Reply Notify of replies Send the topic Print 
   Author  Topic: Similarity measure  (Read 509 times)
David Lucifer
Archon
*****

Posts: 2642
Reputation: 8.93
Rate David Lucifer



Enlighten me.

View Profile WWW E-Mail
Similarity measure
« on: 2005-09-07 23:16:02 »
Reply with quote

I've been toying with the idea for a couple years now that there may exist an algorithm for determining a quantative measure of how similar two digital objects are. All information can be encoded as a sequence of ones and zeros aka a bit string. Obviously computer programs and email are encoded as bit strings but also all music, photos, books, movies, DNA, anything that can be recorded can be encoded.

What if there was a machine where you could feed in any two bit strings and it would reply with a number between 0 and 1 where 0 means they have no similarity and 1 means they are identical (0.5 means they are 50% similar). To determine the identical case is pretty easy, just make sure the bit strings match exactly, every one and zero. Cases close to identical should also be easy, if the bit strings are identical except for a few positions where one string has a zero where the other has a one.

It gets a lot trickier when similar things don't look similar at the bit level. Compare all these Mona Lisa pictures. They are different sizes, different shadings, some are cropped, many are altered, but they are undeniably similar to some extent. Or imagine two copies of the same Mona Lisa picture where one is upside down. We recognize them as very similar but it would be difficult for a machine to recognize the fact because all the ones and zeros are in different places.

The same song played by the same artist at two consecutive live shows will likely be very similar. One of those live recordings compared to the album version will be similar but less so. The same song recorded by a different artist will be less similar, but still more similar than a different song by the same artist. Two songs in the same genre (e.g. polka or heavy metal) should (generally) be more similar than comparing a polka song to a heavy metal song. And any song compared to a photo would be highly dissimilar.

Does such an algorithm exist? It must because humans do this all the time automatically. The act of recognition means that what we are perceiving is similar (in some salient way) to something we already know about. Finding this algorithm may very well be one of the keys to real AI.
Report to moderator   Logged
Walter Watts
Archon
*****

Gender: Male
Posts: 1571
Reputation: 8.88
Rate Walter Watts



Just when I thought I was out-they pull me back in

View Profile WWW E-Mail
Re:Similarity measure
« Reply #1 on: 2005-09-08 00:01:39 »
Reply with quote

[David Lucifer]

<snip>
Does such an algorithm exist? It must because humans do this all the time automatically. The act of recognition means that what we are perceiving is similar (in some salient way) to something we already know about. Finding this algorithm may very well be one of the keys to real AI.
<snip>


[Walter Watts]

Interesting question, David.
I see the human "act of recognition" as a fairly quick pattern matching process performed on multiple streams of highly dynamic massively parallel processed streaming data.

To digitally encode this "comparison snapshot" (for lack of a better term) begs the question of where to mark the beginning and the end of the comparator data. That would require headers (field information) which not only varies within each individual, but also from individual to individual.

Hmmm.....

Walter
<who has many more questions than answers>
Report to moderator   Logged

Walter Watts
Tulsa Network Solutions, Inc.


No one gets to see the Wizard! Not nobody! Not no how!
Mr. Yank
Neophyte
**

Gender: Male
Posts: 30
Reputation: 0.00




   soundsrushing@yahoo.com mr.yank@aol.in
View Profile WWW
Re:Similarity measure
« Reply #2 on: 2005-09-30 07:48:57 »
Reply with quote

[[ author reputation (0.00) beneath threshold (3)... display message ]]

Report to moderator   Logged

"By believing passionately in something that still does not exist, I've created it...
the non-existent is what I haven't sufficiently desired!"
Pages: [1] Reply Notify of replies Send the topic Print 
Jump to:


Powered by MySQL Powered by PHP Church of Virus BBS | Powered by YaBB SE
© 2001-2002, YaBB SE Dev Team. All Rights Reserved.

Please support the CoV.
Valid HTML 4.01! Valid CSS! RSS feed