Naked on the Web

Mon, Feb 23, 2009, by Udi Oz

Search Engines

Have you ever published your personal details on the web? Now everyone can see them, thanks (or not) to a very powerful search engine.

If you have a web service that happens to be very good at searching, you probably have something that the world needs. Just ask Google. People will always seek answers in this ocean of information. Google does an impressive job by providing accurate information within fractions of seconds, which is the essence of good searching. Some companies like Cuil.com and Powerset.com are trying to compete with Google by providing different approaches to searching, but it is still hard to cope with the gigantic monster. If you want to become a leading player in searching, you better try dominating a niche in which Google is less strong.

People searches are one such niche. These search engines are dedicated to exposing information about people. Few players are competing in this game, and each attacks it from a different angle. Spock and Wink are two examples. All of them are doing a fine job but there is one relatively new player (at least PR wise) that’s doing an excellent job. Like Google it provides very accurate information in a very short period of time.

This impressive service is called Pipl.com, a New Jersey-based startup that was established in 2006 but has only recently begun to publicly emerge. Like its competitors, it enables searches for people by their first name, last name, email or username. Unlike its competitors, the information that Pipl.com provides is much more comprehensive and includes quick facts, pictures, contact details, personal profiles, blog posts, publications, documents and news on the search subject. Actually, it’s so good that it’s frightening. It seems that there’s nothing you can hide from Pipl.com. If the information has ever been published, anywhere on the web, this unstoppable search engine will find it.

Barack Obama on Pipl.com Barack Obama on Spock.com Barack Obama on Wink.com

The Secret

So what makes Pipl.com such a good people search? How does it reach the darkest corners of the internet? The thing that distinguishes Pipl.com from its competitors is that it does not go to the traditional data sources for gathering information. Other people search engines are gathering information from social networks and/or Wikipedia, and some are powering the results with human tagging. Pipl.com is taking a totally different approach by going to a mysterious source of data, called the Deep Web.

Just as the ocean’s surface has been well-explored while its depths remain largely mysterious, the internet has its deep unknown. The information that traditional search engines expose comes from only the surface of the web. This information is what current crawling technology, which search engines use to index the web, is capable of reaching. The majority of the web has actually never been searched, because it cannot be accessed by traditional search crawling techniques. The Deep Web consists of the following web pages:

  • Pages that are not linked from anywhere else
  • Dynamically generated pages
  • Password-protected pages
  • Pages that use technical ways to avoid search indexing (captcha, robots file, etc).

The deep web is considered to be much, much bigger than the surface web.

To reach the deep web, Pipl.com is using a technique called federated search, which was designed to simultaneously search several databases that maps the deep web. In addition, they filter the results using metadata comparisons to other data sources, in order to float up the most important data. This is why its results are so rich and informative. I have seen people start scratching their heads in astonishment after searching for themselves in Pipl.com. They were amazed to see their old addresses and other old contact details. They didn’t believe it could be found, or that they even ever published it.

Of course, you can argue about the privacy issue. Exposing people’s personal information is a sensitive manner – even big Google has to cope with occasional lawsuit here and there. Is it OK to expose people’s contact details and information they might want to hide? Well, I guess that since the current direction of the internet enables private people to create online identities, one must know that if he or she puts any kind of personal information on the web, it will eventually be found, and one must be responsible of what one exposes. Besides, if a search engine is being accused of finding too much information, it’s probably doing a very good job.

The Business Perspective

Pipl.com relies on an advertising-based business model. At least for now. They deliver text ads and sponsored links that give the site a white and clean look. (Remind you of some other search engine?) However, will advertising stay the only income source? I am not sure. For example, another people search engine, Zoominfo.com, uses a different model, showing some of the data for free, but requires a premium subscription – starting at $100 a month – if you want to get a more complete picture. The company is profitable, so this model seems to be working for them.

Will Pipl.com do the same? I don’t see any reason for such a good service not to. They can still stay free for the masses and only charge businesses that will use Pipl.com for business activities. Maybe they’ll develop a designated system for helping businesses recruit new employees, gather information on competitors, and identify professionals and other business-related issues.

Go Look for Them

So if you want to look for your neighbour’s criminal history, check for your girlfriend’s past lovers, discover all of your web footprints or just scare somone by showing him how much you know about him, go to Pipl.com and see it all there.

40
Liked it
43 Comments For This Post
  1. Louie Jerome Says:

    Scary stuff! Sounds like all the data protection laws we have here is UK are useless when most of it is probably plastered all over the web.

  2. Majic Says:

    I did try my name and like magic I got a lot of sites where my name exists! And all of them are really me! Scary!

  3. Ori Sonata Says:

    Well, my real name already exist in the web before I know anything about the web at all. Plus my name is quite rare so it really stand out.

    At least I am not going to get lost looking for myself :)

  4. Vincent Lacey Says:

    A great Article. I will be more vigilant when I am putting my details anywhere online especially shopping and such.

  5. kathryn Newman Says:

    Very interesting!

  6. Jerry Jose Says:

    Really interesting

  7. veronika1902 Says:

    Wow, and yes I’ve tried to type my name or my id on Google and it appeared all the sites connected with me…. Be aware and be wise on the net, i guess!

  8. carissimi Says:

    i searched my name.. haha wow..

  9. hosariwi Says:

    I rushed to read this article, only to realize that you’re not talking of literal nakedness! Quite insightful with lots to learn, including Pipl.com and federated search, among others.

  10. Raj the Tora Says:

    very good information. Have learnt a lot from this

  11. Percy Says:

    Very nice information but very scary one. Thanks for the info.

  12. Anabel Cassar Says:

    interesting article…thanks for the detailed info!

  13. rgjimenez Says:

    I did not try it yet.

  14. pogi253 Says:

    lol i’m surprise my name appears in different pages.

  15. dloriginal Says:

    Wow, that is really scarey. Great to know.

  16. dloriginal Says:

    Great information thank you

  17. bluemagic29 Says:

    Amazing.great article i ever read.

  18. lisaking Says:

    Amazing article. I will check this out.

  19. bronnamdi Says:

    Good informative article and thanks for introducing me to pipl.com.

  20. LoveDoctor Says:

    pipi.com sounds really good. I am going to see if I can find some people from the jersey area.

  21. shujaktk Says:

    great post, its very informative
    i am writing for this blog can you visit there?
    http://medicationforyou.com
    thank you

  22. helborod Says:

    great article!

  23. Leonardo da Vinci E. Says:

    Nice article. Interesting.

  24. mamtap Says:

    Interesting and real.

  25. Nikita Billett Says:

    this is very interesting and scary too. i guess the more advanced technology becomes the less likely we a private people.

  26. msaimq Says:

    Nice article

  27. LoveDoctor Says:

    This is awesome information. These days you need to do a background check on everyone. You never know who is really behind that screen. Scary right?

  28. PruandMe Says:

    Pipi and I are gonna get together! Thanks for the info!

    :)

  29. sanguine literist Says:

    truly scary…nut informative article.

  30. kalule Arthur Says:

    man this’s great thanks

  31. Leonard Says:

    A good article! You should keep writing!

  32. xphantoms Says:

    Top information

  33. Dora77 Says:

    You really scared me…..Thanks for the info.

  34. smokychristine Says:

    This is a really well written article, understandable by a non-techie like I am.

  35. surfer1969 Says:

    Nice research on those sites.I love this article of your.

Leave a Reply
comments powered by Disqus