Stealth Search Engine Blekko Gets Money From Marc Andreessen, SoftTech
2008 is the year of the search engine startup. Hot on the heels of Powerset’s partial launch earlier this week, stealth search engine Blekko (no logo, no website, just this) raised a second round of financing.
The company raised $3 million in equity at a $23 million post-money valuation. All previous investors participated, and new investors Marc Andreessen, SoftTech VC and Western Technology Investment also invested. They simultaneously closed a $1 million lease line with Western Technology Investment for server leases.
We don’t know much yet about Blekko, which was founded by former Topix founder/CEO Rich Skrenta. The company says they won’t be launching anything to the public until 2009. See our original post on Blekko for more background information.
See our coverage of Cuill as well, another hot stealth search startup we’re tracking.
Crunch Network: MobileCrunch Mobile Gadgets and Applications, Delivered Daily.
Powerset Launches Showcase For User Search Experience
Today marks another milestone for San Francisco based contextual search engine Powerset. They’ve launched a showcase for their user search experience - effectively the search engine minus the web crawl. For now, Powerset queries only Wikipedia and augments results with data from Freebase. The product launch comes just a day after reports that the company is being shopped to potential buyers by investment bank Allen & Co.
I have been able to test Powerset via their labs site for the last few weeks. I wrote about it last month, and the version that just launched is very similar.
There is no way to look at Powerset today and determine if it can be as disruptive to search as Google was when it launched almost a decade ago. That’s because it only queries Wikipedia, and so there is little need for proper ranking algorithms to sort the good from the bad results.
But what user can see is how effective a way it is to gather information quickly. For someone doing research, Powerset effectively removes a number of steps towards getting to the final information. It is particularly effective when the information needed is on many different web pages.
For example, a query on Powerset of “when did earthquakes hit tokyo” yields stunning results. Try this query at Google or even wikipedia to compare - instead of just picking out keywords that are in your query and on a web page, Powerset is actually making some sense of the content included in the wikipedia pages:

The way that Powerset returns queries means that answers are often found in the result snips, as above. They are also structuring a lot of the Wikipedia and (and already structured Freebase) data and inserting it into results. So a search for “Bill Clinton” shows results, but also shows Freebase structured data along with additional query refinements to get to more information. The important thing below isn’t the structured data in the results, its the fact that you can click on the action words and drill down into very specific queries (to find, for example, what bills he signed, or which Supreme Court justices he nominated, or who he slept with).

Powerset is indexing web pages much differently than normal search engines, which generally just record content to match against keyword queries. Instead, Powerset is trying to understand the content on the page so that it can be matched meaningfully to queries later. Even queries that don’t use matching words.
Indexing the web is expensive, though, and Powerset’s way of doing it requires even more time and computing power dedicated to a web page. That’s why they say they aren’t indexing the entire web yet - the company has raised just $12.5 million (plus another $8 million or so in bridge loans from investors). To index the web will require a new round of financing (see the first paragraph above about their sale/financing efforts).
Powerset is has taken a lot of criticism for their goal of trying to redefine how people search the web (including from us). But their lofty goals are what makes Silicon Valley so great - succeed or fail, Powerset is trying to do something pretty spectacular.
The company has also created a demo overview video - see below.
Crunch Network: CrunchGear drool over the sexiest new gadgets and hardware.
Powerset’s Dilemma: Go For It, Or Sell
San Francisco based search startup Powerset will be launching shortly. For now, Powerset will query only Wikipedia and Freebase. But as I said when the product was demo’d to me a few weeks ago, it is compelling nonetheless: “When I tested the service I had something very similar to the “Aha!†feeling that ran through me the first time I ever used Google. In short, it is an evolutionary, and possibly revolutionary, step forward in search.”
But now the company may have to make a hard decision: sell now to one of the big Internet players looking for a point of differentiation in search, or take the risk of going it alone and possibly getting a huge, multi-billion dollar payoff down the road.
According to our sources, Powerset is exploring both options. They hired Dave Wehner, a Managing Director at investment bank Allen & Co. (he’s the guy who sold Bebo for $850 million to AOL, and is working on LinkedIn’s huge financing), to represent them in a possible sale or financing.
CNET is reporting today that Microsoft may be bidding for the company. According to our sources, those discussions have been going on for well over a month, and their most recent bid is “around $100 million.”
That probably won’t be enough to convince Powerset and their investors to sell. The big question is whether Google will step in to try and keep Powerset out of Microsoft’s hands, and start a real bidding war. That could drive the price significantly higher. Google, however, has publicly dismissed the notion of contextual search as a revolutionary step forward.
Whether that’s true or not is yet to be seen. But Powerset may find itself as a valuable chess piece in the emerging search war between Google and Microsoft. And if Google bets wrong, they could find their commanding lead in search eroded over time. A relatively small acquisition to keep Powerset of of Microsoft’s hands, even if just a hedging move, may suddenly be attractive to them.
Crunch Network: CrunchBoard because it’s time for you to find a new Job2.0
Powerset Will Launch In Coming Weeks
San Francisco based Powerset will be publicly launching a long-awaited beta version of the service in the coming weeks, the company told me yesterday. They are working on a new kind of search engine that will understand natural language searches and compete with keyword matching engines that dominate search today.
An early version of the search engine, which was demo’d to me yesterday at their offices, has been available to some users of their Powerlabs site. But for the most part, it’s been kept very quiet.
The early version of the service will serve as a showcase for the user interface and engine itself, but it will not have a full web index behind it. For now, Powerset will query only Wikipedia and Freebase. But when I tested the service I had something very similar to the “Aha!” feeling that ran through me the first time I ever used Google. In short, it is an evolutionary, and possibly revolutionary, step forward in search.
I’ll temper that statement since the company is not putting anything more than a tiny index of two sites behind the service for now. In particular, the fact that Powerset doesn’t have to bother with spam control and other relevance issues (which is what made Google so great when it launched), means it can’t yet be considered any kind of challenger in the search space. But anyone who uses it will be able to see the potential value of the engine when it is placed in front of a full web index.
For now the company is keeping specific features of the engine confidential, but I can say it has evolved significantly since a screen shot was released in mid-2007.
In preparation for the launch, some of the Powerset team have vowed not to shave until the product is released. They are chronicling their facial hair adventure on a site called Powerstache, which has been covered by Jessica Guynn at the LA Times.
Rumors have also been swirling around the company in general. A number of sources have said that Powerset is pitching for additional capital. And the company also appears to have put plans to hire a new CEO on hold - founder Barney Pell is still firmly in charge at the company.
Powerset is one of three new search engines that we’re keeping a close eye on. The other two, Cuill (pronounced “cool”) and Blekko, are still deep in stealth mode.
Crunch Network: CrunchGear drool over the sexiest new gadgets and hardware.
Microsoft Blews Brings Back Memories Of Rocket Pops At The Beach
Ok, so that isn’t an actual picture of the new Microsoft Blews news aggregator that was announced by Microsoft Research today, but tell me that the screen shot (see below) doesn’t bring back memories of eating Rocket Pops on the beach as a child (or wherever you ate them).
But back to Blews. It’s a news aggregator (see Techmeme and about 45 others, including this gem), but it goes beyond mere clustering of stories to show what’s important right now based on who’s linking to what in near real time. Blews, which is only looking at political news, also tells you the bias of the links in to a story:
BLEWS uses political blogs to categorize news stories according to their reception in the conservative and liberal blogospheres. It visualizes information about which stories are linked to from conservative and liberal blogs, and it indicates the level of emotional charge in the discussion of the news story or topic at hand in both political camps. BLEWS also offers a “see the view from the other side†functionality, enabling a reader to compare different views on the same story from different sides of the political spectrum. BLEWS achieves this goal by digesting and analyzing a real-time feed of political-blog posts provided by the Live Labs Social Media platform, adding both link analysis and text analysis of the blog posts.
Here’s what all that looks like:

Liberal links are blue (rasberry) and on the left, conservative links are red (cherry) and on the right. The middle is the story itself in white (lemon). The dots around the edges suggest the emotional charge of the commentary, which can drip off of the Rocket Pop in very hot weather.
I note that no one on the team (Michael Gamon, Sumit Basu, Dmitriy Blenko, Danyel Fisher and Christian Konig) is a user interface specialist or web designer.
Putting aside the UI, which is hard to do, the artificial intelligence behind Blews could be interesting. It is very hard to get a machine to decipher emotion and meaning from raw text unless they are doing mere keyword searches (see, for example, Powerset). Microsoft is calling this hard bit “detecting emotional charge.” If they’ve got it right, or are close, there are an unlimited number of potential applications for the technology.
As an aside, this somewhat reminds me of ScoutLabs, a startup we wrote about last December. Scout Labs helps brand marketers track commentary on their brands, and tries to decipher emotion towards that brand as well.
Crunch Network: CrunchBoard because it’s time for you to find a new Job2.0
Find Something That Is “X†And Has “Y†With Circos
Keyword search gets you pretty far when looking for pure information, but doesn’t help much on more qualitative searches like trying to find the hippest restaurant in SOHO. Searches like the latter rely on the opinions of people, not webmasters, which is one of the reasons Circo’s has launched their new qualitative search engine. The engine currently lets users search for hotels and restaurants by qualities like size, ambiance, or other qualities pulled from reviews from around the web. They have plans to expand to other categories in the future.
Circos is categorized under the ever expanding umbrella of semantic search engines, which currently includes the likes of Hakia, PowerSet, Kosmix, SemantiNet, and TrueKnowledge. However, the engine is most like Kango, which has also taken on the task of categorizing hotels based on user reviews. VibeAgent also has a search engine for its own site that will search hotels based on qualities.
While Kango auto-generates tags after pouring through user reviews, Circo lets users search for any qualities they’re interested in. The engine then grades and ranks the results by each quality on an “A” through “F” scale based on how well the description fits for reviewers. For example, a hotel reviewers feel is spacious would rate highly if searching for openness, but poorly if you’re looking for a tiny room.
As with most search engines, Circos’ real test will be whether its application draws users away from other hotel and restaurant sites with less sophisticated search engines. Currently there are a bunch competing in the space. However, Circos says their technology can easily be extended to other categories since their algorithm does all the tough work of pulling the most relevant qualities from reviews. If hotels and restaurants don’t appeal, another category may hold their home run.
Circos is angel funded, based in San Mateo, and has eight employees (4 in Singapore).
Crunch Network: CrunchGear drool over the sexiest new gadgets and hardware.
Tags: semantic, search, techcrunch, web2.0
The Next Google Search Challenger: Blekko
Rich Skrenta, who created the first computer virus (Elk Cloner), co-founded the Open Directory Project, and co-founded online news site Topix, may have bitten off the biggest challenge of his career - taking on Google. In search.
Skrenta left Topix last June. He started his new company, Blekko, almost immediately, along with five others from the Topix core team. They raised $2 million in seed funding in September from Baseline Ventures, two early Googlers (David DesJardins and Jeremy Wenokur), and the founding team.
The company is still deep in stealth and, apparently, working out of a garage in true startup style (see image below). The Blekko website, which today has nothing on it except a picture of a puppet created by Skrenta’s daughter, isn’t even close to having a landing page up, let alone the final product. But eventually Skrenta says they’ll launch a full scale search engine to compete with the big guys.
Skrenta, who’s very media savvy, won’t say much about how he’s going to tackle search (he’s not a fan of PageRank though:“PageRank wrecked the web. Google is the cause of all of this. and Google is going down with it.”). He says they are looking at improvements on the back end (indexing and query serving) as well as the user search experience itself. Beyond that, he says we have to wait. And it might be a long wait at that. The company, Skrenta says, may not have a public prototype available until 2009.
Normally an entrepreneur announcing they’re taking on Google with a six person team and just $2 million in funding would either be laughed at or ignored. In Skrenta’s case, he has proven himself more than once as capable of taking on big challenges and winning. This will be a company to watch, and speculate on, in 2008.
There are other promising search startups out there. Powerset, Cuill and the upcoming Wikia Search Engine are all yet to launch. Mahalo is growing fast (but still tiny). Can anyone unseat Google? Perhaps not any time soon. But you don’t have to get much market share to be a huge winner in this space - every 1%, they say, is worth a cool billion dollars.
cb_widget_report_widget("cb_widget_1199397524"); cb_widget_report_element("cb_widget_0_1199397524","blekko");
Crunch Network: CrunchBoard because it’s time for you to find a new Job2.0
Google’s Norvig Is Down On Natural Language Search
Don’t expect to see natural-language search at Google anytime soon. Despite the buzz of startups like Powerset and, to a lesser degree, true knowledge, Google’s head of research Peter Norvig pooh-poohs the notion that people are clamoring to write full sentences in search boxes. In a Q&A with Technology Review, he says:
We don’t think it’s a big advance to be able to type something as a question as opposed to keywords. Typing “What is the capital of France?” won’t get you better results than typing “capital of France.”
True, true. But he does acknowledge that there is some value in the technology:
We think what’s important about natural language is the mapping of words onto the concepts that users are looking for. . . . To give some examples, “New York” is different from “York,” but “Vegas” is the same as “Las Vegas,” and “Jersey” may or may not be the same as “New Jersey.” That’s a natural-language aspect that we’re focusing on. Most of what we do is at the word and phrase level; we’re not concentrating on the sentence. We think it’s important to get the right results rather than change the interface.
In other words, a natural-language approach is useful on the back-end to create better results, but it does not present a better user experience. Most people are too lazy to type in more than one or two words into a search box anyway. The folks at both Google and Yahoo know that is true for the majority of searchers. The natural-language search startups are going to find out about that the hard way. If Google doesn’t trounce them first.
cb_widget_report_widget("cb_widget_1198078316"); cb_widget_report_element("cb_widget_0_1198078316","powerset"); cb_widget_report_element("cb_widget_1_1198078316","trueknowledge");
Crunch Network: CrunchGear drool over the sexiest new gadgets and hardware.


