Tag Archives: Semantic Apps

Benchmark Capital, Business, Funding, Goldman Sachs, Internet, Investments, Millennium Technology Ventures, Money, Omidyar Network, Semantic Apps, Semantic Web, Software, Technology, Venture Capital, Web 2.0

Massive second round of funding for Freebase – $42 Million

January 18, 2008 Web 2.0 Innovations

Freebase, the open and shared database of the world’s knowledge, has raised a whopping $42 million in its Series B round of funding, which included Benchmark Capital and Goldman Sachs. Total funding to date is $57 million.

The investment is considerable, and comes at a time when a number of experts are betting that a more powerful, “semantic” Web is about to emerge, where data about information is much more structured than it is today.

In March 2006, Freebase received $15 million in funding from investors including Benchmark Capital, Millennium Technology Ventures and Omidyar Network.

Freebase, created by Metaweb Technologies, is an open database of the world’s information. It’s built by the community and for the community – free for anyone to query, contribute to, build applications on top of, or integrate into their websites.

Already, Freebase covers millions of topics in hundreds of categories. Drawing from large open data sets like Wikipedia, MusicBrainz, and the SEC archives, it contains structured information on many popular topics, including movies, music, people and locations – all reconciled and freely available via an open API. This information is supplemented by the efforts of a passionate global community of users who are working together to add structured information on everything from philosophy to European railway stations to the chemical properties of common food ingredients.

By structuring the world’s data in this manner, the Freebase community is creating a global resource that will one day allow people and machines everywhere to access information far more easily and quickly than they can today.

Freebase aims to “open up the silos of data and the connections between them”, according to founder Danny Hillis, speaking at the Web 2.0 Summit. Freebase is a database holding all kinds of data, exposed through an API. Because it is an open database, anyone can enter new data. An example page in Freebase looks much like a Wikipedia page. When you enter new data, the app can make suggestions about content. Topics are organized by type, and you can connect pages through links and semantic tagging. In short, Freebase is all about shared data and what you can do with it.

Here’s a video tour of how Freebase works. Freebase categorizes knowledge according to thousands of “types” of information, such as film, director or city. Those are the highest order of categorization. Underneath those types are “topics,” which are individual examples of the types, such as Annie Hall and Woody Allen. It boasts two million topics to date. This lets Freebase represent information in a structured way and support queries from web developers who want to build applications around it. It also invites people to contribute their knowledge to the database, governed by a community of editors. It offers its data under a Creative Commons license, through an open API, so that it can be used to power applications.
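To make the type/topic idea concrete, here is a minimal sketch in Python. It is not Freebase's API or data model: the `Topic` class, the `query` helper, the type names and the property names are all hypothetical, chosen only to show how typed topics make structured queries possible.

```python
from dataclasses import dataclass, field


@dataclass
class Topic:
    """One entry in the toy store: a name, its types, and a few properties."""
    name: str
    types: set
    properties: dict = field(default_factory=dict)


# Hypothetical sample data; type and property names are made up for this sketch.
TOPICS = [
    Topic("Annie Hall", {"film"}, {"directed_by": "Woody Allen", "year": 1977}),
    Topic("Manhattan", {"film"}, {"directed_by": "Woody Allen", "year": 1979}),
    Topic("Woody Allen", {"person", "director"}, {"born_in": "New York City"}),
    Topic("New York City", {"city"}, {}),
]


def query(topics, type_name, **constraints):
    """Return names of topics of the given type whose properties match every constraint."""
    return [
        t.name
        for t in topics
        if type_name in t.types
        and all(t.properties.get(key) == value for key, value in constraints.items())
    ]


films_by_allen = query(TOPICS, "film", directed_by="Woody Allen")
print(films_by_allen)
```

Because every topic carries its types and properties explicitly, a question like "which films did Woody Allen direct?" becomes a filter over structured fields rather than a keyword search over page text.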

This is one of the biggest Series B rounds of the past 12 months. And what Google is trying to do with Knol in taking on Wikipedia is probably what Freebase is trying to achieve too – replicate and commercialize the huge success of the non-profit Wikipedia.

Other semantic applications and projects include Powerset, Twine, AdaptiveBlue, Hakia, Talis, LinkedWords, NosyJoe, TrueKnowledge, among others.

Peter Rip, an investor in Twine, quickly reacted to the comparison between Freebase and Twine made by VentureBeat’s Matt Marshall.

As an investor in Twine, allow me to correct you about Twine and Metaweb’s positioning. You correctly point out that Metaweb is building a database about concepts and things on the Web. Twine is not. Twine is really more of an application than a database. It is a way for persons to share information about their interests. So they are complementary, not competitive.

What’s most important is that Twine will be able to use all the structure in something like Metaweb (and other content sources) to enrich the user’s ability to track and manage information. Think of Metaweb as a content repository and Twine as the app that uses content for specific purposes.

Twine is still in closed beta. So the confusion is understandable, especially with all the hype surrounding the category.

Nova Spivack, the founder of Twine, has also commented.

Freebase and Twine are not competitive. That should be corrected in the above article. In fact our products are very different and have different audiences. Twine is for helping people and groups share knowledge around their interests and activities. It is for managing personal and group knowledge, and ultimately for building smarter communities of interest and smarter teams.

Metaweb, by contrast, is a data source that Twine can use, but is not focused on individuals or on groups. Rather Metaweb is building a single public information database, that is similar to the Wikipedia in some respects. This is a major difference in focus and functionality. To use an analogy, Twine is more like a semantic Facebook, and Metaweb is more like a semantic Wikipedia.

Freebase is in alpha.

Freebase.com was the first Semantic App featured by Web2Innovations in its planned series of publications, in which we will try to discover, highlight and feature the next generation of web-based semantic applications, engines, platforms, mash-ups, machines, products, services, mixtures, parsers, approaches and far beyond.

The purpose of these publications is to discover and showcase today’s Semantic Web Apps and projects. We’re not going to rank them, because there is no way to rank these apps at this time – many are still in alpha and private beta.
More

http://www.metaweb.com/about/
http://freebase.com
http://roblog.freebase.com
http://venturebeat.com/2008/01/14/shared-database-metaweb-gets-42m-boost/
http://www.techcrunch.com/2008/01/16/freebase-takes-42-million/
http://www.dmwmedia.com/news/2008/01/15/freebase-developer-metaweb-technologies-gets-$42.4-million
http://www.crunchbase.com/company/freebase
http://www.readwriteweb.com/archives/10_semantic_apps_to_watch.php
http://en.wikipedia.org/wiki/Danny_Hillis
http://www.metaweb.com
http://en.wikipedia.org/wiki/Metaweb_Technologies
https://web2innovations.com/money/2007/11/30/freebase-open-shared-database-of-the-worlds-knowledge/
http://mashable.com/2007/07/17/freebase/
http://squio.nl/blog/2007/04/02/freebase-life-the-universe-and-everything/

Alexandra Investment Management, Funding, Internet, Investments, Search Engines, Semantic Apps, Semantic Web, Social Search Engines, Software, Technology, Web 2.0

Hakia takes on major search engines backed up by a small army of international investors

December 11, 2007 Web 2.0 Innovations

Hakia is the third company featured in our planned series of publications about the Semantic Web and its apps.

Hakia.com, just like Freebase and Powerset, relies heavily on semantic technologies to deliver, hopefully, better and more meaningful results to its users.

Hakia is building the Web’s new “meaning-based” (semantic) search engine with the sole purpose of improving search relevancy and interactivity, pushing the current boundaries of Web search. The benefits to the end user are search efficiency, richness of information, and time savings. The basic promise is to bring search results by meaning match – similar to the human brain’s cognitive skills – rather than by the mere occurrence (or popularity) of search terms. Hakia’s new technology is a radical departure from the conventional indexing approach, because indexing has severe limitations in handling full-scale semantic search.

Hakia’s capabilities will appeal to all Web searchers – especially those engaged in research on knowledge intensive subjects, such as medicine, law, finance, science, and literature. The mission of hakia is the commitment to search for better search.

Here is how hakia’s technology differs from that of conventional search engines.

QDEX Infrastructure

hakia’s designers broke from the decades-old indexing method and built a more advanced system called QDEX (which stands for Query Detection and Extraction) to enable semantic analysis of Web pages and “meaning-based” search.
QDEX analyzes each Web page much more intensely, dissecting it to its knowledge bits, then storing them as gateways to all possible queries one can ask.
The information density in the QDEX system is significantly higher than that of a typical index table, which is a basic requirement for undertaking full semantic analysis.
The QDEX data resides on a distributed network of fast servers using a mosaic-like data storage structure.
QDEX has superior scalability properties because data segments are independent of each other.

SemanticRank Algorithm

hakia’s SemanticRank algorithm comprises innovative solutions from the disciplines of Ontological Semantics, Fuzzy Logic, Computational Linguistics, and Mathematics.
Designed for the express purpose of higher relevancy.
Sets the stage for search based on meaning of content rather than the mere presence or popularity of keywords.
Deploys a layer of on-the-fly analysis with superb scalability properties.
Takes into account the credibility of sources among equally meaningful results.
Evolves its capacity of understanding text from BETA operation onward.

In our tests we asked Hakia three English-language questions:

Why did the stock market crash? [ http://www.hakia.com/search.aspx?q=why+did+the+stock+market+crash%3F ]
Where do I get good bagels in Brooklyn? [ http://www.hakia.com/search.aspx?q=where+can+i+find+good+bagels+in+brooklyn ]
Who invented the Internet? [ http://www.hakia.com/search.aspx?q=who+invented+the+internet ]

It returned intelligent results for all three. For example, Hakia understood that, when we asked “why,” we would be interested in results with the words “reason for”, and produced some relevant ones.

Hakia is one of the few promising alternative search engines being closely watched by Charles Knight at his blog AltSearchEngines.com, with a focus on natural language processing methods to try to deliver ‘meaningful’ search results. Hakia attempts to analyze the concept behind a search query, in particular through sentence analysis. Most other major search engines, including Google, analyze keywords. The company believes that the future of search engines will go beyond keyword analysis – search engines will talk back to you and in effect become your search assistant. One point worth noting is that Hakia currently still has some human post-editing going on, so it isn’t 100% computer-powered at this point and is closer to a human-powered search engine, or a combination of the two.

They hope to provide better search results for complex queries than Google currently offers, but they have a long way to go to catch up, considering Google’s vast lead in the search market, sophisticated technology, and rich coffers. Hakia’s semantic search technology aims to understand the meaning of search queries to improve the relevancy of the search results.

Instead of relying on indexing the web or on the popularity of particular web pages, as many search engines do, hakia tries to match the meaning of the search terms to mimic the cognitive processes of the human brain.

“We’re mainly focusing on the relevancy problem in the whole search experience,” said Dr. Berkan in an interview Friday. “You enter a question and get better relevancy and better results.”

Dr. Berkan contends that search engines that use indexing and popularity algorithms are not as reliable with combinations of four or more words since there are not enough statistics available on which to base the most relevant results.

“What we are doing is an ultimate approach, doing meaning-based searches so we understand the query and the text, and make an association between them by semantic analysis,” he said.

Analyzing whole sentences instead of keywords would definitely increase the company’s cost of indexing and processing the world’s information. The case is much the same with Powerset, which also performs deep contextual analysis of every sentence on every web page and is publicly known to have higher indexing and analysis costs than Google. Considering that Google runs more than 450,000 servers in several major data centers, and that hakia’s indexing and storage costs might be even higher, the approach could cost its investors a fortune to keep the company alive.

It would be interesting to find out whether hakia is also building its architecture on the HBase/Hadoop environment, as Powerset does.

In the context of indexing and storing the world’s information, it is worth mentioning that there is yet another start-up search engine, Cuill, that claims to have invented a technology for cheaper and faster indexing than Google’s. Cuill claims that its indexing costs will be 1/10th of Google’s, based on new search architectures and relevance methods.

On the subject of semantic textual analysis and the presentation of meaningful results, NosyJoe.com is a great example of both. It does not appear to be indexing and storing the world’s information and then applying contextual analysis to it; rather, it focuses on what is high-quality and important to the people participating in its social search engine.

A few months ago Hakia launched a new social feature called “Meet Others”. It gives you the option, from a search results page, to jump to a page on the service where everyone who searches for the topic can communicate.

For some idealized types of searching, it could be great. For example, suppose you were searching for information on a medical condition. Meet Others could connect you with other people looking for info about the condition, making an ad-hoc support group. On the Meet Others page, you’re able to add comments, or connect directly with the people on the page via anonymous e-mail or by Skype or instant messaging.

On the other hand, a social feature like Hakia’s Meet Others needs huge traffic before it becomes an effective information discovery tool. Google, with more than 500 million unique searchers per month, could easily beat such social attempts by smaller players if it decided to enlist its users, in one way or another, to find, rate the relevancy of, share and recommend results that others also search for. Such attempts by Google are already under way, as one can read here: Is Google trying to become a social search engine.

Reach

According to Quantcast, Hakia is not a very popular site, reaching fewer than 150,000 unique visitors per month. Compete reports much better numbers – slightly below 1 million uniques per month. Considering that the search engine is still in beta, these numbers are more than great. Looking further at the traffic curves on both measurement sites, the traffic hakia gets appears to be campaign-based, in other words generated by advertising, promotion or PR activity rather than steady organic traffic from heavy usage of the site.

The People

Founded in 2004, hakia is a privately held company with headquarters in downtown Manhattan. hakia operates globally with teams in the United States, Turkey, England, Germany, and Poland.

The founder of hakia is Dr. Berkan, a nuclear scientist with a specialization in artificial intelligence and fuzzy logic. He is the author of several articles in this area, including the book Fuzzy Systems Design Principles published by IEEE in 1997. Before launching hakia, Dr. Berkan worked for the U.S. Government for a decade with emphasis on information handling, criticality safety and safeguards. He holds a Ph.D. in Nuclear Engineering from the University of Tennessee, and a B.S. in Physics from Hacettepe University, Turkey. He has been developing the company’s semantic search technology with help from Professor Victor Raskin of Purdue University, who specializes in computational linguistics and ontological semantics, and is the company’s chief scientific advisor.

Dr. Berkan resisted VC firms because he worried they would demand too much control and push development too fast to get the technology to the product phase so they could earn back their investment.

When he met Dr. Raskin, he discovered they had similar ideas about search and semantic analysis, and by 2004 they had laid out their plans.

They currently have 20 programmers working on building the system in New York, and another 20 to 30 contractors working remotely from different locations around the world, including Turkey, Armenia, Russia, Germany, and Poland.
The programmers are developing the search engine so it can better handle complex queries and maybe surpass some of its larger competitors.

Management

Dr. Riza C. Berkan, Chief Executive Officer
Melek Pulatkonak, Chief Operating Officer
Tim McGuinness, Vice President, Search
Stacy Schinder, Director of Business Intelligence
Dr. Christian F. Hempelmann, Chief Scientific Officer
John Grzymala, Chief Financial Officer

Board of Directors

Dr. Pentti Kouri, Chairman
Dr. Riza C. Berkan, CEO
John Grzymala
Anuj Mathur, Alexandra Global Fund
Bill Bradley, former U.S. Senator
Murat Vargi, KVK
Ryszard Krauze, Prokom Investments

Advisory Board

Prof. Victor Raskin (Purdue University)
Prof. Yorick Wilks (Sheffield University, UK)
Mark Hughes

Investors

Hakia is known to have raised $11 million in its first round of funding from a panoply of investors scattered across the globe who were attracted by the company’s semantic search technology.

The New York-based company said it decided to snub the usual players in the venture capital community lining Silicon Valley’s Sand Hill Road and opted for its international connections instead, including financial firms, angel investors, and a telecommunications company.

Poland

Among them was Poland’s Prokom Investments, an investment group active in the oil, real estate, IT, financial, and biotech sectors.

Turkey

Another investor, Turkey’s KVK, distributes mobile telecom services and products in Turkey. Also from Turkey, angel investor Murat Vargi pitched in some funding. He is one of the founding shareholders in Turkcell, a mobile operator and the only Turkish company listed on the New York Stock Exchange.

Malaysia

In Malaysia, hakia secured funding from angel investor Lu Pat Ng, who represented his family, which has substantial investments in companies worldwide.

Finland

From Finland, hakia turned to Dr. Pentti Kouri, an economist and VC who was a member of the Nokia board in the 1980s. He has taught at Stanford, Yale, New York University, and Helsinki University, and worked as an economist at the International Monetary Fund. He is currently based in New York.

United States

In the United States, hakia received funding from Alexandra Investment Management, an investment advisory firm that manages a global hedge fund. Also from the U.S., former Senator and New York Knicks basketball player Bill Bradley has joined the company’s board, along with Dr. Kouri, Mr. Vargi, Anuj Mathur of Alexandra Investment Management, and hakia CEO Riza Berkan.

Hakia was one of the first alternative search engines to make the home page of Web 2.0 Innovations in the past year… http://web2innovations.com/hakia.com.php

Hakia.com is the 3rd Semantic App featured by Web2Innovations in its planned series of publications, in which we will try to discover, highlight and feature the next generation of web-based semantic applications, engines, platforms, mash-ups, machines, products, services, mixtures, parsers, approaches and far beyond.

The purpose of these publications is to discover and showcase today’s Semantic Web Apps and projects. We’re not going to rank them, because there is no way to rank these apps at this time – many are still in alpha and private beta.

Via

[ http://www.hakia.com/ ]
[ http://blog.hakia.com/ ]
[ http://www.hakia.com/about.html ]
[ http://www.readwriteweb.com/archives/hakia_takes_on_google_semantic_search.php ]
[ http://www.readwriteweb.com/archives/hakia_meaning-based_search.php ]
[ http://siteanalytics.compete.com/hakia.com/?metric=uv ]
[ http://www.internetoutsider.com/2007/07/the-big-problem.html ]
[ http://www.quantcast.com/search/hakia.com ]
[ http://www.redherring.com/Home/19789 ]
[ http://web2innovations.com/hakia.com.php ]
[ http://www.pandia.com/sew/507-hakia.html ]
[ http://www.searchenginejournal.com/hakias-semantic-search-the-answer-to-poor-keyword-based-relevancy/5246/ ]
[ http://arstechnica.com/articles/culture/hakia-semantic-search-set-to-music.ars ]
[ http://www.news.com/8301-10784_3-9800141-7.html ]
[ http://searchforbettersearch.com/ ]
[ https://web2innovations.com/money/2007/12/01/is-google-trying-to-become-a-social-search-engine/ ]
[ http://www.web2summit.com/cs/web2006/view/e_spkr/3008 ]

Semantic Apps, Semantic Web, Technology, Web 2.0

The Semantic Web and its Applications Today

November 29, 2007 Web 2.0 Innovations

Web 2.0 Innovations is all about discovering and showcasing innovation on the web, and we think the Semantic Web is playing a significant role in the next web transformation. Therefore, in a series of publications, we will try to discover, highlight and feature the next generation of web-based semantic applications, engines, platforms, mash-ups, machines, products, services, mixtures, parsers, approaches and far beyond.

We are not going to try to explain in detail what the Semantic Web is. There is plenty of information on the web about what the term really means. First off, it is the W3C initiative led by Tim Berners-Lee that promotes technologies like RDF, OWL and other standards for metadata. Basically, it promises to change how the web works in the first place: to meaningfully connect the different datasets around the web in a format readable and usable by both humans and robots.

The Semantic Web is a web of data. There is lots of data we all use every day, and it’s not part of the web. You can see your bank statements or travel arrangements on the web, as well as your photographs, and you can see your appointments in a calendar. But can you see your photos in a calendar to see what you were doing when you took them? Can you see the bank statement lines in a calendar?

Why not? Because we don’t have a web of data. Because data is controlled by applications, and each application keeps it to itself.

The Semantic Web is about two things. It is about common formats for integration and combination of data drawn from diverse sources, whereas the original Web mainly concentrated on the interchange of documents. It is also about language for recording how the data relates to real world objects. That allows a person, or a machine, to start off in one database, and then move through an unending set of databases which are connected not by wires but by being about the same thing.
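The calendar question can be illustrated with a small Python sketch. The records, field names and the `build_calendar` helper below are all hypothetical; the point is only that once data from separate applications is available in a common form, records that are "about the same thing" (here, the same date) can be combined.

```python
from collections import defaultdict
from datetime import date

# Hypothetical records exported from three separate applications.
photos = [
    {"file": "IMG_001.jpg", "taken_on": date(2007, 11, 3)},
    {"file": "IMG_002.jpg", "taken_on": date(2007, 11, 4)},
]
bank_lines = [
    {"payee": "Museum shop", "amount": 12.50, "posted_on": date(2007, 11, 3)},
]
appointments = [
    {"title": "Museum visit", "on": date(2007, 11, 3)},
]


def build_calendar(sources):
    """Group records from every source under the date they refer to.

    `sources` maps a source label to (records, name of that source's date field).
    """
    calendar = defaultdict(list)
    for label, (records, date_field) in sources.items():
        for record in records:
            calendar[record[date_field]].append((label, record))
    return dict(calendar)


calendar = build_calendar({
    "photo": (photos, "taken_on"),
    "bank": (bank_lines, "posted_on"),
    "appointment": (appointments, "on"),
})

# Everything known about 3 November 2007, regardless of which app it came from.
for label, record in calendar[date(2007, 11, 3)]:
    print(label, record)
```

In this sketch the caller has to tell the code which field in each source holds the date. Shared vocabularies are meant to remove exactly that manual step, by letting each source state what its fields mean.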

Some of the basic Semantic Web components include:

— XML provides an elemental syntax for content structure within documents, yet associates no semantics with the meaning of the content contained within.

— XML Schema is a language for providing and restricting the structure and content of elements contained within XML documents.

— RDF is a simple language for expressing data models, which refer to objects (“resources”) and their relationships. An RDF-based model can be represented in XML syntax.

— RDF Schema is a vocabulary for describing properties and classes of RDF-based resources, with semantics for generalized-hierarchies of such properties and classes.

— OWL adds more vocabulary for describing properties and classes: among others, relations between classes (e.g. disjointness), cardinality (e.g. “exactly one”), equality, richer typing of properties and characteristics of properties (e.g. symmetry), and enumerated classes.

— SPARQL is a protocol and query language for semantic web data sources.
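As a rough illustration of how the RDF data model and a SPARQL-style query fit together, the following Python sketch stores facts as (subject, predicate, object) triples and matches patterns containing variables against them. It is a toy under stated assumptions, not an RDF or SPARQL implementation: the identifiers are hypothetical short names rather than real URIs, and `select` handles only a conjunction of triple patterns.

```python
# Each fact is a (subject, predicate, object) triple, as in the RDF data model.
# The identifiers are hypothetical short names, not real URIs.
TRIPLES = {
    ("AnnieHall", "type", "Film"),
    ("AnnieHall", "directedBy", "WoodyAllen"),
    ("Manhattan", "type", "Film"),
    ("Manhattan", "directedBy", "WoodyAllen"),
    ("WoodyAllen", "type", "Person"),
}


def is_var(term):
    """Terms starting with '?' are variables, mirroring SPARQL's notation."""
    return isinstance(term, str) and term.startswith("?")


def match(pattern, triple, binding):
    """Try to extend `binding` so that `pattern` equals `triple`; return None on failure."""
    new = dict(binding)
    for p, t in zip(pattern, triple):
        if is_var(p):
            # Bind the variable, or fail if it is already bound to something else.
            if new.setdefault(p, t) != t:
                return None
        elif p != t:
            return None
    return new


def select(patterns, triples):
    """Return every variable binding that satisfies all patterns together."""
    bindings = [{}]
    for pattern in patterns:
        next_bindings = []
        for binding in bindings:
            for triple in triples:
                extended = match(pattern, triple, binding)
                if extended is not None:
                    next_bindings.append(extended)
        bindings = next_bindings
    return bindings


# Roughly: SELECT ?film ?who WHERE { ?film type Film . ?film directedBy ?who }
results = select([("?film", "type", "Film"), ("?film", "directedBy", "?who")], TRIPLES)
films = sorted(b["?film"] for b in results)
print(films)
```

Because the variable `?film` appears in both patterns, only subjects that are films and also have a director survive, which is the same joining behaviour a SPARQL engine provides over real RDF data.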

Here are some links to resources that cover the basics and fundamentals of the Semantic Web.

http://en.wikipedia.org/wiki/Semantic_Web

http://www.w3.org/2001/sw/

http://www.w3schools.com/semweb/default.asp

http://infomesh.net/2001/swintro/

http://www.w3.org/2000/10/swap/Primer (Getting Into RDF & Semantic Web Using N3)

http://www.w3.org/DesignIssues/Semantic (Semantic Web Roadmap)

http://purl.org/swag/whatIsSW (What Is The Semantic Web?)

http://uwimp.com/eo.htm (Semantic Web Primer)

http://logicerror.com/semanticWeb-long (Semantic Web Introduction – Long)

http://www.scientificamerican.com/2001/0501issue/0501berners-lee.html (SciAm: The Semantic Web)

http://www.xml.com/pub/a/2001/03/07/buildingsw.html (Building The Semantic Web)

http://infomesh.net/2001/06/swform/ (The Semantic Web, Taking Form)

http://www.w3.org/2001/sw/Activity (SW Activity Statement)

http://www.w3.org/2000/01/sw/ (SWAD)

If you work for, are part of, or just know about a company or team deploying the Semantic Web in one way or another, drop us a short note at info [at] web2innovations.com and we would love to feature your work here.
