» tagged pages
» logout

(Feed found, click Add Page to syndicate.) Error finding feed, please try again » Find feed title

A Blog Page allows you to add entries, for news or other time sensitive postings

(Login required to save to your tagged pages.)
(or Cancel)

Make further edits, (or Cancel)

(Login required to save to your tagged pages.)
(or Cancel)

(Editing anonymously: to be credited for your changes, login or register a new account)

Change Page Permissions? Changing these permissions will adjust who can modify this page.

Anonymous (change)
Swik Users (change)
(or Cancel)
Upload an image from your computer:
or Copy an image from a URL:
or Erase the current icon:
Icon Preview:

or Cancel

Erase search? The contents of search page and all pages directly attached to search will be erased.

or Cancel

(Editing anonymously: to be credited for your changes, login or register a new account)

other page actions:
search

search

Tags Applied to search

1 person has tagged this page:

search Wiki Pages

This page provides information about search technology and search services.

Popular links include:

Google's AJAX Search API

Hibernate Search (via Lucene Integration

Koders – search open source code

sorted by: recent | see : popular
Content Tagged search

Jeff's Search Engine Caffè: Current Open Source Search Engine Libraries

"Here is my short list of the most important open source [free] information retrieval libraries being used today that are undergoing active development as of writing."

open-source: del.icio.us tag/open-source

Microsoft Ex-Pats Developing Open Source Software Outside of Redmond

It seems that open source maven, Matt Asay along with well-known Microsoft blogger Mary Jo Foley have come to the conclusion that Microsoft doesn’t need open source. Asay contends that Microsoft’s open source activity has more to do with regulators than best practices and user collaboration.

Microsoft’s open-source charade is not about customers. It’s about regulators. Until Microsoft can convince U.S. and European regulators that its market power is not as bad as it once was, the company will need to hide behind expressions of openness.

Hence, Microsoft “opens” up its protocols (i.e., lets everyone read but not touch…without forking over cash). It inks “open” interoperability agreements with Novell and others, which actually do nothing more than bind otherwise open-source success to Microsoft’s proprietary technology. Microsoft general counsel Brad Smith acknowledges the shift, or lack thereof:

“It is (a change in philosophy) in some significant ways and yet it has also other aspects that are a continuation and we’re probably thinking a little bit about both pieces,” Smith said, explaining Microsoft’s twin thrusts of promoting intellectual property rights by encouraging interoperability among various software platforms.Business as usual. Just under the openness guise.

I suspect that’s a reasonable assumption. Though the folks working in open source software from Microsoft like Sam Ramji seem pretty sincere. With Bill Gates retiring and Microsoft’s initiatives on open source wouldn’t it be a sardonic turn of events for open source spread like a virus inside the walls of Redmond (especially since that’s how detractors likes to describe open source software).

There are a number of ventures run by ex-Microsofties who are seeing success. Maybe the the question real question is, “How far does the apple fall from the tree?”

Redmond Refugees Driving Open Source

Last week Black Duck Software run by a former Microsoft employee, Doug Levin, acquired Koders an open source repository and search engine. Likewise Software, the maker of an open source unified authentication product was started by some ex-Microsoft employees. Starting as a proprietary software maker they eventually moved to a open source development model and have gotten involved with the popular open source Samba project. Finally, MindTouch an open source wiki developer is run by ex-Microsoft employees and has seen great success with their open source project. All the companies mentioned above are run by ex-Micrsoft employees and all seem to be having decent success in open source.

Deki Wiki and Application Collaboration

One of my favorite open source projects is Deki Wiki by MindTouch. The product solves many of the same problems as Microsoft Sharepoint. More so than Sharepoint, DekiWiki extends its collaboration beyond people to the application layer. Rather than striving to be an all-in-one solution Deki Wiki boasts a robust web services API that allows for integration between other applications. Already Deki Wiki supports authentication via LDAP and Active Directory as well as authentication systems from popular open source content management systems like Wordpress, Drupal, and Joomla!.

Today was the release of MIndTouch’s v8.05 (codenamed Jay Cooke) which was largely driven by requirements from the Mozilla Foundation (who will be relaunching their developer community using Deki Wiki). I like to see open source projects use open source software when they get the chance. Sometimes I think we do that because of solidarity among free software developers. However, in the case of Deki Wiki I think that it’s safe to say that Deki Wiki is not only open source, but also best of breed. I have raved about Deki Wiki before but this release has some very cool new features. Through web services you can enable real time chat, embed Google spreadsheets, query databases, and include interactive maps from Google and Windows Live.

One feature that really caught my eye was the new polygot feature that give you the ability to provide language support at the page and user level. I suspect this feature is especially useful to companies who want to easily provide content in a number of different languages. It’s also a very nice platform for collaborative content localization.

Another variation on the theme of application interactivity, Deki Wiki v8.05 now supports, OpenSearch. OpenSearch is a collection of simple open formats for sharing search results originally developed by Amazon and A9. Beyond these features MindTouch’s newest version includes an improved file uploader and content transforms to allow users to specify selections of text for syntax highlighting, SVG, LaTex and graphs. All and all a big step forward.

Summary

The jury is out on how significant Microsoft’s open source ambitions will affect their future plans. Many people are skeptical for the obvious reasons. In the meantime I think their is a lot of cool technology being created by these Redmond Refugees.

Technorati Tags: , , , , , , ,

ShareThis

MySQL: Planet MySQL

Moving to Unicode 5.1



Google has just begun supporting Unicode 5.1, less than one month after it was released. It's now available in search, so people speaking languages such as Malayalam can now search for words containing the new characters in Unicode 5.1.

Web pages can use a variety of different character encodings, like ASCII, Latin-1, or Windows 1252, or Unicode. Most encodings can only represent a few languages, but Unicode will handle anything from Chinese to French to Arabic. We have long used Unicode as the internal format for all the text we search: any other encoding is first converted to Unicode for processing. So we regularly update to each new version of Unicode (and relevant related standards like CLDR and BCP 47) to make sure we are current. Thus Unicode plays a key role in our mission.

Uptick in native Unicode webpages

Just last December there was an interesting milestone on the web. For the first time, we found that Unicode was the most frequent encoding found on web pages, overtaking both ASCII and Western European encodings—and by coincidence, within 10 days of one another. What's more impressive than simply overtaking them is the speed with which this happened; take a look at the blue line in this graph.

You can see a long-term decline in pages encoded in ASCII (unaccented letters A through Z). More recently, there's been a significant drop in the use of encodings covering only Western European letters (ASCII and a few accented letters like Ä, Ç, and Ø). We're seeing similar declines in other language-specific encodings. Unicode, on the other hand, is showing a sharp increase in usage.

This is based on our indexing of web pages, and thus may vary somewhat from what other search engines find. However, the trends are pretty clear, and the continued rise in use of Unicode makes it even easier to do the processing for the many languages that we cover.

Google: Official Google Blog

Google AJAX Search API Blog: Flash and Server Side Access

Google has quietly released a JSON based one under the guise of supporting "Flash and other non JavaScript environments". Comes with<sep/>

json: del.icio.us/tag/json

Page 1 | Next >>
Username:
Password:
(or Cancel)