Wednesday, March 22, 2017

Dictionary of Algorithms and Data Structures

The National Institute of Standards and Technology hosts a Dictionary of Algorithms and Data Structures on its website.   This is a great resource for students as well as experienced professionals who want to refresh their memory about the exact definition of a red-black tree.

The dictionary is edited and maintained by Paul Black and Vreda Pieterse.  It's quite extensive, with references, bibliography, and a guide for how to cite.

The URL is .

Thursday, August 6, 2015

National Consortium for Data Science "DataBytes" Webinars

Check out the National Consortium for Data Science "DataBytes" Webinars, available at:

The webinars are held on the first Wednesday of every month, excluding July.

According to the NCDS site,  DataBytes is "an NCDS-sponsored lunchtime webinar series that gives our members and the larger data community the chance to discuss the most pressing and interesting issues in data science. "

Saturday, April 11, 2015

About APIs

A recent article in JavaWorld talks about the wide variety of APIs now available for use.  While the concept of APIs is not new, recent "advances in infrastructure, type and variety of programming languages, and abundant computing resources have created new opportunities. "

The post links out to several other interesting article about the challenges and opportunities of API development.

View the article here:

Wednesday, September 24, 2014

Watson Analytics Tool

IBM has released a new tool for performing analytics on big data.   The name of the tool is "Watson Analytics," and the name refers to Watson, the famous IBM computer that won on Jeopardy. Essentially this tool allows users to pose natural language questions and get answers derived from large data sets.  An example given in the New York Times article linked below is "What high-value customers am I most likely to close sales with in the next 30 days?"     The idea is that a business user can use the tool without having to talk to a data scientist first.  The author of the New York times article ponders whether this is the "killer app" for Big Data.

Here is the link:

Saturday, January 11, 2014

New ways of representing data

The New York Times Bits blog this week has an article titled "A Makeover for Maps" that talks about cutting edge visualizations of data.  The article cites some interesting new depictions of data--such as the one created by Eric Rodendeck of  Stamen Design that creates  "...a representation of how photos spread on Facebook that looks like ice crystals forming on a car window."

The article goes on to talk about the idea of creating visualizations that can be used by different audiences -- like a dynamic chart of Nasdaq financial data that can be read one way by a pension fun manager looking for trends and another way by an SEC investigator looking into an unusual spike in trading during a given day.

Check out the full article with multiple visualization examples at

Monday, October 14, 2013

Looking at the problems on the site

You may have heard the news about people in the U.S. experiencing problems trying to sign up for healthcare on the new site.  This site is perhaps the most visible manifestation of the Affordable Care Act, aka "Obamacare."  On, users can search the "Health Insurance Marketplace" for coverage.  However, the site has been plagued by reports of slowness and poor user experience. 

Several online articles attempt to surmise what is going wrong with the site from a technical perspective.  Regardless of your personal views on the Affordable Care Act, I think these are relevant for us to read because they provide us a chance to avoid making the same (apparent) technical mistakes in our projects.  I'd like to highlight some general points that I have read so far:

  • Load test both the interface _and_ the back end functionality. 
  • User experience (UX) should not be an afterthought
  • Poor validation logic (eg requiring numbers as part of username/password but not informing user of this beforehand or in error messages) leads to extremely poor UX

For more on the technical issues with, see the following articles:

And here is a contrarian opinion, stating that the architecture of is sound, but that the project was rushed:

Friday, September 27, 2013

Google updates its search algorithm

Google has updated its search algorithm for the first time since 2010.  According to this post in the Bits Blog on the New York Times site, this is the most extensive overhaul of the algorithm since 2000.

Although Google is not revealing specific details of the new algorithm, code named "Hummingbird," the main thrust of the changes are geared toward understanding longer and more complex queries.  To that end, Google is employing its Knowledge graph, which is a map of semantic relationships.

One of the drivers for understanding longer queries is that more people are speaking queries into their phones using natural language, which is by its nature more complex than a simple keyword query typed into a search box.

For more information, see:

Forbes article:

Search Engine Land FAQ on Hummingbird:

Google "15th anniversary" blog post (includes a great "Google Search Timeline" graphic):

Google post on the Knowledge graph (2012):