Big Data means more than just Volume; it can also show up as Big Variety. There are so many science datasets available, that searching for and finding the right one is becoming harder every day. One solution is to return search results with the most relevant ones at the top. Why so difficult? Well, dataset relevancy is a little different than the ordinary relevancy rankings one would use for web pages. Dataset versioning, temporal overlap, spatial overlap, download frequency are all potential means of presenting the datasets most likely to be useful to a user.
Christopher Lynnes recently retired from NASA as System Architect for NASA’s Earth Observing System Data and Information System, known as EOSDIS. He worked on EOSDIS for 30 years, over which time he has worked multiple generations of data archive systems, search engines and interfaces... Read More →