Several years ago, I worked on a project where the goal was to try to come up with an "equitable" version of a measure of dependence; the idea was you could take a large multi-dimensional data set, score the dependence for each pair of variables, rank the pairs by their score, and then look at the top-scoring paris to try to determine the most interesting relationship to follow up on in further work. We were motivated by the need for data exploration tools for multi-dimensional data sets.
After a large number of years, we've updated the site http://www.exploredata.net/ , with some (finally) recently published papers, and new versions of the code that are faster, more accurate, and can do additional tasks (what we call TIC as well as MIC). Our technical information subpage has links to papers, including the relatively recent papers in JMLR and the Annals of Applied Statistics. Our MINE-Application page contains links to our new version of the code, as well as links to other versions (such as minepy, a library that has APIs in python and Matlab).
The incentive for all this was, in part, one of the co-authors, Yakir Reshef, finishing up his PhD thesis. Congratulations Yakir!
Sunday, April 15, 2018
Subscribe to:
Post Comments (Atom)
No comments:
Post a Comment