visualization, etc.

A call for a visualization Twitter hashtag

October 19, 2009 · 4 Comments

The visualization community should have a de facto twitter hashtag for all things vis/infovis/vast/whateveryouwanttocallit. I find that I’m much more likely to post a link if I don’t have to blabber for a couple of paragraphs, and Twitter is perfect for that. And if we’re all posting about the same things on Twitter, we might as well decide on a hashtag. Robert Kosara’s #visweek suggestion was just about perfect, but I’m afraid #visualization is wasting too many characters. I’m fine with #vis, but it might offend the InfoVis folks (TJ, Robert: would you hashtag your posts #vis? I don’t want to have to search for #vis and #infovis, and I hate the distinction anyway)

Here are my suggestions: #visualization, #vis, #v13n.

→ 4 CommentsCategories: vis
Tagged:

VisWeek Evaluation: Longer response to the question

October 14, 2009 · Leave a Comment

(Sorry I don’t have the name of the person that asked the question) At the Q&A session of our paper, someone pointed out that the function we used only hits a fraction of the Marching Cubes cases, and that this could be trouble.

This is a great point. If you were using MMS to debug one particular algorithm (in this case, MC), you would certainly want to consider an analytical function that hits as many cases as possible. This is part of the verification pipeline that Tiago talked about earlier in the talk: the need for the constant cycle of “new tests -> verification -> more confidence -> new observations -> new tests” as a way of verifying our implementations. In this paper, we wanted a function that was both 1) complicated enough to trigger interesting behavior across a wide set of isosurfacing implementations and 2) simple enough that we could theoretically  analyze the expected convergence of the algorithm.

In the case of MC, geometric and normal convergence are pretty trivial (as Tiago briefly mentioned, it’s just Taylor series plus a dumb trick to turn algebraic distance into geometric distance). So essentially any analytical function whose isosurfaces you can get geometric distances from would work as a manufactured Marching Cubes solution.

→ Leave a CommentCategories: vis · visweek

VisWeek: give us data or source! (Also, twitter rules)

October 14, 2009 · Leave a Comment

If you spot an annoying guy asking for the source or data used in talks and paper presentations, you now know who I am. If you give me your data, I can still run my code on your data. If you give me your code, I can still run your code on my data. However, if you give me neither, you better have an incredibly well-documented writeup. Otherwise, they’re just pictures in the paper. Please, PLEASE, everyone, let’s make an extra effort to share at least part of the materials.

Also, I love Twitter for the underlying live commentary on presentations. You should really be following the #visweek hashtag.

→ Leave a CommentCategories: vis · visweek

VisWeek: general impressions and things to see

October 13, 2009 · Leave a Comment

I have not written much about what has been going on in the last couple of days, mostly because it seems like the continuous updates like I was doing last year will be impossible from the conference floor. However, I also feel like they feel better as twitter updates, and Marian Doerk has been posting a lot of these. Look for the latest about VisWeek on twitter under the #visweek hashtag, and somewhat longer updates about the conference here.
In general, I have been pretty pleased about the many different references to machine learning that I have heard and seen in the conference. I’m looking forward to “Visual Human+Machine Learning” at Vis, and I was impressed to see an entire one-day workshop/forum dedicated to bringing machine learning people and techniques to visualization. Props to FODAVA organizers!
One of the papers I’m looking forward to seeing is “Continuous Parallel Coordinates”. It is the followup paper to “Continuous Scatterplots”. After sitting at about 10 papers for VAST, I am convinced that continuous density-based plots are vastly useful, and much better than the discrete counterparts. I hope these techniques become popular.

I have not written much about what has been going on in the last couple of days, mostly because it seems like the continuous updates like I was doing last year will be impossible from the conference floor. However, I also feel like they might work better as twitter updates: Marian, TJ and Robert have been posting a lot of these. Look for the latest about VisWeek on twitter under the #visweek hashtag, and somewhat longer updates about the conference here.

In general, I have been pretty pleased about the many different references to machine learning that I have heard and seen in the conference. I’m looking forward to Visual Human+Machine Learning at Vis, and I was impressed to see an entire one-day workshop/forum dedicated to bringing machine learning people and techniques to visualization. Props to FODAVA organizers!

One of the papers I’m looking forward to seeing is Continuous Parallel Coordinates. (It is the followup to Continuous Scatterplots). After seeing 4 or 5 VAST papers that could clearly use some variant of these, I am convinced that continuous density-based plots (histograms, scatterplots, and now parallel coordinates) are vastly useful, and much better than the discrete counterparts. I hope these techniques become popular.

→ Leave a CommentCategories: vis · visweek

VAST Session 1: Spatio-Temporal Analytics

October 12, 2009 · 1 Comment

Hello everyone. I’m sitting on the first paper session of the day, but posting will be delayed since the wireless connection in the conference rooms is essentially non-existent. I have to confess I haven’t followed VAST too closely in its first few years. I was surprised to see a lot of interesting papers on the fast forward this morning, so you can expect at least some writing about them right here. Also, I would like to publicly congratulate whoever had the idea of USB proceedings: leafing through the papers on a USB drive is much easier than carrying the huge printed proceesings or having a DVD noisily spinning all the time.
The first talk is from the paper entitled “Interactive Visual Clustering of Large Collections of Trajectories”, by Andrienko and co-authors at the University of Pisa. It is (obviously) about a clustering approach to classifying trajectory data. The abstract claims that “structurally complex objects such as trajectories of moving entities cannot adequately be described as points in multi-dimensional spaces”. In my opinion, this is just incorrect (in machine learning this is widely used and known as the kernel trick). The talk did not give many details about what the clustering algorithm is actually doing, and seemed to be more about the system and the data itself than the underlying technique. Since I promised you before that I would be talking about fun things with inner product spaces, I will include an example of inner products for trajectory data.
The second talk is again about visualization of trajectory data: “Proximity-based Visualization of Movement Trace Data”, by Crnovrsanin and co-authors at UC Davis. The authors propose reducing the dimensionality of trajectory data by mapping a trajectory to a 2D line where the coordinates are time and distance to a point of interest. While this seems to be effective once you know the position of the point of interest (the case study they used were related to a bombing incident siulation), I also worry that line crossings in that diagram are perceptually important but not very informative unless the distance is small (the only thing you can say about two objects that are at distance k apart from a third object is that they’re at most at distance 2k from each other). The presenter claimed to have picked this representation over a 3D display of the lines because of the occlusion that would be present in such a display, but I’m not sure that would be significantly worse than the uninformative crossings in the 2D scenario. Their second case study was about comparing the proximity of elk and deer to roads, and show that deer tend to be closer to roads than elk. That’s a cool example, but would a simple statistic of average distance to road have been enough?
Writing these first two paragraphs, I missed the third talk by Steed and co-authors from the NRL and MSU, whose paper is entitled “Guided Analysis of Hurricane Trends Using Statistical Processes Integrated with Interactive Parallel Coordinates”, which from my completely ignorant glance and overhearing, seems to be about enriching parallel coordinate plots with statistical information.

Hello everyone. I’m sitting on the first paper session of the day, but posting will be delayed since the wireless connection in the conference rooms is essentially non-existent. I have to confess I haven’t followed VAST too closely in its first few years. I was surprised to see a lot of interesting papers on the fast forward this morning, so you can expect at least some writing about them right here. Also, I would like to publicly congratulate whoever had the idea of USB proceedings: leafing through the papers on a USB drive is much easier than carrying the huge printed proceesings or having a DVD noisily spinning all the time.

The first talk is from the paper entitled “Interactive Visual Clustering of Large Collections of Trajectories”, by Andrienko and co-authors at the University of Pisa. It is (obviously) about a clustering approach to classifying trajectory data. The abstract claims that “structurally complex objects such as trajectories of moving entities cannot adequately be described as points in multi-dimensional spaces”. In my opinion, this is incorrect (in machine learning this is widely used and known as the kernel trick). The talk did not give many details about what the clustering algorithm is actually doing, and seemed to be more about the system and the data itself than the underlying technique. Since I promised you before that I would be talking about fun things with inner product spaces, I will include an example of inner products for trajectory data.

The second talk is again about visualization of trajectory data: “Proximity-based Visualization of Movement Trace Data”, by Crnovrsanin and co-authors at UC Davis. The authors propose reducing the dimensionality of trajectory data by mapping a trajectory to a 2D line where the coordinates are time and distance to a point of interest. This is effective once you know the position of the point of interest (the case study they used were related to a bombing incident siulation), I worry that line crossings in that diagram are perceptually important but not very informative unless the distance is small (about the only thing you can say about two objects that are at distance k apart from a third object is that they’re at most at distance 2k from each other). The presenter claimed to have picked this representation over a 3D display of the lines because of the occlusion that would be present in such a display, but I’m not sure that would be significantly worse than the uninformative crossings in the 2D scenario. Their second case study was about comparing the proximity of elk and deer to roads, and show that deer tend to be closer to roads than elk. That’s a cool example, but would a simple statistic of average distance to road have been enough?

Writing these first two paragraphs, I missed the third talk by Steed and co-authors from the NRL and MSU, whose paper is entitled “Guided Analysis of Hurricane Trends Using Statistical Processes Integrated with Interactive Parallel Coordinates”, which from my completely ignorant 5-second glance, seems to be about enriching parallel coordinate plots with some sort of statistical information about the datasets in use.

I’m also missing the fourth talk, since I just snuck out of the room to post this. More soon!

→ 1 CommentCategories: vis · visweek

Don’t fear the 1/|grad f|

October 8, 2009 · Leave a Comment

In our Vis paper last year about continuous histograms, we show that the right continuous formulation for histograms is the integral of the inverse gradient magnitude of the scalar field over the desired isosurface:

\pi(h) = \int_{f^{-1}(x) = h} | \nabla f(x) |^{-1} \; dS

In this note, I want to show that for most well-behaved functions, you have no reason to be afraid of that infinity term that will prop up in critical points. More precisely: for Morse functions and critical values h, \lim_{x \to h} \pi(x) exists and is the same from both sides. I’ll stick with two-dimensional functions for now, but it works for higher dimensions.

Around a critical point of a function, choose a rotated coordinate system such that f(x,y) = k_0 x^2 +k_1 y^2 + \epsilon , where \epsilon are all third-order terms. Since this a Morse function, the critical point is non-degenerate and isolated, and so k_0, k_1 \neq 0 .
Then, || \nabla f(x,y) || = g \approx \sqrt{4 k_0^2 x^2 + 4 k_1^2 y^2} . First, we assume k_0 = k_1 . Then,

g = (2 \sqrt{2k}) (x^2 + y^2)^{1/2}

g = (2 \sqrt{2k}) r

Now we just compute \pi at a point around a critical point with radius r:

\int_{x^2 + y^2 = r^2} (2 \sqrt{2} r)^{-1} dS = \frac{2 \pi r}{2\sqrt{2k}r} = \pi/\sqrt{2k}

This means that around a source or a sink in 2D, the critical point tends toward a constant (since we’re getting rid of the higher-order terms). The case where k_0 \neq k_1 but they share a sign is also easy: simply upper-bound the integral by using a larger domain of integration (instead of an ellipse, use the corresponding larger circle) and integrand (use the smallest gradient over the ellipse). Both factors are finite, so the integral is also finite. Similarly, the case where k_0 = - k_1 can be solved by substituting the complicated integral over a hyperbola with an integral over a portion of the asymptotes that has greater length.

This result should not be surprising: since \pi is really the continuous analog of a histogram, \pi could only be infinite if the cumulative distribution function of the scalar field were discontinuous. This implies a thick slab of the field being constant (that is, \{ f^{-1}(h) = x \} would have positive measure). This does not happen in Morse functions (since critical values are isolated).

I believe that a similar argument holds for Sven’s continuous scatterplots. It has to, by analogy with multi-dimensional histograms. The situation is more complicated, though, because the values where their denominator is zero are the Jacobi sets of the sets of functions, and I just don’t know enough about them to be able to tell.

(Update: fixed typos. Thanks, Gordon!)

→ Leave a CommentCategories: vis

Hi from AT&T Research

October 2, 2009 · 2 Comments

Now that I’m settled here in (cold!) Madison, NJ and actually doing some work, I hope to be able to write about some of the things I have been thinking about recently. Work is exciting: a lot of very interesting data and a lot of incredibly smart people. I have a feeling that the collective IQ dropped by a few points the moment I walked in the building.

VisWeek starts next week! Last year I got the surprise of realizing people were actually reading what I was writing. I even got a “hey, you’re the blog guy” (Hi, Raghu :) ). So, expect liveblogging. Plenty of exciting changes for this year’s conference, but I particularly like the parallel Vis+InfoVis sessions. There will certainly be growing pains, but I stand by my belief that, in the long run, this is the right move for the community as a whole. Also, VisWeek is now on twitter, the updates of which will most likely be posted by yours truly – buyer beware :)

In the next few posts, I want to talk about verification in scientific visualization (and in particular about Tiago’s upcoming paper on Vis which I was a part of), and fun and games with inner product spaces.

→ 2 CommentsCategories: vis

Maneesh Agrawala’s shiny new half-mil

September 22, 2009 · Leave a Comment

Congratulations to Maneesh Agrawala for his MacArthur Fellowship award, popularly known as the “genius” fellowship:

http://sanfrancisco.bizjournals.com/sanfrancisco/stories/2009/09/21/daily43.html

Are graphics and visualization respectable now? :)

→ Leave a CommentCategories: vis

Timeout

July 9, 2009 · 3 Comments

It’s lights out from now to the defense. See you on the other side!

→ 3 CommentsCategories: etc

Why do sites even have passwords?

July 1, 2009 · 1 Comment

Every time I have to log in to a reviewer website I forget the password. I simply hit “reset password”, which takes me through a mostly-pleasant challenge-response kind of email exchange. So the question I have is: why do we even have passwords for these sites anymore? Can’t an email exchange be the default authentication mode? With cookies, the credentials can live for a longer time anyway, and this neatly pushes the password problem to the email account.

→ 1 CommentCategories: vis