My Biased Coin: Text-book Algorithms at SODA (Guest Post, Mikkel Thorup)

Friday, December 18, 2009

Text-book Algorithms at SODA (Guest Post, Mikkel Thorup)

Mikkel Thorup sent in the following guest post:

Text-book algorithms at SODA

This is a pitch for promoting good text-book algorithms at SODA. Erdos promoted book proofs, but book algorithms are in some sense far more important in that they could end up being understood and used by every programmer with a degree in CS. This can yield a huge external impact, and I do think we do ourselves and the world a big favor taking this work seriously. Instead of taking papers on this theme (which would, incidentally, be a great idea), perhaps the area could serve as the basis for a lighter afternoon entertainment session, providing cool stuff that one could take home and show students.

To me the greatest text-book algorithms work well in both theory and practice. They have a cool non-obvious idea that will impress the students, yet, after first you get the idea, they are simple to understand and implement. Unfortunately, to get into a top conference, it is best if you also have 5-10 pages worth of complications. Typically the complications are not themselves that interesting, but they are helpful in making the paper look hard enough; otherwise some referees are bound to complain about lack of meat (fat?).

Note that by insisting on complications, we narrow the contributers to the small pond of theoreticians. Simple efficient algorithms are sought by every smart practitioner, and it no coincidence that many of the elegant algorithms theorists analyze are discovered outside theory. On the other hand, I do think theorists are the ones in the best position to develop great simple algorithms thanks to our fundamental understanding, and I think we should celebrate it when it
happens.

To be more clear, let me present a somewhat controversial example of what I consider a great text-book contribution which is buried in a paper [DHKP97] about very different issues. The example is for universal hashing (low collision probability) where the new scheme is simpler and much faster than the classic method.

Suppose we want universal hashing of w-bit keys to u-bit indices. The classic solution is to pick a prime p>2^w, and a random a in [p], and then use the hash function

h_a(x) = ((ax) mod p) mod 2^u --- math terminology.

The alternative from [DHKP97] is to let b be a random odd w-bit number and use

h_b(x) = ((bx) mod 2^w) div 2^(w-u) --- math terminology.

To prove that this is universal is a nice text book exercise using that odd numbers are relative prime to powers of two.

There may be no obvious advantage of one scheme over the other on an established theory model like the unit-cost RAM, but the difference is major in reality. Implementing the scheme from [DHKP97] is extremely simple with standard portable C-code. We exploit that C-multiplication (*) of unsigned w-bit numbers is done mod 2^w, and get

h_b(x) = (b*x) >> (w-u) --- C code.

By comparison, the implementation of the classic scheme is problematic. One issue is that the mod operation is very slow, and has been so for more than 30 years. Already when Carter and Wegman introduced universal hashing at STOC'77, they were aware of the issue.
They suggested using Mersenne primes (p=2^i-1) allowing us to bypass mod p with some faster bit-trick operations. Even using that, we still have the issue that the classic scheme requires us to compute ax exactly, and ax has more than 2w bits. Since w-bit multiplication is mod 2^w, we need 6 w-bit multiplications to compute ax in its full length, and that is even ignoring the issue of mod p. If 2w-bit multiplication is available, it suffices with two multiplications, but these are often more expensive than w-bit multiplication.

The impact of the scheme from [DHKP97] is big in that it unites theory and practice in what is probably the world's most common non-trivial inner loop. The classic prime based scheme is so slow that practitioners have come up with all kinds of alternatives that are not even remotely universal, e.g., some combination of shifts and xor, hence no entropy. The new scheme is faster than all these hacks, so now we can convince practitioners to use real universal hashing, often leading them to better more robust results.

Is the above theory? It is certainly involves a mathematical observation about the use of relative primality, and I like to think of algorithms as math with mathematically well-defined impact on computing. To get a mathematically well-defined measure, we can, for example, look at how many operations are needed in C, which has been used for efficient portable code since 1972. A theoretical irritant is that we have a whole array of measures, e.g., depending on how we count 2w-bit multiplication and mod-operations. However, the new scheme is clearly better: the variations only affect exactly how much it is better---some factor between 3 and 15.

It is a bit interesting to contrast the above situation with, say, the more robust world of polytime approximation with, say, a very well-defined difference between a worst-case factor 2 and 3. Translating to reality, if the polytime algorithm is superquadratic, it is typically too slow to finish on large scale problems. Moreover, one often gets much better results using simple heuristics with bad worst-case behavior.

For the hashing we are not just talking worst-case but all-cases (same instructions performed on all keys), and I have never tried a real computer on which the new scheme didn't gain at least a factor 4 in speed compared with the classic scheme tuned with Mersenne primes. On top of that, the new scheme is much simpler to implement. While this difference is very convincing for anyone experienced with efficient programming, it may be a bit hard to appreciate for "THEORY of Computing" conferences like STOC/FOCS. However, I see algorithms and SODA more as a "Theory of COMPUTING" with a scope closer to the reality of computing, hence with a bigger interest in text-book algorithms that unite theory and practice. Highlighting such simple, but incredibly useful, practical computing algorithms would both increase the impact of SODA (and arguably theory more generally) and provide a useful distinguishing characteristic for the conference.

[DHKP97] M. Dietzfelbinger, T. Hagerup, J. Katajainen, and M. Penttonen.
A reliable randomized algorithm for the closest-pair problem.
J. Algorithms, 25:19-51, 1997.

32 comments:

Anonymous said...: To play devil's advocate:

1. Strings are commonly used as hash keys. How do we adapt the proposed hash function to this case?

2. The fastest non-universal hash functions don't even multiply (e.g. http://www.azillionmonkeys.com/qed/hash.html ). If there's a significant performance penalty for universality in practice, why should practitioners pay it?; December 18, 2009 at 3:48 PM
Anonymous said...: I think that referees who insist on these complications are often poor referees, and they should be avoided.

There is a point to asking for some fat, though. Some complications are very natural extensions, and possibly important even if their proofs are merely technical with no new ideas. The advantage of having these extensions in the original papers is that they are put out there for posterity. The extensions are often too trivial to warrant full papers on their own, so they can otherwise become just folklore (sometimes without ever having been carefully proved).; December 18, 2009 at 5:21 PM
Unknown said...: I agree completely. I had a paper rejected from STOC a few years ago with reviewer comments indicating that text-book algorithms were verboten, e.g. "[the submission] is nice, but quite simple."; December 18, 2009 at 6:28 PM
Anonymous said...: h_a(x) = ((ax) mod p) mod 2^u --- math terminology.
....
h_b(x) = ((bx) mod 2^w) div 2^(w-u) --- math terminology.

What do you mean by "math terminology" ? What other terminology is there ?

Moreover, mathematically these statements make no sense. The notation "mod p" denotes an equivalence relation, namely

a \eqv b mod p is shorthand for

p divides a - b.

So I fail to see what is meant by the
first statement. The second is even more mysterious.

This careless attitude towards mathematical language and notation is a persistent problem in theoretical CS literature (probably due to a lack of proper training in mathematical writing). However, unless such levity with mathematical notation is gotten rid of, it is hard to take such papers seriously.; December 18, 2009 at 6:36 PM
Anonymous said...: Anon #4: for someone who is trying to be a mathematical pedant, you are doing a bad job of blurring the mathematical distinction between \bmod (a perfectly well defined binary operator in mathematics, despite C/C++/Java getting it wrong) and \pmod (an indication of which equivalence relation is being used in an equivalence).; December 18, 2009 at 6:50 PM
Anonymous said...: blurring the mathematical distinction between \bmod (a perfectly well defined binary operator in mathematics, despite C/C++/Java getting it wrong) and \pmod (an indication of which equivalence relation is being used in an equivalence).

This is really confusing.
A binary operator is not an equivalence relation. What are the definitions of \bmod and \pmod ?

The last part seems to imply that both \bmod and \pmod are equivalence relations, contradicting the first part.; December 18, 2009 at 7:49 PM
Anonymous said...: We are in serious danger of derailing the discussion in what I think is a very interesting and important post, but:

x mod y

(where y is a positive integer), written in LaTeX using \bmod, is the non-negative integer that is less than y and is congruent to x (mod y). In C/C++/Java, the "%" operator does something like this, but with the bogus "feature" that it gives negative results when x is negative to match those languages' rounding rules for division. Python gets it right.

On the other hand,

x = z (mod y),

written in LaTeX using \pmod, expresses the fact that x and z are related by the equivalence relation =(mod y).; December 18, 2009 at 8:09 PM
Unknown said...: On the off-topic thread: it is clear that C gets modular arithmetic wrong, but it is not entirely clear what the correct answer is in the unusual case that the modulous is negative. For example should 5 mod -3 equal 2 or -1?

See http://portal.acm.org/citation.cfm?id=128862; December 18, 2009 at 8:48 PM
Adam Smith said...: To get back on to the topic of the post:

I think one of the reasons that simple ideas (algorithms, models, whatever) have trouble getting accepted to conferences is that it is much harder to referee them; their value is tricky to assess.

Some reasons:

1) It is difficult to assess the novelty of something that is obvious in retrospect. (What if it was folklore? What if it was an idea that was "in the air" but not really folklore yet?) It is remarkably easy to convince yourself that you basically knew something but had never bothered to flesh out the idea... unless fleshing out the idea obviously took a month of time from someone talented and technically well-versed in the details of a field.

2) There are lots clean, nontrivial implementation ideas that aren't all that important since they are not the bottleneck in any crucial applications.

In most communities, simple nontrivial ideas can get attention as long as they change the state of the art for some important problem. For example, if you last year you had come up with a simple trick that halved the error rate of prediction algorithms for the Netflix prize, you would have gotten $1M.
And similarly, simple ideas that solve important open problems in theory (think Moser's STOC best paper) get well-deserved praise.

I like the idea of an afternoon at SODA devoted to the type of work described in Mikkel's post. IPL is also a good journal venue for them.

My question is: if we want major *theory* conferences to reward this type of work with publication in the main track, how should we evaluate the submissions?

Again: improving a widely deployed algorithm (like universal hashing) is pretty clearly a good thing; but what if I improve my own algorithm, that is not yet widely deployed? How do we measure impact reliably? Whatever the answer is, it will probably look very different from (and, my gut suggests, more subjective than) the criteria that are currently the norm in theory.; December 18, 2009 at 11:31 PM
Michael Mitzenmacher said...: Anon #4: It's pretty clear that Mikkel has used "math terminology" to contrast with the line later in his text that he labeled "C code" -- the point being is that while the two equations look quite similar at the mathematical level, they're quite different at the code implementation level. And yes, he's using mod as a binary operator here.

(Still, always nice to have some mathematical pedant to point out that mod is an equivalence relation so how can we computer scientists write things like a = 1 mod p ; I get one in my algorithms class most every year, too!)

I do hope as DE suggests we go back to comments on the actual theme of Mikkel's post, which relates to why there are so few algorithms-as-I-like-to-define-them (that is, algorithms that people might actually use in running programs) at a conference like SODA, and whether perhaps that could be changed.; December 18, 2009 at 11:34 PM
Michael Mitzenmacher said...: Adam,

I think I disagree with your final point:

"How do we measure impact reliably? Whatever the answer is, it will probably look very different from (and, my gut suggests, more subjective than) the criteria that are currently the norm in theory."

Perhaps it is my experience in networking/systems conferences, but I think that reasonable PCs can perfectly well assess how interesting an algorithm implementation idea is. In systems conferences, a major component of what they do is determine which implementations they think are interesting and will have the most impact, in terms of the actual implementation or the underlying ideas. I'd agree some of the criteria are different than those used in judging theory papers, and I'd agree that the theory community, as a whole, has not exercised this type of judgment sufficiently in, say, the last two decades and would have to "re-train those muscles," so to speak. But I think that impact (realized or potential) can be adequately judged by people who work on algorithms "in the real world", and their judgments would likely be no more (and possibly would be less) subjective than those used to decide what papers get into theory conferences currently.

The main judgment, I think, would be, "Would I or someone I know potentially find this useful?" People who code and deploy algorithms regularly will, I think, have good judgment about what sort of things pass this test. Of course, good papers will attempt to demonstrate that their algorithm is useful in real-world contexts by explaining where they could be used or actually showing their performance in real applications.; December 18, 2009 at 11:48 PM
rgrig said...: Here is my eurocent: The background knowledge of the reader determines what is complicated and what is not. With space constraints, authors must judge how much background to include. Even without space constraints, you run the risk of boring your readers if you include too much background.

How hard it is to understand a paper is mainly a function of the overlap between the background knowledge of the writer and that of the reader and secondarily a function of the author's writing skills. It is not a function of the value of the idea being presented.

A good paper should be easy to understand and useful. Of course, the trick is that "useful" means different things to different people: can be put in a program, reveals unexpected theoretical connections, etc.

Now let me tell you a secret. I wrote the above because I felt obliged to contribute to the main subject to earn the right to ask my silly(?) question.

My TAoCP copy recommends the hashing scheme h_b(x) = ((bx) mod 2^w) div 2^(w-u), with b being an odd number close to 2^w/1.62. (The golden ratio is supposed to make it work well for keys in arithmetic progression. And then says how for hashing short strings something different than the golden ration works better.) Knuth credits Floyd for 'most of these ideas'. So the main observation of [DHKP97] is that random (odd) bs gives a universal set of hash functions. Right?

PS: The complaint about 'mod' not being a binary operator illustrated perfectly the background mismatch I mentioned earlier. I know I'm evil, but I found it hilarious.; December 19, 2009 at 8:21 AM
Unknown said...: As someone who's spent far more time on computer program efficiency than on theoretical computer science, that example comes off to me as pretty cool (though I'm still wondering what the "(*)" is doing there, and a bit distressed that a computer scientist can be so theoretical that he or she forgets what "mod" means in an algorithm as opposed to in number theory). It also explains a lot about my publication record: My advisor wrote a defining textbook, renown for the brevity of its proofs and explanations. But his favorite result of mine, something sort and sweet, was rejected by referees. I had to eventually just shoehorn it in to a somewhat related paper. Your point makes the case for accepting short papers as a special case. If it were clear what was being asked for and if the acceptance rate were low enough, getting one of them in could be even more prestigious than having a full paper (Eventually I could envision the following process: Prepare the short result; if that gets rejected as too "uncomplicated," then add to it for the next conference.) But first you have to make room. Sadly, some forums (e.g., IEEE Transactions on Information Theory) have done just the opposite.; December 19, 2009 at 11:47 AM
Anonymous said...: x mod y

(where y is a positive integer), written in LaTeX using \bmod, is the non-negative integer that is less than y and is congruent to x (mod y).

This is a very bad and confusing notation that needs to be avoided. It involves an arbitrary choice of representative from a congruence class. Normally, there is no sensible choice of such a representative (for example, when one "mods" out by a vector subspace in a vector space). Moreover, it conflicts with the standard definition of "modulo" as the equivalence relation induced by quotienting by a sub-object.

In the number theoretic context, or in the case of rings of polynomials in one variable or other Euclidean domains, the notation rem(a,b) -- the remainder of a after dividing by b -- is well defined (upto multiplication by units) and should be used instead.

Unlike, what some other commentators have said, this is not a matter of mathematical pedantry at all. Language is a very (most ?) important aspect of mathematics, and should be respected as such. Choice of the right notation often makes complicated statements look completely obvious -- and the ultimate goal of mathematics is to make the "unobvious look obvious" and a huge part of that is the right choice of notation.; December 19, 2009 at 2:00 PM
Anonymous said...: "This careless attitude towards mathematical language and notation is a persistent problem in theoretical CS literature (probably due to a lack of proper training in mathematical writing). However, unless such levity with mathematical notation is gotten rid of, it is hard to take such papers seriously."

Really?? this for a notation for x mod y? It amazes me how easy you find it to belittle the mathematical maturity/training of an entire area by the somewhat different notation they use for such a simple concept, a concept that everyone understood back in highschool. Admittedly, the notation is different from what is common in number theory, but TCS is not a subfield of number theory -- and hence has its own style and variants of notation.; December 19, 2009 at 6:28 PM
Jonathan Katz said...: I'd rather see discussion of Mikkel's points here; feel free to continue this arcane and inane discussion about x mod y elsewhere.; December 19, 2009 at 6:51 PM
Anonymous said...: but TCS is not a subfield of number theory -- and hence has its own style and variants of notation.

TCS is certainly not a subfield of number theory, but it is a mathematical field. For instance, ECCC the principal repository of papers of TCS, requires that the papers it accepts have ...
" ... clear mathematical profile and
strictly mathematical format."

As a mathematical discipline TCS papers should stick to commonly accepted mathematical formalism -- deviating from these principles usually results in controversies such as Neal Koblitz's well publicized and justified criticism of TCS style cryptography.; December 19, 2009 at 7:20 PM
Anonymous said...: I'd rather see discussion of Mikkel's points here; feel free to continue this arcane and inane discussion

This discussion is not arcane and in fact might in fact be pertinent to the point in the original post.

Clearly, there are two views about TCS. From one point of view, algorithms are like any other mathematical objects, and proving bounds on their complexity is a natural mathematical endeavor (akin to proving uniform bounds in any other area of mathematics). From this point of view, questions of "practicality" is really outside the domain of TCS, best left to the "practitioners". To people who subscribe to this point of view, writing "a mod b" when one means "rem(a,b)" is a horror of horrors.

A second point of view of algorithms (that perhaps the original poster subscribes to) holds practicality as the more important goal. From this point of view mathematical rigor (and its associated baggage of culture and discipline etc.) fades into background, and such basic questions about notation becomes moot.; December 19, 2009 at 7:44 PM
Adam Smith said...: Answering Michael M. (NB: threading comments would be helpful here):

I understand that systems-oriented communities evaluate impact all the time and fairly reliably. (I have seen this in action on various "applied" security PCs and panels.) However, they do it in an application-specific context. And when they referee papers that step a little outside their specialty, they tend to be much less reliable (I am thinking, for example, of papers on privacy at database, datamining and security conferences).

The task of the SODA PC (note: I've also served on one of those) is to evaluate a paper's contribution to our broad algorithmic knowledge base. The notion of impact is much less well-defined, and this is why I think it is likely to be more subjective.

All that said, it would be great to do the experiment and try it at some upcoming SODA.

PS: As for x mod y: I encountered this notation overload in my first abstract algebra class in college. The professor explained briefly the differences between the two widespread uses of the notation, and from then on I always was able to deduce, from context, which meaning of "mod" was intended. Notation overload is common in both math and CS papers. Sometimes it's bad, sometimes it's good. I don't think it has much to do with the relative importance of mathematical depth in different fields.; December 19, 2009 at 7:59 PM
Michael Mitzenmacher said...: Hi Adam. I'm glad you agree it would be a fun thing to try!

I do wish to still register my disagreement with this paragraph:

The task of the SODA PC (note: I've also served on one of those) is to evaluate a paper's contribution to our broad algorithmic knowledge base. The notion of impact is much less well-defined, and this is why I think it is likely to be more subjective.

Sentence 1: I think the type of thing Mikkel is talking about would certainly count as a "contribution to our broad algorithmic knowledge base". I don't think he's talking about programming tricks; he's talking about connecting the theory to implementation. I believe a SODA PC could/should judge these types of papers. Some efforts, though, would have to be made; as I stated in my earlier comment, I'm willing to cede that theorists have, as a community, moved so far from implementation that many theorists would not be suitable to review such papers. [I should point out I think that's a problem, that we as a community should be trying to fix...] At the same time, I feel underqualified when having to, for example, review a quantum computing paper for FOCS/STOC; that doesn't mean such papers shouldn't be submitted or accepted when I am on the PC, or even that I can't review them!

Sentence 2: I don't see why impact is harder to judge than "quality" for other theory papers. I review a lot of papers; I see papers with valid proofs -- as valid (and sometimes moreso) than papers that are in FOCS/STOC -- that I think should be rejected from LATIN. What separates one class of papers from another? My (or, rather, the community's) subjective notion of what's "interesting", "quality" work.

Impact can be demonstrated through actual implementations and deployment (in which case I think such judgments will not be subjective), or potential impact can be gauged by suitable reviewers -- of which there are many available. Yes, there will be some subjective judgment in choosing papers for potential rather than demonstrated impact -- but such subjective judgments are made all the time, even for purely theoretical papers.; December 20, 2009 at 12:12 AM
Dan Spielman said...: I would love to see more examples of textbook algorithms. If you have more in mind, please post them!

As for getting them into SODA, I think we have a bit of a chicken-and-egg problem. Because they don't tend to appear in SODA, it is harder for a reviewer to become familiar with the state of the art. This makes it more difficult to make confident reviews.

It also makes it harder for the next textbook author to become aware of these algorithms!

Perhaps someone should start a wiki or blog about textbook algorithms?; December 20, 2009 at 11:38 AM
rgrig said...: Perhaps there could be a special track for papers on textbook algorithms.; December 20, 2009 at 12:17 PM
Mikkel Thorup said...: My main suggestion was that deriving good text book algorithms is
something that we should be competitive about, giving the best
contributions the same kind of conference credit that we do for other
important algorithmic contributions. They should lead to great talks.

Authors of text books should be good at collecting and presenting the
best publicly known algorithms, but we cannot expect them to come up
with ideas for new more elegant algorithms than those already known.

Currently, we don't really know what to do with a cool simple text book algorithm giving a really useful solution to an important
problem. They may just be discussed locally as folklore, and never
come out in public, or if they do, they may be well-hidden as part of
a solution to a different problem. Without proper published records,
it is hard to claim that one has a new better text book solution.

I don't think evaluation is a principal problem. At STOC/FOCS we
already handle apples and oranges, e.g., algorithms and crypto. Math
has a long tradition for appreciating simpler proofs, and I think we
can all appreciate a really nice text book solution. If the abstract
uses the key-word "text book (style)" then reviewers should stop
looking for complexity and instead judge based on simplicity and
elegance, ease of implementation etc.
Needless to say that we still want
a clear new angle for the problem considered, as in my hashing example where [DHKP97] find an elegant use relative primality.; December 20, 2009 at 1:54 PM
Unknown said...: Notation overload is common in both math and CS papers.

Indeed. Imagine if TCS couldn't deal with the difference between the definitions of "source code" among programmers and information theorists!; December 20, 2009 at 2:24 PM
Anonymous said...: I think we are conflating two different qualities of an algorithm here: is it simple, and is it useful (likely to have a big impact on disciplines outside of theory)? The hash example has both. We can judge simplicity immediately (and at theory conferences it is often seen as a negative), but the best test of usefulness is the test of time.

As a demonstration that it is still possible to get simple algorithms into SODA (I make no claim of usefulness), I'll point to my own paper from this coming conference. Simple algorithm for approximating max independent set: use the leaves of a depth-first search tree. Simple algorithm for approximating TSP when all distances are 1 or 2: use a depth-first ordering on the graph of distance-1 pairs. Although both problems are hard to approximate, one of these two is always a good approximation for any particular graph; the paper contains several similar results. The paper also does contain the requisite "5-10 pages of complications" that, as Mikkel observes, seem to be necessary to get something like this into a good theory conference.

If an algorithm is simple and useful, but not theoretical novel (we already know an impractical solution to universal hashing, so why do we need another more practical one) then it needs more than theory to be published on its own: algorithm experiments showing it to be much better than the alternative, for instance, leading to a paper at ALENEX. This post seems to be arguing that the difficulty of publishing a purely theoretical algorithms paper for this sort of improvement is a systemic problem, but I'm not entirely convinced: if we want to argue that an algorithm is better than prior alternatives despite not giving us any new theoretical results, shouldn't there be some justification for that claim?; December 20, 2009 at 3:05 PM
Anonymous said...: I know it's just loosely related to the main topic of the discussion:

Michael wrote:

"I don't see why impact is harder to judge than "quality" for other theory papers."

In the UK, there is a very interesting discussion about the quality vs the impact (usually, in the sense of economic impact though). See, for example, http://www.csc.liv.ac.uk/~leslie/impact/impact.html

Most of people in the UK I know wouldn't agree with your claim: it's very difficult to assess "the impact".; December 20, 2009 at 8:06 PM
Unknown said...: Dan Spielman wrote:
I would love to see more examples of textbook algorithms. If you have more in mind, please post them!

Here are two examples of papers (of mine) that aren't nearly as useful as that hashing trick but have a little text-book flavor in the naturalness of the problems and simplicity of solutions (in my biased opinion).

Yet another algorithm for dense max cut: go greedy
* simple algorithm
* implementable except that constants are universe-sized if you want theoretical guarentee
* problem natural and interesting to theoreticians but not to practitioners
* main contribution simple idea but enough complications to make it look hard
* previous work got same results more complicated ways
* SODA accept

Finding Strongly Connected Components in Parallel using O(log^2 n) Reachability Queries
* simple algorithm
* very implementable and amenable to heuristic improvements
* problem of minor interest to practitioners
* analysis quite simple (important parts about 1 page), combining existing techniques in new but non-surprising ways.
* The previous work includes some quite famous people who would have scooped me if they had found my result obvious in foresight
* STOC reject, SPAA accept

Dan Spielman wrote:
As for getting them into SODA, I think we have a bit of a chicken-and-egg problem. Because they don't tend to appear in SODA, it is harder for a reviewer to become familiar with the state of the art. This makes it more difficult to make confident reviews.

Another chicken-and-egg problem: with the current status quo the only text-book algorithms that are likely to be submitted to STOC/FOCS/SODA are those that the relevant practitioner communities are not inclined to accept at their own conferences. It's even harder for the reviewers to select the best text-book algorithms if the best aren't even submitted!; December 20, 2009 at 9:02 PM
Mikkel Thorup said...: Three points:

0) I think it would be great if we
for text-book algorithms
had something like the Math Monthly in CS, going out cheaply to a wide audience (IPL is expensive and
commercial). Perhaps a good role for
Comm. ACM.

1) I do not think it is in our interest if the best general text-book algorithms end up in applied conferences. Applied conferences
are even less likely to accept something
simple and elegant. They prefer to have it tied up in a system, and then it will be even harder to discover that this is a general purpose algorithm that goes beyond the limits of the applied conference.

2) I do think theory of CS can be a bit too tied to certain traditional theoretical measures. One of the points
with the hashing scheme [DHKP97] is that
it gives a large constant factor in any reasonably realistic mathematical measure, e.g., taking into account that
w-bit multiplication is mod 2^w (discards overflow). This is not an experimental observation, but something that follows from the definition of standard programming languages like C. I do realize that the theory community does not care too much about details of the running times these days, but the hashing I mention is something of major impact, e.g., in the processing of high volume streams where the info is lost if not handled in time.; December 21, 2009 at 11:36 AM
Anonymous said...: The fastest 2 universal hashing algorithm appeared in stoc couple of years back: it runs in linear time. It is simple and elegant.... Though the ideas are nontrivial and use spielman codes. See the work of Ishai at al; December 22, 2009 at 2:09 AM
Jeremy Gibbons said...: It sounds to me like what the SODA community needs is something analogous to Richard Bird's "Functional Pearls", which favour nifty presentations, and explicitly discourage the "5 to 10 pages of complications". (Declaration of interest: I now edit this column.); December 22, 2009 at 4:59 PM
Mikkel Thorup said...: There has been several comments/questions concerning what is the fastest universal
hashing for strings etc.

Just for the reference, in section 5
of the paper below, I survey what
I think are the fastest methods known on real computers. I do believe that they out-perform most non-universal multiplication-free hacks,
both in speed and in quality, e.g.,
recently I have used them to replace
FNV in applications. Of course, they cannot be as fast as the most naive hacks, e.g., just using the least significant bits.

@INPROCEEDINGS{Tho09,
AUTHOR = {Mikkel Thorup},
TITLE = "String hashing for linear probing",
BOOKTITLE = {Proceedings of the 20th ACM-SIAM Symposium
on Discrete Algorithms (SODA)},
YEAR = {2009},
pages ="655--664",
}; December 23, 2009 at 9:36 AM
Anonymous said...: Instead of taking papers on this theme (which would, incidentally, be a great idea), perhaps the area could serve as the basis for a lighter afternoon entertainment session, providing cool stuff that one could take home and show students.

Let me play the contrarian here.

Just to be sure, I like what you call textbook algorithms. The hashing example is a great one. I think such papers should be accepted to SODA, and we should as a community probably appreciate such works more than we do.

That being said, I disagree with the idea of a separate "track" or afternoon session for this area. Where would the resources come from? Would you accept fewer papers in SODA to make room? Would you add even more tracks?

Book proofs are also hard to publish and we usually think more about them and try to use to them to generalize. Book algorithms combined with experimental comparison with known techniques would likely be accepted by SODA in the current setting. I do not support giving special treatment to one area, likely at the expense of others.

I think we need to make sure the PC has members who are qualified to judge such submissions (most theoreticians aren't). In the meantime, such submissions need to work a little harder and make compelling arguments and give empirical support to their claims of efficiency.; December 23, 2009 at 10:21 PM

My Biased Coin

Friday, December 18, 2009

Text-book Algorithms at SODA (Guest Post, Mikkel Thorup)

32 comments:

My Book on Randomized Algorithms

Blog Archive

Science/Tech Blogs I Read

Contributors

Legal Statement

Analytics