Communications of the ACM - Artificial Intelligence, January 9
We’re Reaping Just What We Sowed

This article examines the gap between theoretical computer science and practical computer engineering. Theoretical models often fail to capture the full complexity of real systems, producing a mismatch between theory and practice. Using sorting algorithms and network protocols as examples, it shows that a theoretically optimal algorithm may not be the best choice in practice, and that TCP, although unreliable in theory, is treated as "reliable" in practice. The article also discusses how the rise of generative AI deepens this mismatch, and calls for practical computer engineering to be distinguished from formal computer science, stressing the importance of understanding real-world complexity.

🧮 The abstract models of theoretical computer science may not capture the full complexity of real systems, creating a gap between theory and practice. For example, the theoretical complexity analysis of sorting diverges from the practical efficiency of Quicksort.

🌐 In computer networking, TCP is widely described as "reliable" even though the underlying network protocols are not. This "reliability" is a pragmatic definition meant to simplify application development, but it can obscure the remaining risk of errors and data corruption.

🤖 The emergence of generative AI deepens the mismatch between theory and practice: because it relies on statistical and randomized methods, full reliability cannot be guaranteed. Despite its errors, generative AI is considered "good enough" for some applications, much as we have come to call TCP "reliable."

🎓 Computer science education should distinguish formal computer science from practical computer engineering. The former helps us understand and analyze the latter, but may never fully predict its behavior. We should acknowledge and teach the complexity of practice rather than pursue theoretical simplicity alone.

Theoretical Computer Science is a glass slipper that has never quite fit the foot of practical Computer Engineering. The essence of their marriage is that theoreticians devise a formal model which captures some essential features of reality, and then apply reasoning about it to implementations. The problems occur when the models do not capture enough of the structure of real systems to explain and predict their behavior.

Some of the earliest issues in modeling computation arise in dealing with finiteness. The problem with finite computational models is that they are all equivalent in expressive power and also quite limited. This creates two issues: the inability to solve irregular problems on inputs of unbounded size (see the Pumping Lemma), and the possibility of highly specific effects that arise due to the precise numerical bound on size (as is found in finite group theory).

The idea of computing using an abstract machine model (for example, push-down automata and Turing machines) that can grow during the execution of an algorithm leads to a theory of computation that is quite rich. The implications of such models can apply to real-world computers, as long as resource utilization does not exceed their physical limitations. Even when those bounds are reached, there is still the question of what could in future be computed on machines of ever-greater size and speed. However, when even futuristic physical limitations and issues like power consumption are addressed, the correspondence between the infinitary models and reality starts to fray.

A widely understood example of this divergence can be found in the application of the theory of algorithmic complexity to sorting. The classical analysis of sorting yields the well-known result that, in the worst case, the number of comparison or data movement operations required to sort a list of N integers is bounded by some constant multiple of N*log(N). However, the most commonly used algorithm, Quicksort, requires quadratic resources in the worst case (meaning some multiple of N²). A multiple of N² is greater than any multiple of N*log(N) for large-enough N, so in the unbounded theory of complexity, Quicksort is considered "less efficient" than any number of algorithms, such as Merge Sort or Heapsort, that can be implemented using at most a multiple of N*log(N) resources.
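To make the gap concrete, here is a minimal Python sketch (my illustration, not the author's example) that counts partition comparisons for a naive Quicksort that always pivots on the first element. Already-sorted input triggers the quadratic worst case, while shuffled input stays close to N*log(N).

```python
import random

def quicksort_comparisons(items):
    """Count comparisons made by a naive Quicksort that always
    pivots on the first element of each sublist."""
    comparisons = 0
    stack = [items]  # explicit stack avoids deep recursion in the unbalanced worst case
    while stack:
        sub = stack.pop()
        if len(sub) <= 1:
            continue
        pivot, smaller, larger = sub[0], [], []
        for x in sub[1:]:
            comparisons += 1
            (smaller if x < pivot else larger).append(x)
        stack.append(smaller)
        stack.append(larger)
    return comparisons

n = 2000
print("already sorted:", quicksort_comparisons(list(range(n))), "comparisons")      # roughly n*n/2
print("shuffled:      ", quicksort_comparisons(random.sample(range(n), n)), "comparisons")  # roughly n*log2(n)
```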

There are many reasons for this mismatch between theory and practice, including:

- The pattern of memory and register usage in Quicksort means that it is extremely efficient on moderately sized datasets (locality).
- In many realistic applications, it is unlikely that the worst-case scenario of maximum resource utilization for Quicksort will occur, and in some cases Quicksort is able to terminate very quickly (a characteristic known as "early stopping").
- In practice, many sorting utilities evaluate the nature of their input and adaptively use a number of different algorithms as alternatives or in concert to try to obtain the best result. This includes the use of Bubble Sort, an algorithm which has terrible worst-case efficiency but is well adapted to very small datasets. (A simplified sketch of this adaptive strategy follows this list.)
- When datasets must be moved from secondary storage or across a communication network in order to be sorted, the resources required for data movement (which is linear, a multiple of N) may in practice overshadow those taken for sorting.
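As a hypothetical illustration of that adaptive strategy, the sketch below switches to a simple quadratic algorithm (Insertion Sort is used here for the small-input fallback) below a small cutoff and uses Merge Sort above it; production routines such as Timsort and introsort apply the same idea with more elaborate heuristics.

```python
CUTOFF = 32  # illustrative threshold; real libraries tune this empirically

def insertion_sort(items):
    """Quadratic in the worst case, but fast on tiny inputs
    thanks to its simple inner loop and good locality."""
    for i in range(1, len(items)):
        key, j = items[i], i - 1
        while j >= 0 and items[j] > key:
            items[j + 1] = items[j]
            j -= 1
        items[j + 1] = key
    return items

def hybrid_sort(items):
    """Merge Sort for large inputs, Insertion Sort for small subproblems."""
    if len(items) <= CUTOFF:
        return insertion_sort(items)
    mid = len(items) // 2
    left, right = hybrid_sort(items[:mid]), hybrid_sort(items[mid:])
    # Standard merge of two sorted halves.
    merged, i, j = [], 0, 0
    while i < len(left) and j < len(right):
        if left[i] <= right[j]:
            merged.append(left[i]); i += 1
        else:
            merged.append(right[j]); j += 1
    return merged + left[i:] + right[j:]
```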

In spite of the complexity of this reality, the abstract theory of sorting is still taught because of its simplicity and power. It is still common to teach students that algorithms with high resource utilization on unbounded inputs should be avoided, or may be impossible to use in practice. We then rely on their unlearning these early lessons either in practical data management classes or on the job. The truth is considered too complex and does not lend itself to abstract proofs, a skill we are eager to teach.

I came across a more current example of such a mismatch when teaching an undergraduate Computer Networking course. One of the fundamental lessons of Network Architecture is that packet networks are not reliable: packets sent may be lost, may be corrupted in transit, may be delayed without bound, and may be misdelivered. There is also a theorem of distributed systems which tells us it is impossible to use an unreliable network to implement one that is completely reliable. But Computer Networking textbooks universally describe the Internet's Transmission Control Protocol (TCP) as "reliable."

The truth is complex. TCP implements a different error model than the Internet Protocol, using a number of tools including checksums, sequence numbering, and retransmission to reduce the probability of data corruption or misdelivery. TCP also introduces new error modes (such as a “broken connection”) which do not exist in the Internet’s underlying packet delivery model. Together, these are thought to reduce the probability of error to the point that most common applications are sufficiently stable, and this is considered a usable definition of “reliable.”
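The 16-bit checksum is weak in a very specific way. The following Python sketch (an illustration in the style of the RFC 1071 Internet checksum, not TCP's actual code path) shows a pair of compensating single-bit flips that leave the checksum unchanged and would therefore pass undetected.

```python
def internet_checksum(data: bytes) -> int:
    """16-bit ones'-complement checksum in the style TCP uses (RFC 1071)."""
    if len(data) % 2:
        data += b"\x00"                            # pad odd-length input
    total = 0
    for i in range(0, len(data), 2):
        total += (data[i] << 8) | data[i + 1]
        total = (total & 0xFFFF) + (total >> 16)   # end-around carry
    return ~total & 0xFFFF

original  = b"\x12\x34\x00\x01 some payload"
# Two compensating single-bit flips: the first 16-bit word goes up by one
# and the second goes down by one, so the ones'-complement sum is unchanged.
corrupted = b"\x12\x35\x00\x00 some payload"

print(hex(internet_checksum(original)), hex(internet_checksum(corrupted)))  # identical values
```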

Application developers regularly use TCP as if it were completely reliable, although its 16-bit checksum makes “silent data corruption” highly probable in sufficiently large data transfers. Developers working in application areas such as scientific computing, where such errors are considered unacceptable, understand the issue and use longer end-to-end file hashes and retransmission to overcome errors. However, in fields such as multimedia streaming where occasional errors are thought to be acceptable, it is common to simply consider TCP to be “reliable enough.” The same problems haunt storage and computation at large scale, and are only addressed when the mismatch causes unacceptable application failures. It is worth noting that, because of the highly “discontinuous” nature of programmable systems, unanticipated errors can be the cause of massive security failures or seemingly unrelated aberrant system behavior.
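For applications that cannot tolerate silent corruption, the end-to-end pattern the author describes looks roughly like the sketch below (hypothetical names; `fetch` stands for whatever TCP-based transfer the application already performs): a strong whole-file hash is checked after the transfer, and the transfer is repeated on a mismatch.

```python
import hashlib

def sha256_of_file(path: str, chunk_size: int = 1 << 20) -> str:
    """Hash the file in chunks so large transfers need not fit in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        while chunk := f.read(chunk_size):
            digest.update(chunk)
    return digest.hexdigest()

def verified_transfer(fetch, path: str, expected_sha256: str, retries: int = 3) -> None:
    """Repeat the transfer until the end-to-end hash matches, or give up."""
    for attempt in range(1, retries + 1):
        fetch(path)                                   # caller-supplied download over TCP
        if sha256_of_file(path) == expected_sha256:
            return
        print(f"integrity check failed on attempt {attempt}; retransferring")
    raise IOError("transfer failed end-to-end integrity check")
```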

Today we are faced with Generative AI, a new subfield of computer engineering that offers us the use of trained behaviors known to be error-prone. Generative AI may be considered "reliable enough" for some applications, at least initially. The relationship between Generative AI and the real world is unknown, because of the use of statistical and randomized methods. The application of more reliable guardrails requires the use of classical AI tools based on formal logic. These are not keeping up with the prolific ability of Generative AI to copy and guess.

The defense of Generative AI that it is “good enough” to be used  in a world already filled with human and machine error echoes the universal response of my networking students that my objection to calling TCP “reliable” is pedantic and old-fashioned. They say the word “reliable” has simply been given a new meaning. Or that “everyone knows” that the “reliable” mechanism must be checked (although this fact is not taught, and is not widely understood). Mainly, they want to keep the subject matter simple and understandable so they can learn to program using an intuitive API, get a grade, and a job. And the writers of textbooks and the developers of modern university curricula seem to agree with them.

So please, everyone who reacts in horror to the acceptance of “hallucinations” and other errors in generative AI, stop clutching your pearls. We have developed a field (computer science) based on the cynical acceptance of a mismatch between theory and practice. We have done this in order to validate our own education and the history of the field to date.

It is important that we teach practical computer engineering as a field separate from formal computer science. The latter can help in the understanding and analysis of the former, but may never model it well enough to be predictive in the way the physical sciences are.

Micah D. Beck (mbeck@utk.edu) is an associate professor at the Department of Electrical Engineering and Computer Science, University of Tennessee, Knoxville, TN, USA.
