A blog about software and making.

The Algorithm Design Manual

Two books in one!

  • A guide on how to approach solving problems.
  • An Encyclopedia of algorithms and data structures.

If you want a sense of ‘what’s out there’ in the world of algorithms and data structures then this is the book for you. It doesn’t go deep into any one topic but it does give you a sense of what problems a given algorithm might solve and some of the trade-offs to consider when applying it.

Have a few minutes to burn? Just go to a random page and I guarantee you will learn something new.

Goodreads

Algorithms in a Nutshell Review

For a desktop reference book, this is a surprisingly fun read. It gives a broad overview of different algorithms focusing on when you would want to use them and any tradeoffs compared to other solutions. The book’s small size is convenient if you want something to read on the plane. It also has one of the best chapters on network flow problems I’ve come across.

Goodreads

Chapter Quotes & Nodes

  • Preface
  1. Algorithms Matter
    • Start with the big picture: understand the problem, identify potential causes, and then dig into the details. If you decide to try to solve the problem because you think you know the cause, you may solve the wrong problem.
    • Various algorithms have “sweet spots” in which their performance has no equal and designers can take advantage of specific information about the problem to improve performance.
  2. The Mathematics of Algorithms
    • Logarithmic algorithms are extremely efficient because they rapidly converge on a solution. In general, these algorithms succeed because they reduce the size of the problem by about half each time.
    • Not every operation can be optimized; in fact, optimizing one operation may degrade the execution of another operation.
  3. Patterns and Domains
    • Patterns are a great way to communicate precisely and concisely well-formed concepts.
    • A pattern is not a template where you simply fill in the blanks (except the template method ☺). It is an approach, or a plan, for solving a particular class of problems.
    • A test suite is composed of a set of k individual trials (typically k≥10). The best and worse performers are discarded as outliers, the remaining k-2 trials are aggregated, and the average and standard deviations are computed.
    • Domains are application areas that share common traits. Each domain has its own vocabulary that provides a language to describe the domain.
    • a floating-point number is a finite representation that is designed to approximate a real number whose representation may be infinite.
    • The most common way of describing floating-point error is to use the term relative error, which computes the ratio of the absolute error (desired - approximation) with the desired value.
    • The stack grows “downward” and the heap grows upward … if the stack grows too large, a program crashes because the memory for the individual stack frames will overwrite memory that should be safely protected in the heap.
    • Dynamic types languages are often interpreted and variable values are known only at runtime, and therefore cannot be checked statically.
  4. Sorting Algorithms
    • Pointer-based storage - A contiguous array of information contains pointers to the actual information rather than storing the information itself.
    • Value-based storage - Packs a collection of n elements into record blocks of a fixed size.
    • Information is written to secondary storage usually a value-based contiguous collection of bytes.
    • total ordering - For any two elements p and q in a collection, exactly one of the following three predicates is true: p=q, pq.
    • No algorithm that sorts by comparing elements can do better than O(n log n) performance in the average or worst case.
    • Insertion sort
      • Use when you have a small number of elements to sort or the elements in the initial collection are already “nearly sorted”.
      • Inefficient for value-based data because of the amount of memory that must be shifted to make room for a new value.
    • Quick Sort
      • Use when you want a good average-case result*
      • Further computation to identify the proper pivot (beyond median-of-3/5) is rarely able to provide beneficial results because of the incurred costs.
    • Counting Sort
      • Can only be used in limited situations because of the constraints that the elements in the array being sorted are drawn from a limited set of k elements.
    • Bucket Sort
      • Use when the elements are drawn from a dense universe.
      • The fastest sort when the elements to be sorted can be uniformly partitioned using a fast hashing function.
      • Reduces its processing costs at the expense of extra space.
      • Data must be uniformly distributed, so it’s evenly partitioned between buckets and an ordered hash function must be used so all elements in bucket bi are lexicographically smaller than all elements in bucket bi+1.
      • Once all the elements are sorted into buckets the values can be extracted in sorted order using insertion sort on each bucket.
  5. Searching
    • Sequential Search
      • If the predominant result of a sequential search is false, you may want to consider a different search algorithm, even when the collection is relatively small.
      • Move element up/to the front on a successful sequential search to exploit an increased likelihood that the item will be searched for again. Like Most-Recently-Used (MRU).
    • Binary Search
      • Using secondary storage, the time required to search for an element is dominated by the costs to access the storage.
      • Can incrementally search for a word as it is being typed.
    • Hash-based Search
      • Use a hash function to transform one or more characteristics of the searched-for item into a value that is used to index into an indexed hash table.
      • A poorly designed hash function will leave many of the slots in hash table empty (wasting space) and there will be many collisions where keys map to the same slot (bad performance).
      • Java’s .hashCode() tries to be efficient and caches the value of the computed hash to avoid recomputation.
      • Perfect hash function guarantees no collisions for a specific set of keys.
      • The size of A is typically chosen to be a prime number but we can use any number when we are not using open addressing.
      • As the load factor goes down, the average length of each slot’s linked list also goes down improving performance.
      • Open addressing reduces storage overhead, such as pointers to the next element in a list of collisions.
      • Perfect hashing uses a standard hash function to index into a primary table, A. Each slot, A[i], points to a smaller secondary hash table, Si, that has an associated hash function hi. The selection of appropriate hash functions guarantees that there are no collisions in the secondary tables giving O(1) performance.
    • Binary Tree Search
      • Prefer over arrays when the underlying data in the search set changes (lots of inserts/deletes) frequently.
      • Prefer over hash-based solution when the set size is unknown, the set is highly dynamic (lots of inserts/deletes), and you want to traverse the data in ascending/descending order.
      • B-tree variation that minimizes the number of disk accesses.
  6. Graph Algorithms
    • Adjacency matrices
      • Fixed upper limit beyond which no matrix can be constructed with the available memory.
      • Unsuitable when there are multiple relationships between a pair of elements. We end up storing a list at each matrix element which is nearly an adjacently list.
    • Adjacently list
      • Vertices could be stored in sorted order to enable rapid failure when searching if an edge exists.
    • Breadth First Search
      • Guaranteed to find the shortest path
      • Stores “to be visited” vertices so there may be a non-trivial amount of storage space required for very large graphs.
    • Dijkstra’s algorithm - Single source shortest path
      • Runs forever if a negative edge weight exists.
      • Uses priority queue to keep track of the vertice with the shortest distance for the current path.
    • The most reliable path is the path with the greatest value when each step on the path’s probability of success is multiplied together. Convert to the log probability of success so they can be added instead.
    • Bellman-Ford - Single source shortest path
      • Avoid using for dense graphs or it degenerates to O(V3)
      • Works with negative edge weights as long as no negative cycle exists.
      • Detect negative edge weight by executing primary loop one extra time and looking for changes in values.
    • Floyd-Warshall - All pairs shortest paths
      • To detect negative edge weights scan the diagonal (i==j) at the end and any -ve values indicate that a -ve cycle exists.
      • Only need to keep track of matrix of subproblems because we are concerned with the total distance, not the path that involves the fewest number of vertices.
    • Prim’s Algorithm - Minimum spanning tree
      • Intuition is that the edge(u,v) with the lowest weight between the current minimum subgraph and the remaining elements must belong to the minimum spanning tree.
  7. Path Finding in AI
    • Construct a Game tree to find a path from the current game state to some future game state that ensures victory or a draw.
    • Search tree can be greatly reduced by detecting and eliminating identical states that may simply be rotated or reflected.
    • A* Search
      • Variation of Dijkstra’s algorithm that directs search with heuristic information when approximate answers are acceptable.
  8. Network Flow Algorithms
    • You could apply linear programming to network flow problems but specialized algorithms with outperform it by several orders of magnitude.
  9. Computational Geometry
  10. When All Else Fails
  11. Epilogue
    • Know your data - without specific knowledge of your data, it is only possible to recommend algorithms in the most general way.
    • Decompose the problem into smaller problems
    • Choose the right data structure
    • Add storage to increase performance - Many algorithms are optimizied by storing information that reflects the results of past computations.
    • If not solution is evident, construct a search - Convert the problem into a search over a very large graph.
    • If no solution is evident, reduce your problem to another problem that has a solution
    • Writing algorithms is hard – testing algorithms is harder

Odds & Ends - June 2015

Thoughts, terms, and ideas I’ve come across over the last few months.

  • Attention is the currency of our age
  • Innovation is found at the boundaries between disciplines, not by narrowly focusing in one sphere.
  • Many developers derive pleasure from removing bugs that they should not have created in the first place. Instead, structure code so it correct the first time and make it as easy to debug as possible.
  • Losing 20% of your data is worse than losing 100% because you don’t know what you have lost.
  • Aside from making reasonable algorithm choices, cache misses are the main thing you need to worry about for performance. Compared to cache misses, minor efficiency improvements just don’t matter.
  • If you are waiting for the network it probably doesn’t matter how slow your network is.
  • Idiomatic code: what the language wants you to do.
  • If a language puts all your allocations on the heap it’s likely causing cache misses since you can’t decide how objects are organized in the heap.
  • Each reference is 8 bytes on a 64-bit machine.
  • Tell don’t ask - Rather than asking an object for data and acting on it, just tell the object what to do. Let the object make its own decisions based on its own data.
  • Interfaces break dependencies.
    • Runtime dependency - Flow of control leaves one module and enters another.
    • Source code dependency - A name declared in one module appears in another module.
    • If ModuleA.MethodA() calls ModuleB.MethodB() then ModuleA needs ModuleB to compile (source + runtime).
    • If ModuleA.MethodA() calls InterfaceB.MethodB() then ModuleB need not exist to compile A (source only).
  • Acceptable Response Times
    • Loading - 1000mn - A splash screen helps
    • Respond to finger down - 100ms
      • Users notice less than 100-150ms
      • Remember that we may schedule a response but the engine may not get to it right away!
      • Try for 100ms as it will be increased by scheduler delays.
    • Animation chunks - 6ms
      • 60Hz so all work must take ~16ms (4ms for OS so 6ms left for work).
      • Work must be schedulable in the 6ms available!
    • Idle/Cleanup - 50ms
  • Trees
    • Every path is a tree.
    • Trees are allowed to branch.
    • A tree is connected and n vertices have n-1 edges.
    • Trees have unique chains of edges joining two vertices
    • [A-B-C-D] Tree
    • [A-B] [C-D] A forest containing two trees
    • [A]→[B] Not a tree because there are 2 paths from A to D.
      ↓ ↓
      [C]→[D]
  • A chair is a resource. A teammate is a human being.
  • Projections - A piece of code that takes a series of events and produces a transient state from them.
  • React - state flows down via properties and events flow up via callbacks.
  • You only get one shot at encoding facts into a type system, and once you do you are stuck with them, so don’t try to encode too much, and make sure what you encode is fundamental (won’t change) about your business domain.
  • ECMAScript Debugging tips: console.Table([1,2,3], [2,3,4]); object.observe(foo); console.trace(foo);
  • Algorithm Correctness - Does it do what it is supposed to do?
  • Algorithm Efficiency - Does it have a run-time that is polynomially bounded?
  • Types of Equality - Reference equality, Identity Equality (same pk in DB), Logical Equality (value objects with identity fields)
  • A hash is a fingerprint for data. It takes data of any length and gives you a small fixed sized identifier that you can use to compare or identify the data.

Algorithms Design and Analysis - Part 2

  • Tim Roughgarden does an amazing job explaining how different algorithms work and the problems they are trying to solve.
  • Great graph section covering different minimum spanning tree and shortest path algorithms.
  • The best explanation of dynamic programming I’ve seen so far.
  • The catchphrase ‘Can we do better?’ sticks in your head long after the course is over.

Certificate
Course Link