This article was originally published on Dev.to.
What's Big O?
Big O notation and time complexity are fundamental concepts in computer science.
Big O is a way of describing the efficiency of algorithms without getting too mired in the details. It describes how the time an algorithm takes to run (or the number of operations it needs) grows as the size of the input grows.
- Big O notation helps us answer the question, "How do our functions or algorithms behave/scale when the size of the inputs increases significantly?"
The idea here is that we care about differences of an order of magnitude. For example, given the same inputs, we don't really care if an algorithm runs for 100ms versus 105ms; we care if it runs for 100ms versus 10 seconds (a large, noticeable difference).
When measuring Big O, we keep only the important parts. For example, O(4 + 2n) can be simplified to O(n): we can drop 'minor details' such as the constant + 4 and even the coefficient 2, which don't make much of a difference when things are at a large scale.
I like to think of Big O as a tool in the back of my mind that helps me grasp the "Big Picture", giving an idea of how efficient the code or algorithms are.
Time complexity
Time complexity is a way of showing how the runtime of a function increases as the size of the input increases. It describes the amount of computer time it takes to run a function.
There are many different types of time complexity; here are some of the most common ones, each illustrated in the code sketch after this list.
- Constant time, O(1) - If the work takes the same fixed number of steps no matter how big the input is (e.g. there are no loops over the input), then the complexity is O(1).
- Linear time, O(n) - Loops such as for loops and while loops, anything that causes the runtime to grow proportionally to the input size. E.g. an array of 100 items results in 100 iterations.
- Quadratic time, O(n²) - Two nested loops over the same input. Similarly, if we have three nested loops, then the time complexity is cubic time, O(n³).
- Example algorithms with quadratic time: Bubble sort, Insertion sort
- Logarithmic time, O(log n) - When a divide-and-conquer strategy is used, it's said to be O(log n). In logarithmic time, the runtime grows ever more slowly as the input increases: doubling the input adds only about one more step.
- Example algorithms with logarithmic time: Binary search
- Factorial time, O(n!) - This is the most expensive one. It's what we get when, in effect, we add another nested loop for every element, e.g. when generating every permutation of the input.
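To make these concrete, here are some minimal sketches (the function names are mine, purely for illustration):

```js
// O(1) — constant time: one step, no loops.
function getFirstItem(arr) {
  return arr[0];
}

// O(n) — linear time: one loop over the input.
function sumAll(arr) {
  let total = 0;
  for (let i = 0; i < arr.length; i++) {
    total += arr[i];
  }
  return total;
}

// O(n²) — quadratic time: two nested loops over the same input.
function hasDuplicates(arr) {
  for (let i = 0; i < arr.length; i++) {
    for (let j = 0; j < arr.length; j++) {
      if (i !== j && arr[i] === arr[j]) return true;
    }
  }
  return false;
}

// O(log n) — logarithmic time: binary search halves the input each step.
function binarySearch(sortedArr, target) {
  let low = 0;
  let high = sortedArr.length - 1;
  while (low <= high) {
    const mid = Math.floor((low + high) / 2);
    if (sortedArr[mid] === target) return mid;
    if (sortedArr[mid] < target) low = mid + 1;
    else high = mid - 1;
  }
  return -1;
}
```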
There are some basic rules to remember when considering the Big O for an algorithm or code.
The Rule Book of Big O
- Worst Case
- Remove Constants
- Different Terms for Different Inputs
- Drop Non-Dominant Terms
Rule 1: Worst Case
Always consider the worst-case scenario. Even if a loop breaks early, it does not matter: we always take the Big O of the worst case. We can't assume things always go well, even though sometimes our function effectively finishes in O(1). As shown in the example below, sometimes the item we want is located at index 0 and we finish early, but the function is still considered O(n).
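A minimal sketch of the idea (the function name and data are mine, just for illustration):

```js
// Linear search: returns the index of the target, or -1 if not found.
function findIndex(arr, target) {
  for (let i = 0; i < arr.length; i++) {
    if (arr[i] === target) {
      return i; // May exit on the very first iteration...
    }
  }
  return -1;
}

// Best case: target is at index 0, so we finish after one step.
findIndex(["nemo", "dory", "marlin"], "nemo"); // 1 iteration
// Worst case: target is last (or absent), so we scan all n items.
findIndex(["dory", "marlin", "nemo"], "nemo"); // n iterations
// Big O takes the worst case, so this is O(n).
```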
Rule 2: Remove Constants
In this example, we create an input with a length we've defined (10) and pass it to the function. Inside the function, we create an array called meaningLessArr with a length based on the input argument. We have two console.log statements and a loop that runs for two times the length of the input. The variable assignment of meaningLessArr is ignored in this example, but it doesn't matter much because, in the end, our goal is to remove the constants.
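Based on that description, the function looks roughly like this (a sketch, not necessarily the exact original code; the counts in the comments follow the article's bookkeeping):

```js
function removeConstantsExample(input) {
  // Build an array the same length as the input — roughly n operations
  // (the variable assignment itself is ignored in the count).
  const meaningLessArr = Array(input.length).fill("meaningless"); // n

  console.log(input[0]);                // 1
  console.log(input[input.length - 1]); // 1

  // Loop for two times the length of the input — 2n operations.
  for (let i = 0; i < input.length * 2; i++) {
    console.log("looping!");
  }
  // Total: n + 2 + 2n = 3n + 2  →  O(3n + 2)
}

const input = Array(10).fill("hello"); // an input with a length we've defined (10)
removeConstantsExample(input);
```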
- O(3n + 2) is simplified to O(3n + 1). This is because O(any constant) is simplified to O(1). O(2) is simplified to O(1), O(100) → O(1), O(3333) → O(1), and so on.
- O(3n + 1) is then simplified to O(n + 1) by removing the coefficient. The key here is that whether it is 3n, 4n, or 5n, they are all linear; we can simplify them all to just n. We do not particularly care how steep the line is, we care about how it grows: linearly, quadratically, exponentially, and so on.
- And finally, it is simplified to O(n) after dropping the constant 1, as 1 does not have an effect when the input is large.
Rule 3: Different Terms for Different Inputs
When we have multiple inputs or multiple arguments, we give a unique term for each of them, as they are separate inputs with different sizes. In other words, the complexity depends on two independent factors. In the example below, n and m represent the sizes of two different inputs.
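A sketch of what such a function looks like (the names are mine):

```js
// Two separate inputs get two separate terms: O(n + m), not O(2n) or O(n).
function printBoth(arr1, arr2) {
  for (let i = 0; i < arr1.length; i++) { // n iterations
    console.log(arr1[i]);
  }
  for (let j = 0; j < arr2.length; j++) { // m iterations
    console.log(arr2[j]);
  }
}
```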
Let's look at another example with nested loops. We have two similar functions that do similar things. The difference is that makeTuples() takes one argument while makeTuplesTwo() takes two arguments. Thus, we can say that makeTuples() depends on one independent factor while makeTuplesTwo() depends on two independent factors, as sketched below.
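The names makeTuples() and makeTuplesTwo() are from the article; the bodies below are my reconstruction of what such functions would look like:

```js
// One input, nested over itself: n * n  →  O(n²)
function makeTuples(arr) {
  const tuples = [];
  for (let i = 0; i < arr.length; i++) {
    for (let j = 0; j < arr.length; j++) {
      tuples.push([arr[i], arr[j]]);
    }
  }
  return tuples;
}

// Two independent inputs: n * m  →  O(nm)
function makeTuplesTwo(arr1, arr2) {
  const tuples = [];
  for (let i = 0; i < arr1.length; i++) {
    for (let j = 0; j < arr2.length; j++) {
      tuples.push([arr1[i], arr2[j]]);
    }
  }
  return tuples;
}

makeTuples([1, 2, 3]);            // 9 tuples (3 * 3)
makeTuplesTwo([1, 2], [3, 4, 5]); // 6 tuples (2 * 3)
```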
Let's do a quick exercise! What's the Big O for the function below?
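Here's a sketch of such a function (the names are mine, just for illustration): one plain loop over the first input, followed by a nested pair over both inputs.

```js
function quickExercise(arr1, arr2) {
  // First loop: n operations
  for (let i = 0; i < arr1.length; i++) {
    console.log(arr1[i]);
  }
  // Nested loops over both inputs: n * m operations
  for (let i = 0; i < arr1.length; i++) {
    for (let j = 0; j < arr2.length; j++) {
      console.log(arr1[i], arr2[j]);
    }
  }
}
```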
The answer is O(n + nm)! Even better, we can simplify it to O(nm). By expressing O(n + nm) as O(n(1 + m)), we can now see the 1 + m, and 1 + m simplifies to just m because the constant 1 is dropped. Therefore, after the simplification, we get O(nm).
There are some great discussion threads about O(m + n) versus O(nm), and about the precise definition of Big O, that are worth looking up if you want to dive deeper.
Rule 4: Drop Non-Dominant Terms
Actually, if you understood the simplification from O(n + nm) to O(nm) in the exercise above, then you probably already understand this rule. It's basically the same idea.
Again, if we have something like O(n² + n), it can be simplified to O(n²) by dropping the + n.
Or we can imagine that when n is large, the + n term barely matters. In this case, n² is the dominant term, the big and important one, while + n is not. We ignore the little parts and focus on the big parts. An example is sketched below.
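For example (a sketch, assuming a single input array; the name is mine):

```js
// O(n² + n)  →  O(n²): the nested loops dominate the single loop.
function printPairsThenAll(arr) {
  // Nested loops: n * n = n² operations
  for (let i = 0; i < arr.length; i++) {
    for (let j = 0; j < arr.length; j++) {
      console.log(arr[i], arr[j]);
    }
  }
  // Single loop: + n operations
  for (let i = 0; i < arr.length; i++) {
    console.log(arr[i]);
  }
}
```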
For the equation 2x² + x + 30, let's try plugging in some numbers.
- Plug in 3, we get 18 + 3 + 30.
- Plug in 10, we get 200 + 10 + 30.
- Plug in 500, we get 500,000 + 500 + 30.
- Plug in 100,000, we get 20,000,000,000 + 100,000 + 30.
The Big O for this equation would be O(x²). Not only can we remove the constant and the coefficient by applying the rules we learned before, we can also drop the + x, as this term is not the 'big' one.
Essentially, x² is the term that contributes the huge gap, so we take it as the Big O.
Summary
- Big O does not matter much when inputs are not sufficiently large. If a function is written to only ever accept a small, fixed amount of data, then we don't particularly care about its time and space complexity. Also, because Big O hides constant factors, in some scenarios an O(n) function might actually run faster than an O(1) one for small inputs.
- Everything comes at a cost. Sometimes writing efficient code results in code that is hard to read, and vice versa. The goal is to strike a balance between code efficiency and readability, depending on problems and situations.
Thanks to all who read this post.