Smooth Infinitesimal Analysis posits an infinitesimal such that the following statements are both true:

This is a very promising idea, but the way they pose it leans on ignoring the law of excluded middle, which is a little unsettling. I propose a different interpretation of the same definitions. In geometric algebra, squaring erases direction, as seen in how the basis vectors square:

In light of this, I think there should be a different infinitesimal oriented along each basis vector, such that:

Now, one can see clearly why the infinitesimal is not zero. It's because of the direction, not the magnitude. Squaring erases the direction, giving us the pristine scalar 0. A result of this is all higher power are also 0, because they all contain a square that eats the rest:

This property will prove most useful later.

Addendum: In the rest of this piece, even though geometric algebra literature typically uses numerical subindices, we will use the subindices x and y to indicate orientation along the x and y axes, respectively.

Directional Mathematics

The math we are all most used to is the mathematics of scalars; quantities with magnitudes but no direction. Calculus, understood properly, requires us to consider the mathematics of direction without magnitude. We can construct:

As a statement about directions, this tells us about the directional sum of the x and y axes. We can also subtract them like so:

In fact, any linear combination is expressible:

However, do not be fooled. Since this is only the mathematics of direction, equivalent ratios of a and b represent the same direction.

As such, it is generally more useful to express directions relative to a reference direction. Let's choose the x axis.

When we isolate x/x, the direction cancels out, leaving:

This the form we will expect of our derivatives going forward.

We can recover ε_y by composing the relative direction with the reference direction like so:

This will prove useful later.


Derivatives

The goal of a derivative is to find the direction of motion at a specific point. We represent this as:

To see how this construction makes derivatives easy, let's try taking a simple derivative with this construction.

By inserting ε_x, we can probe our function, and find our derivative by isolating ε_x on the other side.

We can delete the squared infinitesimal because it squares to zero. Defining the infinitesimal along f and subtracting f(x) from both sides:

And the final trick:

There we have it! The derivative, achieved with only our oriented zero. Note that this is not scalar division. This is a linear ratio of directions, expressing direction as a function of x. Just what we were looking for!

Another one

Let's try a cubic.

This time, two terms are deleted for being zero.

Power Rule

Indeed, this will work for any power of x.

Lucky for us, the remaining terms of the binomial expansion delete themselves because they all have higher powers of ε_x, no matter how big n is. Then:

This is our power rule at full generality.

Sum Rule

The sum of two arbitrary functions, coming right up.

Assuming g and h are differentiable:

Simple, right?

Product Rule

This one is a little more interesting.

Here is the fun part. We can rearrange the following core identity for a shortcut:

Substituting in our product function:

Via binomial expansion:

The two infinitesimals multiply to zero via direction erasure. Subtracting the original product:

Eh voila!

Chain Rule

Last but not least, the one for compositions!

We have to be careful with this one. The infinitesimal only enters h directly:

Using our nifty shortcut again:

We now have an interesting problem. How to we handle ε_h inside of g? Well, let's pretend h is not a function for a moment. Our shortcut applies again!

Of course, h is still a function, and we have to add the dependence on x back in. The infinitesimal on g is with respect to h, not x, so we make the ε_h explicit:

Subtracting the original function as usual:

Finally,

We arrive at our derivative, which all together says a change in x causes a change in h causes a change in g. This works multiplicatively because the local change is linear.


Coming Soon

That's about it for the foundations! The next installments to look forward to will cover integration, Taylor series, and deriving the trigonometric functions via Euler's formula as a definition.