
8g. Type Stability with Higher-Order Functions

Code Script
This section's scripts are available here, under the name allCode.jl. They've been tested under Julia 1.11.8.

Introduction

Functions in Julia are first-class objects, a concept also referred to as first-class citizens. This means that functions can be handled just like any other value: we can define vectors of functions, write functions whose outputs are other functions, and compose them in flexible ways that would be impossible if functions were treated differently from other values.

In particular, the property makes it possible to define higher-order functions, which are functions that take another function as an argument. We've already worked with several of them, often in the form of anonymous functions passed as function arguments. A familiar example is map(<function>, <collection>), which applies <function> to every element of <collection>.
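As a quick illustration of both ideas, the sketch below (with example names of our choosing) stores functions in a vector and passes an anonymous function to map.

```julia
# functions can be stored in collections, just like any other value
transformations = [abs, sqrt, x -> x^2]

# apply each stored function to the same input
[g(4.0) for g in transformations]     # [4.0, 2.0, 16.0]

# map is a higher-order function: it takes a function as its first argument
map(x -> 2x, [1, 2, 3])               # [2, 4, 6]
```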

Our goal in this section is to understand when higher-order functions are type-stable. As we'll discover, these functions present some challenges in this regard.

Remark
Throughout the explanations, we'll often refer to the function passed as an argument as the callback function.

The Issue

In Julia, each function has its own unique concrete type. In turn, each of these concrete types is a subtype of an abstract type called Function. The type Function serves as an umbrella for all possible functions defined in Julia.
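We can verify this in the REPL. The sketch below shows that each function carries its own concrete type, and that all of these types sit under the abstract type Function.

```julia
# each function has its own concrete type...
typeof(abs)                     # typeof(abs)
typeof(sin) == typeof(abs)      # false: distinct functions, distinct types
isconcretetype(typeof(abs))     # true

# ...and every such type is a subtype of the abstract type Function
typeof(abs) <: Function         # true
```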

This design of functions creates challenges when the compiler attempts to specialize the computation method of higher-order functions. In particular, if Julia were to generate a distinct specialized method for every possible callback function, the number of methods could grow explosively.

To address this issue, Julia adopts a conservative strategy, often choosing not to specialize the methods of higher-order functions. In particular, we'll see that Julia avoids specialization if the callback function isn't explicitly called. When specialization is skipped, the performance can drop sharply, sometimes resembling the overhead of executing code in global scope.

It's therefore important to recognize when specialization is inhibited and to monitor its consequences. If you notice that performance is severely impaired, there are still ways to enforce specialization. In this section, we'll explore these strategies.

An Example of No Specialization

To understand when higher-order functions fail to specialize, let's look at a simple example: summing the transformed elements of a vector x. The only constraint we impose is that the transformation itself must remain generic, allowing us to apply different functions for the transformation.

To express this, we define a higher-order function foo, which broadcasts some function f over x to transform its elements. To demonstrate how foo works, we call it with abs as the transforming function, so that each element is mapped to its absolute value.

x         = rand(100)

foo(f, x) = f.(x)
Output in REPL
julia>
@code_warntype foo(abs,x)

Even though foo(abs,x) isn't specialized, @code_warntype fails to detect any type-stability issue. This happens because @code_warntype evaluates type stability under the assumption that specialization is attempted. In our example, this assumption simply doesn't hold, and so @code_warntype is of no help.

The source of type instability in this case is that Julia avoids specialization if a callback function isn't explicitly called inside the function. In the example, the function f only enters foo as an argument of broadcasting, but there's no explicit line calling f.

To gather indirect evidence about the lack of specialization, we can compare the performance of the original foo function with a modified version that explicitly calls f.

x = rand(100)


function foo(f, x)
    f.(x)
end
Output in REPL
julia>
foo(abs, x)
100-element Vector{Float64}:
 0.9063
 0.443494
 ⋮
 0.121148
 0.20453

julia>
@btime foo(abs, $x)
  847.536 ns (12 allocations: 1.250 KiB)
x = rand(100)

function foo(f, x)
    f(1)                # irrelevant computation to force specialization
    f.(x)
end
Output in REPL
julia>
foo(abs, x)
100-element Vector{Float64}:
 0.9063
 0.443494
 ⋮
 0.121148
 0.20453

julia>
@btime foo(abs, $x)
  28.991 ns (2 allocations: 928 bytes)

The comparison reveals a significant decrease in execution time when f(1) is added, as well as a notable reduction in memory allocations. Excessive allocations are often indicative of type instability.

Forcing Specialization

Warning!
Exercise caution when forcing specialization. Overly aggressive specialization can degrade performance severely, which is why Julia's default approach is deliberately conservative. In particular, you should avoid forcing specialization when your script repeatedly calls a higher-order function with many unique functions. (For discussions of excessive specialization, see here and here.)

Explicitly calling the callback function to force specialization isn't ideal, as it introduces an unnecessary computation. Fortunately, alternative solutions exist. One of them is to type-annotate f with a type parameter (f::F combined with where F), which signals Julia to compile a specialized method for each function type.

Another solution involves wrapping the function in a tuple before passing it as an argument. This makes the function's type visible to the compiler, since a tuple's type records the concrete type of each of its elements.
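The reason tuples help can be seen from their types. A minimal sketch: wrapping abs in a one-element tuple produces a value whose type embeds typeof(abs), so the function's identity travels with the argument's type.

```julia
f_tup = (abs,)

# the tuple's type records the function's concrete type
typeof(f_tup)                           # Tuple{typeof(abs)}
typeof(f_tup) == Tuple{typeof(abs)}     # true

# the function is recovered by indexing, exactly as in foo below
f_tup[1](-3)                            # 3
```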

Below, we outline both approaches.

x     = rand(100)


function foo(f, x)    
    f.(x)
end
Output in REPL
julia>
foo(abs, x)
100-element Vector{Float64}:
 0.9063
 0.443494
 ⋮
 0.121148
 0.20453

julia>
@btime foo(abs, $x)
  840.536 ns (12 allocations: 1.250 KiB)
x     = rand(100)


function foo(f::F, x) where F
    f.(x)
end
Output in REPL
julia>
foo(abs, x)
100-element Vector{Float64}:
 0.9063
 0.443494
 ⋮
 0.121148
 0.20453

julia>
@btime foo(abs, $x)
  44.730 ns (2 allocations: 928 bytes)
x     = rand(100)
f_tup = (abs,)

function foo(f_tup, x)
    f_tup[1].(x)    
end
Output in REPL
julia>
foo(f_tup, x)
100-element Vector{Float64}:
 0.9063
 0.443494
 ⋮
 0.121148
 0.20453

julia>
@btime foo($f_tup, $x)
  47.829 ns (2 allocations: 928 bytes)