Home
Chapters

Generic expressions are represented by <something> (e.g., <function> or <operator>). This is just notation, and the symbols < and > should not be misconstrued as Julia's syntax.

Action	Keyboard Shortcut
Previous Section	`Ctrl + 🠘`
Next Section	`Ctrl + 🠚`
List of Sections	`Ctrl + z`
List of Subsections	`Ctrl + x`
Close Any Popped Up Window (like this one)	`Esc`
Open All Codes and Outputs in a Post	`Alt + 🠛`
Close All Codes and Outputs in a Post	`Alt + 🠙`

When benchmarking, the equivalence of time measures is as follows.

Unit	Acronym	Measure in Seconds
Seconds	s	1
Milliseconds	ms	10^-3
Microseconds	μs	10^-6
Nanoseconds	ns	10^-9

Links
Dark Mode (Experimental)

Personal

Website

9c. Pre-Allocations

Martin Alfaro

PhD in Economics

PART II: HIGH PERFORMANCE
7. Introduction to Performance

a. Overview and Goals

b. When To Optimize Code?

c. Benchmarking Execution Time

d. Preliminaries on Types

e. Functions: Type Inference and Multiple Dispatch

f. Preliminaries on Memory Allocations

8. Type Stability

a. Overview and Goals

b. Defining Type Stability

c. Type Stability with Scalars and Vectors

d. Type Stability with Global Variables

e. Barrier Functions

f. Type Stability with Tuples

g. Type Stability with Higher-Order Functions

h. Gotchas for Type Stability

9. Reducing Memory Allocations

a. Objects Allocating Memory

b. Slice Views to Reduce Allocations

c. Pre-Allocations

d. Reductions

e. Tuples and Static Vectors for Small Collections

f. Lazy Operations

g. Lazy Broadcasting and Loop Fusion

Introduction

When working with repeated vector computations, such as those that arising in for-loops, memory allocation can become a major performance bottleneck. This is because memory allocation is a costly operation that involves searching for free memory, tracking memory information, and freeing unused memory (a process known as garbage collection). While the cost of an allocation in isolation may not be substantial, incurring it repeatedly can significantly impact performance.

To address this issue, we can employ a strategy known as pre-allocation. This involves reserving a block of memory, which is then reused multiple times during the execution of a program. The approach is particularly relevant for intermediate results that don't need to be preserved across iterations or stored for future use.

In practical terms, it requires initializing vectors before running the for-loop, which are then mutated in each iteration. By doing so, we avoid the overhead of creating new objects multiple times.

The performance gains from pre-allocation can be substantial. In fact, this technique is a common optimization strategy applicable to multiple programming languages beyond Julia. Ultimately, its effectiveness relies on minimizing the use of the heap.

The section begins by reviewing methods to initialize vectors, which constitutes a prerequisite for pre-allocation. We then present two scenarios where pre-allocation proves advantageous, including one that highlights its benefits for nested for-loops.

Initializing Vectors

Vector initialization refers to the creation of a vector that will be subsequently filled with values. The process involves two steps: reserving the space in memory and populating it with some initial values. An efficient way to initialize a vector is by only performing the first step, keeping whatever content is held in the memory address at that moment. Values with such a feature are referred in Julia as undef.

There are two methods for initializing a vector with undef values. The first requires specifying the type and length of the array, and its syntax resembles the creation of new vectors. The second is based on the function similar(y), which creates a vector with the same type and dimensions as another existing vector y.

Below, we compare the performance of approaches to initializing a vector. In particular, we show that working with undef values is faster than setting specific values. To starkly show these differences, we create a vector with 100 elements and repeat the procedure 100,000 times.

x           = collect(1:100)
repetitions = 100_000                       # repetitions in a for-loop

function foo(x, repetitions)
    for _ in 1:repetitions
        Vector{Int64}(undef, length(x))
    end
end

Output in REPL

julia>