Implementation

The previous version of the August syntax has a lot of problems.

Using string literals for commands (without an escape system btw) is a crime that is unforgivable. The previous parse was handwritten and the output system sucked.

This is a document describing the new model, design and syntax for August.

Let's make some breaking changes.

Model

August introduces the concept of a Unit, removing the distinction between Tasks and Command Definitions. Units sequentially execute commands which are defined internally.

Some of these commands have special properties like being able to modify the Unit context and call other Units.

flowchart LR
    S[Script]
    subgraph Unit
        C1[Command]
        C2[Command]
        C3[Command]
        C1 --> C2 --> C3
    end    

    U1[Unit]
    U2[Unit]

    S --> U1
    S --> U2
    S --> Unit

Syntax

// Comment
expose Build as build

unit Build {
    meta(
        @name "Build Task"
        @description ""
    )

    depends_on(Test)

    do(OtherUnit)

    exec(cargo build --release)
}

depends_on hoists its tasks and can execute them in any order like Task task:[test, build].

flowchart TD
    Build --> Test
    Build --> Format
    Test --> Format
    Build --> Clean

depends_on will only run each unit once per execution. In the above example, Build's could execute its dependencies by completing Format and Clean standalone, then Test without running Format again. It could also complete Test first, which would complete Format and then just do Clean. do can run other units, ignores the once property and replaces command definitions.

Lexing

enum Token {
    String(String),
    Ident(String),
    RawIdent(String),

    Unit,
    Expose,
    As,

    Attr,
    DoubleColon,
    OpenDelim(Delim),
    CloseDelim(Delim),

    Err(char)
}

enum Delim {
    Round,
    Square,
    Arrow,
    Curly
}

Parsing

Describing the entire formal grammar for August would suck, so here is a few interesting ones.

Exec:

exec(cargo "build" --release)

Meta:

meta(
    @key "Value"
)

Commands:

FS::create(str)
FS::remove(str)
FS::move(str, str)
FS::copy(str, str)
FS::print_file(str)
FS::eprint_file(str)

ENV::set_var(str, str)
ENV::remove_var(str)
ENV::path_push(str)
ENV::path_remove(str)

IO::println(str)
IO::print(str)
IO::eprintln(str)
IO::eprint(str)

exec() ~()
meta() @()
depends_on()
do()

Module names are case insensitive.

Threads Runtime

>=0.5 && < 0.7

depends_on runs its tasks in parallel. So let's define how this works. If only one task is specified, it will be run on the current thread. Every other task will be run on a different one.

unit Build {
    depends_on(Test, Clean, Format)
}

unit Test {}
unit Clean {}
unit Format {}

flowchart TB
    subgraph Main Thread
        Build --> Test
    end

    subgraph Thread 2
        Build --> Clean
    end

    subgraph Thread 3
        Build --> Format
    end

With connection dependencies.

unit Build {
    depends_on(Test, Clean, Format)
}

unit Test {
    depends_on(Clean)
}
unit Clean {}
unit Format {}

flowchart TB
    subgraph Main Thread
        Build -->|Block until Clean|Test
    end

    subgraph Thread 2
        Build --> Clean
    end

    subgraph Thread 3
        Build --> Format
    end

Every unit is placed in a map to track whether it is in progress or has been completed or not.

In the second example shows a unit that shares a dependency with its dependent.

flowchart LR
    Build --> Test
    Test --> Clean
    Build --> Clean

As Build is the unit being called, it begins work on both Test and Clean at the same time. Build should not need to concern itself with the dependencies of its dependencies. Test will run any units that aren't in progress or completed, following the same threading rules. After all of these complete, Test will block until Clean is completed.

Unlike the original execution model, this model resolves the dependency graph breadth-first, rather than depth-first. Thanks to the blocking behaviour, we still can assure that they constraints hold.

Due to the nature of this model, circular dependencies are not allowed and will deadlock. This can be solved with a verification step for scripts which may be helpful for larger projects.

Codewise, this model should use scoped threads to ensure all threads have completed (read joined), and all in progress dependencies are resolved. The general algorithm works like so:

flowchart TB
    A[Count incomplete dependencies N]
    subgraph Thread Scope
        B[If N >= 2\nSpawn N-1 threads running other units]
        C[Run one unit on main thread]
        D[Block for initial in progress units]
        E[Wait for other threads to complete]
    end
    F[Done]
    A --> B --> C --> D --> E --> F
    D -->|hint::spin_loop|D

Async Runtime

0.6.0

Previously with August's restructuring in 0.5, units were lanuched on seperate threads for concurrency. This is wildly ineffecient and more importantly, places a cost on tasks as an abstraction.

August now uses an async runtime model based on Tokio and FuturesUnordered (may be replaced with JoinSet). This means that a fixed number of threads are used but still allows for concurrent execution.

Blocking on an in progress unit is also now implemented with futures instead of spin loop. Returns a function on error to prevent needing to store values to create an error in the future.

struct BlockOnDepFuture<'a> {
    uos: &'a AtomicU8,
}

impl Future for BlockOnDepFuture<'_> {
    type Output = Result<(), fn(String, Spanned<String>) -> RuntimeError>;

    fn poll(self: std::pin::Pin<&mut Self>, cx: &mut std::task::Context<'_>) -> Poll<Self::Output> {
        if self.uos.load(Ordering::Acquire) < UOS_COMPLETE {
            cx.waker().wake_by_ref();
            return Poll::Pending;
        }

        if self.uos.load(Ordering::Relaxed) == UOS_FAILED {
            Poll::Ready(Err(|unit_name, dep_name| {
                RuntimeError::FailedDependency(unit_name, dep_name)
            }))
        } else {
            Poll::Ready(Ok(()))
        }
    }
}

The threads runtime will still be avaliable in 0.6 via august run --deprecated-threads-runtime, but will be removed from the codebase 0.7.