alisp.org: Remove notes and overview

Not needed with what we're doing currently. I'll make proper documentation when I'm done.
alisp.org: make a backlog for tasks
2026-02-05 04:35:54 +00:00 · 2026-02-05 04:35:47 +00:00 · 2026-02-05 04:34:29 +00:00
2 changed files with 72 additions and 77 deletions
--- a/.dir-locals.el
+++ b/.dir-locals.el
@@ -1,6 +1,6 @@
 ;;; Directory Local Variables            -*- no-byte-compile: t -*-
 ;;; For more information see (info "(emacs) Directory Variables")

-((nil . ((compile-command . "make MODE=debug")
+((nil . ((compile-command . "make MODE=debug test examples")
         (+license/license-choice . "GNU General Public License Version 2")))
 (c-mode . ((mode . clang-format))))
--- a/alisp.org
+++ b/alisp.org
@@ -3,48 +3,7 @@
 #+date: 2025-08-20
 #+filetags: :alisp:

-* Notes
-** Overview
-~alisp.h~ is a single header for the entire runtime. We'll also have a
-compiled shared library ~alisp.so~ which one may link against to get
-implementation.  That's all that's necessary for one to write C code
-that targets our Lisp machine.
-
-We'll have a separate header + library for the compiler since that's
-not strictly necessary for transpiled C code to consume.  This will
-transpile Lisp code into C, which uses the aforementioned ~alisp~
-header and library to compile into a native executable.
-
-** WIP How does transpiled code operate?
-My current idea is: we're transpiling into C for the actual Lisp code.
-User made functions can be transpiled into C functions, which we can
-mangle names for.  Macros... I don't know, maybe we could have two
-function pointer tables so we know how to execute them?
-
-Then, we'll have an associated "descriptor" file which describes the
-functions we've transpiled.  Bare minimum, this file has to have a
-"symbol name" to C mangled function name dictionary.  We can also add
-other metadata as we need.
-
-*** TODO Deliberate on whether we compile into a shared library or not
-If we compile these C code objects into shared libraries, the
-descriptor needs to concern itself with code locations.  This might be
-easier in a sense, since the code will already be compiled.
-** WIP How do we call native code?
-When we're calling a natively compiled function, we can use this
-metadata mapping to call the C function.  This native code will use
-our Lisp runtime, same as any other code, so it should be pretty
-seamless in that regard.  But we'll need to set a calling convention
-in order to make calling into this seamless from a runtime
-perspective.
 * Tasks
-** TODO Capitalise symbols (TBD) :optimisation:design:
-Should we capitalise symbols?  This way, we limit the symbol table's
-possible options a bit (potentially we could design a better hashing
-algorithm?) and it would be kinda like an actual Lisp.
-** TODO Design Strings
-We have ~sv_t~ so our basic C API is done.  We just need pluggable
-functions to construct and deconstruct strings as lisps.
 ** WIP Reader system
 We need to design a reader system.  The big idea: given a "stream" of
 data, we can break out expressions from it.  An expression could be
@@ -83,7 +42,6 @@ easier.  We're not going to do anything more advanced than the API
 i.e. no parsing.
 **** DONE Design the tagged union
 **** DONE Design the API
-*** WIP Figure out the possible parse errors
 *** DONE Design what a "parser function" would look like
 The general function is something like ~stream -> T | Err~.  What
 other state do we need to encode?
@@ -91,38 +49,68 @@ other state do we need to encode?
 *** TODO Write a parser for symbols
 *** TODO Write a parser for lists
 *** TODO Write a parser for vectors
-*** TODO Write a generic parser that returns a generic expression
-** TODO Test system registration of allocated units :test:
-In particular, does clean up work as we expect?  Do we have situations
-where we may double free or not clean up something we should've?
-** TODO Design garbage collection scheme :design:gc:
+*** TODO Write the general parser
+** Backlog
+*** TODO Design Big Integers
+We currently have 62 bit integers implemented via immediate values
+embedded in a pointer.  We need to be able to support even _bigger_
+integers.  How do we do this?
+*** TODO Design garbage collection scheme :design:gc:
 Really, regardless of what I do, we need to have some kind of garbage
-collection header on whatever we allocate e.g. references if we
-reference count for GC.
-*** TODO Mark stage
-When some item is being used by another, we need a way to adjust the
-metadata such that the system is aware of it being used.
+collection header on whatever managed objects we allocate.

-For example, say I have X, Y as random allocated objects.  Then I
-construct CONS(X, Y).  Then, ref(X) and ref(Y) need to be incremented
-to say I'm using them.
-*** TODO Sweep
-Say I have an object that I construct, C.  If ref(C) = 0, then C is no
-longer needed, and is free.
+Firstly, the distinction between managed and unmanaged objects:
+- Managed objects are allocations that are generated as part of
+  evaluating user code i.e. strings, vectors, conses that are all made
+  as part of evaluating code.
+- Unmanaged objects are allocations we do as part of the runtime.
+  These are things that we expect to have near infinite lifetimes
+  (such as the symbol table, vector of allocated objects, etc).

-There are two components to this:
- we need a way of decrementing references if an object is no longer needed.
- we need a way of running through everything we've allocated so far
-  to figure out what's free to take away.
+We need to perform garbage collection against the managed objects, and
+leave the unmanaged objects to the runtime.
+**** TODO Mark stage
+We need to mark all objects that are currently accessible from the
+environment.  This means we need to have a root environment which we
+mark all our accessible objects from.  Any objects that aren't marked
+by this obviously are inaccessible, so we can then sweep them.

-Once we've filtered out what we don't need anymore, what should we do
-with them?  Naive approach would be to just actually ~free~ the cells
-in question.  But I think the next item may be a better idea.
-*** TODO Use previous allocations if they're free to use
-If we have no references to a cell, this cell is free to use.  In
-other words, if I later allocate something of the same type, instead
-of allocating a new object, why not just use the one I've already got?
+How do we store this mark on our managed objects?  I think the
+simplest approach would be to allocate an extra 8 bytes just before
+any managed object we allocate i.e. [8 byte buffer] <object>.  Then,
+during the mark phase, we can walk back those 8 bytes and
+inspect/mutate the mark.
+**** TODO Sweep
+Once we've marked all objects that are accessible, we need to
+investigate all the objects that aren't.  We do have
+[[file:alisp.h::vec_t memory;][this]] which provides a global map of
+all the stuff we've allocated so far ([[file:alisp.h::void
+sys_register(sys_t *, lisp_t *);][sys_register]] is used to add to
+this, and any managed object is expected to register).

+We can iterate through the map and collect all the unmarked objects.
+What do we do with these?
+
+1) They are technically freestanding objects allocated through
+   ~calloc~, so we could just free them.
+2) Manage some collection of previous allocations to reuse in our next
+   allocation.
+
+Option (1) is obvious and relatively clean to setup in our current
+idea:
+- Say at index I we have an object that is unmarked
+- Free the associated object at index I
+- Swap the end of the array with the cell at index I, then decrement
+  the size of the container
+
+This is an O(1) time operation.
+
+Option (2) is also relatively straightforward, but we need another
+counter in order to make it work:
+- Say at index I we have an object that is unmarked
+- Swap the end of the array with the cell at index I, then decrement
+  the size of the container
+**** TODO Use previous allocations if they're free to use
 This way, instead of deleting the memory or forgetting about it, we
 can reuse it.  We need to be really careful to make sure our ref(X) is
 actually precise, we don't want to trample on the user's hard work.
@@ -151,18 +139,25 @@ Latter approach time complexity:

 Former approach is better time complexity wise, but latter is way
 better in terms of simplicity of code.  Must deliberate.
-** TODO Design Big Integers
-We currently have 62 bit integers implemented via immediate values
-embedded in a pointer.  We need to be able to support even _bigger_
-integers.  How do we do this?
-** DONE Test value constructors and destructors :test:
+*** TODO Test system registration of allocated units :test:
+In particular, does clean up work as we expect?  Do we have situations
+where we may double free or not clean up something we should've?
+*** TODO Design Strings
+We have ~sv_t~ so our basic C API is done.  We just need pluggable
+functions to construct and deconstruct strings as lisps.
+*** TODO Capitalise symbols (TBD) :optimisation:design:
+Should we capitalise symbols?  This way, we limit the symbol table's
+possible options a bit (potentially we could design a better hashing
+algorithm?) and it would be kinda like an actual Lisp.
+** Completed
+*** DONE Test value constructors and destructors :test:
 Test if ~make_int~ works with ~as_int,~ ~intern~ with ~as_sym~.
 Latter will require a symbol table.
-** DONE Test containers constructors and destructors :test:
+*** DONE Test containers constructors and destructors :test:
 Test if ~make_vec~ works with ~as_vec~, ~cons~ with ~as_cons~ AND
 ~CAR~, ~CDR~.

 We may need to think of effective ways to deal with NILs in ~car~ and
 ~cdr~.  Maybe make functions as well as the macros so I can choose
 between them?
-*** DONE Write more tests
+**** DONE Write more tests