Simple arl-mode plugin for Emacs

lexer/token: update tokeniser to recognise puts
examples/hello-world: putstr -> puts
2026-02-01 19:25:13 +00:00 · 2026-01-29 15:49:20 +00:00 · 2026-01-29 15:40:30 +00:00 · 2026-01-29 05:42:16 +00:00 · 2026-01-29 05:19:51 +00:00 · 2026-01-29 05:19:40 +00:00
22 changed files with 763 additions and 543 deletions
--- a/.dir-locals.el
+++ b/.dir-locals.el
@@ -1,6 +1,6 @@
 ;;; Directory Local Variables            -*- no-byte-compile: t -*-
 ;;; For more information see (info "(emacs) Directory Variables")
-((nil      . ((compile-command . "make MODE=debug -k")
+((nil      . ((compile-command . "make -k MODE=debug examples")
              (+license/license-choice . "MIT License")))
 (c-mode   . ((mode . clang-format))))
--- a/20
+++ b/20
@@ -3,13 +3,13 @@ CC=cc
 DIST=build
 OUT=$(DIST)/arl.out
-MODULES=. lib parser
+MODULES=$(shell cd include/arl; find . -type 'd' -printf "%f\n")
-UNITS=main lib/vec lib/sv parser/ast parser/parser
+UNITS=main cli lib/vec lib/sv lexer/token lexer/lexer
 OBJECTS:=$(patsubst %,$(DIST)/%.o, $(UNITS))
 LDFLAGS=
-GFLAGS=-Wall -Wextra -Wpedantic -std=c23 -I./src/
+GFLAGS=-Wall -Wextra -Wpedantic -std=c23 -I./include/
-DFLAGS=-ggdb -fsanitize=address -fsanitize=undefined
+DFLAGS=-ggdb -fsanitize=address -fsanitize=undefined -DVERBOSE_LOGS=1
 RFLAGS=-O3
 MODE=release
@@ -26,7 +26,7 @@ DEPDIR=$(DIST)/deps
 $(OUT): $(OBJECTS) | $(DIST)
 	$(CC) $(CFLAGS) -o $@ $^ $(LDFLAGS)
-$(DIST)/%.o: src/arl/%.c | $(DIST) $(DEPDIR)
+$(DIST)/%.o: src/%.c | $(DIST) $(DEPDIR)
 	$(CC) $(CFLAGS) $(DEPFLAGS) $(DEPDIR)/$*.d -c -o $@ $<
 $(DIST):
@@ -35,7 +35,11 @@ $(DIST):
 $(DEPDIR):
 	mkdir -p $(patsubst %,$(DEPDIR)/%, $(MODULES))
-.PHONY: run clean
+clangd: compile_commands.json
 compile_commands.json: Makefile
 	bear -- $(MAKE) -B MODE=debug
 .PHONY: run clean examples
 ARGS=
 run: $(OUT)
 	./$^ $(ARGS)
@@ -43,5 +47,9 @@ run: $(OUT)
 clean:
 	rm -rf $(DIST)
 examples: $(OUT)
 	@echo "Example: Hello World"
 	./$^ examples/hello-world.arl
 DEPS:=$(patsubst %,$(DEPDIR)/%.d, $(UNITS))
 include $(wildcard $(DEPS))
--- a/14
+++ b/14
@@ -6,13 +6,12 @@
 │ /_/   \_\_| \_\_____| │
 └───────────────────────┘
-Similar to Forth.  Compiles to C.
+Similar to Forth.
 Native speed with simple semantics.
 -----
 Goals
 -----
- Complete operational transpiler to C
+- Complete operational transpiler, with C as a provisional working target
 - Ability to reuse compiled code (as object code) in top level ARL code.
 - Static type system with informative errors
@@ -44,3 +43,12 @@ $ make DIST=<folder>
 Similarly, the general flags used in the C compiler may be set via the CFLAGS
 variable, with linking arguments set via the LDFLAGS variable.
 ------------------
 Usage instructions
 ------------------
 Once built, simply use the built binary like so:
 $ ./build/arl.out <filename>
 Alternatively, you can run the examples automatically via the Makefile:
 $ make examples
--- a/arl.org
+++ b/arl.org
@@ -1,69 +1,64 @@
 #+title: ARL - Issue tracker
 #+date: 2026-01-23
 #+filetags: arl
 * TODO Write a minimum working transpiler
 We need to be able to compile the following file:
 [[file:examples/hello-world.arl]].  All it does is print "Hello,
 world!".  Should be relatively straightforward.
 ** Stages
 We need the following stages in our MVP transpiler:
 - Source code reading (read bytes from a file)
 - Parse raw bytes into tokens (Lexer)
 - Interpret tokens into a classical AST (Parser)
 - Stack effect and type analysis of the AST for soundness
 - Translate AST into C code (Codegen)
 - Compile C code into native executable (Target)
 It's a Eulerian Path from the source code to the native executable.
 ** DONE Read file
-** DONE Parser
+** DONE Lexer
-** TODO Intermediate representation (Virtual Machine)
+[[file:src/lexer/]]
-[[file:src/arl/vm/]]
+[[file:include/arl/lexer/]]
 ** WIP Parser
 [[file:src/parser/]]
 [[file:include/arl/parser/]]
-Before we get into generating C code and then compiling it, it might
+We need to generate some form of AST from the token stream.  This
-be worth translating the parsed ARL code into a generic IR.
+should be a little more advanced than our initial stream,
 distinguishing between
 - Literal values
 - Primitive calls
 - References to otherwise undefined words (may be defined through
  import or later on)
 ** TODO Stack effect/type analysis
 [[file:src/analysis/]]
 [[file:include/arl/analysis/]]
-The IR should be much more primitive in its semantics, and force clear
+Given the AST, we need to verify the soundness of it with regards to
-requirements of the platform we're compiling to.  This way, at the
+types and the stack.  We have this idea of "stack effects" attached to
-code generator stage we can figure out:
+every node in the AST; literals push values onto the stack and pop
- what can we reasonably use from the target platform to satisfy
+nothing, while operations may pop some operands and push some values.
  requirements?
 - what do we need to hand-roll on the target in order to make this
  work?
-Essentially, we want to write a virtual machine, and translate ARL
+We need a way to:
-code into bytecode for that VM.  Goals:
+- Codify the stack effects of each type of AST node
- Easier to optimise IR bytecode than the AST of our original program
+- Infer the total stack effect from a sequence of nodes
 - Easier to imagine translations from that IR bytecode into target
  platform code
 *** TODO Minimal IR representation
 We need the following clear items in our IR:
 - Static type values
 - Static type variables (possible DeBrujin numbering or other such
  mechanism to abstract naming away and leave it to the target to
  generate effectively)
 - Strongly typed primitive operators (numeric, strings, I/O) with
  packed arguments
-Read about [[https://en.wikipedia.org/wiki/Three-address_code][TAC]].
+These stack effects work in tandem with our type analysis.  Stack
-*** TODO IR Compiler
+shape analysis tells us what operands are being fed into primitives,
-We should have a rough grouping between AST objects and this IR.  As
+while the type analysis will tell us if the operands are well formed
-ARL is Forth-like, we can use the stack semantics to generate this IR
+for the primitives.
 as we walk the AST in a linear manner.
 Consider the following ARL code:
 #+begin_src text
 34 35 +
 #+end_src
 When we walk through this code:
 - 34 (an integer) is pushed onto the stack
 - 35 (an integer) is pushed onto the stack
 - + is encountered
  - Pop two values off the stack and verify their type against the
    contract for "+" (something like (-> i32 i32 i32))
  - Generate IR, something like ~prim-add(34, 35)~
 *** TODO Consider optimisers
 Certainly we should perform optimisations on the IR itself before
 passing it over to the code generator.  Currently we haven't got much
 in the way of optimisations to consider, but it may be worth
 considering.
 ** TODO Code generator
-[[file:src/arl/target-c/]]
+[[file:src/codegen/]]
 [[file:include/arl/codegen/]]
-This should take the IR translated from the AST generated by the
+This should take the AST generated by the parser (which should already
-parser, and write equivalent C code.
+have been analysed), and write equivalent C code.
 ** TODO Target compilation
 [[file:src/target/]]
 [[file:include/arl/target/]]
-After we've generated the C code, we need to call a C compiler on it
+=gcc= and =clang= take C code via /stdin/, so we don't need to write
-to generate a binary.  GCC and Clang allow passing source code through
+the C code to disk - we can just leave it as a buffer of bytes.  So
-stdin, so we don't even need to write to disk first which is nice.
+we'll call the compilers and feed the generated code from the previous
 stage into it via stdin.
--- a/examples/hello-world.arl
+++ b/examples/hello-world.arl
@@ -1 +1 @@
-"Hello, world!" println
+"Hello, world!\n" puts
--- a/extensions/arl-mode.el
+++ b/extensions/arl-mode.el
@@ -0,0 +1,37 @@
 ;;; arl-mode.el --- ARL mode for Emacs               -*- lexical-binding: t; -*-
 ;; Copyright (C) 2026  Aryadev Chavali
 ;; Author: Aryadev Chavali <aryadev@aryadevchavali.com>
 ;; Keywords:
 ;; Copyright (C) 2026 Aryadev Chavali
 ;; This program is distributed in the hope that it will be useful, but WITHOUT
 ;; ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS
 ;; FOR A PARTICULAR PURPOSE.  See the MIT License for details.
 ;; You may distribute and modify this code under the terms of the MIT License,
 ;; which you should have received a copy of along with this program.  If not,
 ;; please go to <https://opensource.org/license/MIT>.
 ;;; Commentary:
 ;;
 ;;; Code:
 (defvar arl-mode-comments '(?\; ";;" ("#|" . "|#")))
 (defvar arl-mode-keywords '("if" "then" "else"))
 (defvar arl-mode-expressions '(("\".*\"" . font-lock-string-face)))
 (defvar arl-mode-automode-list '("\\.arl"))
 (define-derived-mode arl-mode
  arl-mode-comments
  arl-mode-keywords
  arl-mode-expressions
  arl-mode-automode-list
  nil)
 (provide 'arl-mode)
 ;;; arl-mode.el ends here
--- a/include/arl/cli.h
+++ b/include/arl/cli.h
@@ -0,0 +1,31 @@
 /* cli.h: CLI helpers
 * Created: 2026-01-29
 * Author: Aryadev Chavali
 * License: See end of file
 * Commentary:
 */
 #ifndef CLI_H
 #define CLI_H
 #include <stdio.h>
 #include <arl/lib/sv.h>
 int read_file(const char *filename, sv_t *ret);
 int read_pipe(FILE *pipe, sv_t *ret);
 void usage(FILE *fp);
 #endif
 /* Copyright (C) 2026 Aryadev Chavali
 * This program is distributed in the hope that it will be useful, but WITHOUT
 * ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS
 * FOR A PARTICULAR PURPOSE.  See the MIT License for details.
 * You may distribute and modify this code under the terms of the MIT License,
 * which you should have received a copy of along with this program.  If not,
 * please go to <https://opensource.org/license/MIT>.
 */
--- a/include/arl/lexer/lexer.h
+++ b/include/arl/lexer/lexer.h
@@ -1,38 +1,38 @@
-/* parser.h: Parser which takes character buffers and yields an AST
+/* lexer.h: Lexer which takes character buffers and yields a sequence of tokens.
 * Created: 2026-01-22
 * Author: Aryadev Chavali
 * License: See end of file
 * Commentary:
 */
-#ifndef PARSER_H
+#ifndef LEXER_H
-#define PARSER_H
+#define LEXER_H
-#include <arl/parser/ast.h>
+#include <arl/lexer/token.h>
-/// Parser streams, utilised when generating an AST.
+/// Token streams, utilised when lexing.
 typedef struct
 {
  u64 byte;
  sv_t contents;
-} parse_stream_t;
+} lex_stream_t;
-/// Types of errors that may occur during parsing
+/// Types of errors that may occur during lexing
 typedef enum
 {
-  PARSE_ERR_OK = 0,
+  LEX_ERR_OK = 0,
-  PARSE_ERR_EXPECTED_SPEECH_MARKS,
+  LEX_ERR_EXPECTED_SPEECH_MARKS,
-  PARSE_ERR_UNKNOWN_CHAR,
+  LEX_ERR_UNKNOWN_CHAR,
-} parse_err_t;
+} lex_err_t;
-const char *parse_err_to_string(parse_err_t err);
+const char *lex_err_to_string(lex_err_t err);
-// Generates an AST from STREAM, storing it in OUT.  Returns any errors it may
+// Generates a token stream from a lex_stream_t, storing it in OUT.  Returns any
-// generate.
+// errors it may generate.
-parse_err_t parse(ast_t *out, parse_stream_t *stream);
+lex_err_t lex_stream(token_stream_t *out, lex_stream_t *stream);
 // Computes the line and column that STREAM is currently pointing at in its
 // buffer, storing it in LINE and COL.
-void parse_stream_get_line_col(parse_stream_t *stream, u64 *line, u64 *col);
+void lex_stream_get_line_col(lex_stream_t *stream, u64 *line, u64 *col);
 #endif
--- a/include/arl/lexer/token.h
+++ b/include/arl/lexer/token.h
@@ -0,0 +1,73 @@
 /* token.h: General definition of tokens, and a sequence of them.
 * Created: 2026-01-22
 * Author: Aryadev Chavali
 * License: See end of file
 * Commentary:
 */
 #ifndef TOKEN_H
 #define TOKEN_H
 #include <arl/lib/base.h>
 #include <arl/lib/sv.h>
 #include <arl/lib/vec.h>
 /// Types of tokens
 typedef enum
 {
  TOKEN_TYPE_KNOWN = 0,
  TOKEN_TYPE_SYMBOL,
  TOKEN_TYPE_STRING,
  NUM_TOKEN_TYPES,
 } token_type_t;
 /// Known symbols which later stages would benefit from.
 typedef enum
 {
  TOKEN_KNOWN_PUTS,
  NUM_TOKEN_KNOWNS,
 } token_known_t;
 const char *token_known_to_cstr(token_known_t);
 /// Tokens are a tagged union
 typedef struct
 {
  u64 byte_location;
  token_type_t type;
  union
  {
    token_known_t as_known;
    sv_t as_symbol;
    sv_t as_string;
  };
 } token_t;
 token_t token_known(u64 byte, token_known_t known);
 token_t token_symbol(u64 byte, sv_t symbol);
 token_t token_string(u64 byte, sv_t string);
 void token_print(FILE *fp, token_t *token);
 /// Sequence of tokens
 typedef struct
 {
  vec_t vec;
 } token_stream_t;
 void token_stream_free(token_stream_t *token);
 void token_stream_print(FILE *fp, token_stream_t *token);
 #endif
 /* Copyright (C) 2026 Aryadev Chavali
 * This program is distributed in the hope that it will be useful, but WITHOUT
 * ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS
 * FOR A PARTICULAR PURPOSE.  See the MIT License for details.
 * You may distribute and modify this code under the terms of the MIT License,
 * which you should have received a copy of along with this program.  If not,
 * please go to <https://opensource.org/license/MIT>.
 */
--- a/include/arl/lib/base.h
+++ b/include/arl/lib/base.h
@@ -33,13 +33,35 @@ typedef double f64;
 #define MIN(A, B)  ((A) > (B) ? (B) : (A))
 #define ARRSIZE(A) ((sizeof(A)) / sizeof((A)[0]))
-#define FAIL(...)                 \
+#ifndef VERBOSE_LOGS
 #define VERBOSE_LOGS 0
 #endif
 #if VERBOSE_LOGS
 #define LOG(...)                  \
  do                              \
  {                               \
    fprintf(stdout, "LOG: ");     \
    fprintf(stdout, __VA_ARGS__); \
  } while (0);
 #else
 #define LOG(...)
 #endif
 #define LOG_ERR(...)              \
  do                              \
  {                               \
    fprintf(stderr, "FAIL: ");    \
    fprintf(stderr, __VA_ARGS__); \
-    assert(0);                    \
+  } while (0);
 #define FAIL(...)         \
  do                      \
  {                       \
    LOG_ERR("FAIL: ");    \
    LOG_ERR(__VA_ARGS__); \
    assert(0);            \
  } while (0)
 #endif
 /* Copyright (C) 2026 Aryadev Chavali
--- a/include/arl/lib/sv.h
+++ b/include/arl/lib/sv.h
--- a/include/arl/lib/vec.h
+++ b/include/arl/lib/vec.h
@@ -31,9 +31,13 @@ typedef struct
 static_assert(sizeof(vec_t) == 64, "Expected sizeof(vec_t) to be 64");
 // standard old appending methods
 void vec_append(vec_t *vec, const void *const ptr, u64 size);
 void vec_append_byte(vec_t *vec, u8 byte);
 // vector-as-a-stack
 u8 *vec_pop(vec_t *vec, u64 size);
 // Returns pointer to the start of the buffer VEC is currently using to store
 // data (either its inline buffer or the heap buffer).
 void *vec_data(vec_t *vec);
@@ -43,8 +47,13 @@ void vec_ensure_capacity(vec_t *vec, u64 capacity);
 // Ensure VEC has at least SIZE bytes free
 void vec_ensure_free(vec_t *vec, u64 size);
 // Free the memory associated with the vector
 void vec_free(vec_t *vec);
 // Reset a vector while preserving any allocations
 void vec_reset(vec_t *vec);
 // Copy all data from V1 into V2.
 void vec_clone(vec_t *v2, vec_t *v1);
--- a/src/arl/main.c
+++ b/src/arl/main.c
@@ -1,82 +0,0 @@
 /* main.cpp:
 * Created: 2026-01-22
 * Author: Aryadev Chavali
 * License: See end of file
 * Commentary:
 */
 #include <assert.h>
 #include <ctype.h>
 #include <stdbool.h>
 #include <stdio.h>
 #include <stdlib.h>
 #include <string.h>
 #include <arl/lib/base.h>
 #include <arl/lib/sv.h>
 #include <arl/lib/vec.h>
 #include <arl/parser/ast.h>
 #include <arl/parser/parser.h>
 /// Parser
 sv_t read_file(const char *filename)
 {
  FILE *fp = fopen(filename, "rb");
  if (!fp)
    FAIL("File `%s` does not exist\n", filename);
  fseek(fp, 0, SEEK_END);
  long size = ftell(fp);
  fseek(fp, 0, SEEK_SET);
  char *buffer = calloc(1, size + 1);
  fread(buffer, size, 1, fp);
  fclose(fp);
  buffer[size] = '\0';
  return SV(buffer, size);
 }
 int main(void)
 {
  const char *filename = "./examples/hello-world.arl";
  sv_t contents        = read_file(filename);
  printf("%s\n=> `" PR_SV "`\n", filename, SV_FMT(contents));
  parse_stream_t stream = {.byte = 0, .contents = contents};
  ast_t ast             = {0};
  parse_err_t perr      = parse(&ast, &stream);
  if (perr)
  {
    u64 line = 1, col = 0;
    parse_stream_get_line_col(&stream, &line, &col);
    fprintf(stderr, "%s:%lu:%lu: %s\n", filename, line, col,
            parse_err_to_string(perr));
    goto fail;
  }
  printf("=> Parsed %lu nodes\n", ast.nodes.size / sizeof(ast_node_t));
  ast_print(stdout, &ast);
  printf("\n");
  free(contents.data);
  ast_free(&ast);
  return 0;
 fail:
  if (contents.data)
    free(contents.data);
  if (ast.nodes.capacity > 0)
    ast_free(&ast);
  return 1;
 }
 /* Copyright (C) 2026 Aryadev Chavali
 * This program is distributed in the hope that it will be useful, but WITHOUT
 * ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS
 * FOR A PARTICULAR PURPOSE.  See the MIT License for details.
 * You may distribute and modify this code under the terms of the MIT License,
 * which you should have received a copy of along with this program.  If not,
 * please go to <https://opensource.org/license/MIT>.
 */
--- a/src/arl/parser/ast.c
+++ b/src/arl/parser/ast.c
@@ -1,117 +0,0 @@
 /* ast.c: Implementation of AST constructor/destructor functions
 * Created: 2026-01-22
 * Author: Aryadev Chavali
 * License: See end of file
 * Commentary: See ast.h.
 */
 #include <arl/lib/base.h>
 #include <arl/lib/vec.h>
 #include <arl/parser/ast.h>
 const char *ast_prim_to_cstr(ast_prim_t prim)
 {
  switch (prim)
  {
  case AST_PRIM_NIL:
    return "nil";
  case AST_PRIM_PRINTLN:
    return "println";
  default:
    FAIL("Unexpected AST primitive value: %d\n", prim);
  }
 }
 ast_node_t ast_node_prim(u64 byte, ast_prim_t primitive)
 {
  return (ast_node_t){
      .byte_location = byte,
      .type          = AST_NODE_TYPE_PRIMITIVE,
      .value         = {.as_prim = primitive},
  };
 }
 ast_node_t ast_node_string(u64 byte, sv_t string)
 {
  return (ast_node_t){
      .byte_location = byte,
      .type          = AST_NODE_TYPE_STRING,
      .value         = {.as_string = string},
  };
 }
 ast_node_t ast_node_symbol(u64 byte, sv_t symbol)
 {
  return (ast_node_t){
      .byte_location = byte,
      .type          = AST_NODE_TYPE_SYMBOL,
      .value         = {.as_symbol = symbol},
  };
 }
 void ast_node_print(FILE *fp, ast_node_t *node)
 {
  if (!node)
  {
    fprintf(fp, "NIL");
    return;
  }
  switch (node->type)
  {
  case AST_NODE_TYPE_PRIMITIVE:
    fprintf(fp, "PRIMITIVE(%s)", ast_prim_to_cstr(node->value.as_prim));
    break;
  case AST_NODE_TYPE_SYMBOL:
    fprintf(fp, "SYMBOL(" PR_SV ")", SV_FMT(node->value.as_symbol));
    break;
  case AST_NODE_TYPE_STRING:
    fprintf(fp, "STRING(" PR_SV ")", SV_FMT(node->value.as_string));
    break;
  case NUM_AST_NODE_TYPES:
  default:
    FAIL("Unexpected node type: %d\n", node->type);
  }
 }
 void ast_print(FILE *fp, ast_t *ast)
 {
  if (!ast)
  {
    fprintf(fp, "{}");
    return;
  }
  fprintf(fp, "{");
  if (ast->nodes.size == 0)
  {
    fprintf(fp, "}\n");
    return;
  }
  fprintf(fp, "\n");
  for (u64 i = 0; i < ast->nodes.size / sizeof(ast_node_t); ++i)
  {
    ast_node_t item = VEC_GET(&ast->nodes, i, ast_node_t);
    fprintf(fp, "\t[%lu]: ", i);
    ast_node_print(fp, &item);
    fprintf(fp, "\n");
  }
  fprintf(fp, "}");
 }
 void ast_free(ast_t *ast)
 {
  // we can free the vector itself and we're done
  vec_free(&ast->nodes);
 }
 /* Copyright (C) 2026 Aryadev Chavali
 * This program is distributed in the hope that it will be useful, but WITHOUT
 * ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS
 * FOR A PARTICULAR PURPOSE.  See the MIT License for details.
 * You may distribute and modify this code under the terms of the MIT License,
 * which you should have received a copy of along with this program.  If not,
 * please go to <https://opensource.org/license/MIT>.
 */
--- a/src/arl/parser/ast.h
+++ b/src/arl/parser/ast.h
@@ -1,75 +0,0 @@
 /* ast.h: General definition of the AST and nodes within it.
 * Created: 2026-01-22
 * Author: Aryadev Chavali
 * License: See end of file
 * Commentary:
 */
 #ifndef AST_H
 #define AST_H
 #include <arl/lib/base.h>
 #include <arl/lib/sv.h>
 #include <arl/lib/vec.h>
 /// Types the AST can encode
 typedef enum
 {
  AST_NODE_TYPE_PRIMITIVE = 0,
  AST_NODE_TYPE_SYMBOL,
  AST_NODE_TYPE_STRING,
  NUM_AST_NODE_TYPES,
 } ast_node_type_t;
 /// Primitives (values, callables, etc) as symbols
 typedef enum
 {
  AST_PRIM_NIL = 0,
  AST_PRIM_PRINTLN,
  NUM_AST_PRIMS,
 } ast_prim_t;
 const char *ast_prim_to_cstr(ast_prim_t);
 /// Node of the AST as a tagged union
 typedef struct
 {
  u64 byte_location;
  ast_node_type_t type;
  union
  {
    ast_prim_t as_prim;
    sv_t as_symbol;
    sv_t as_string;
  } value;
 } ast_node_t;
 ast_node_t ast_node_prim(u64 byte, ast_prim_t primitive);
 ast_node_t ast_node_symbol(u64 byte, sv_t symbol);
 ast_node_t ast_node_string(u64 byte, sv_t string);
 void ast_node_print(FILE *fp, ast_node_t *node);
 /// The AST as a flat collection of nodes
 typedef struct
 {
  vec_t nodes;
 } ast_t;
 void ast_free(ast_t *ast);
 void ast_print(FILE *fp, ast_t *ast);
 #endif
 /* Copyright (C) 2026 Aryadev Chavali
 * This program is distributed in the hope that it will be useful, but WITHOUT
 * ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS
 * FOR A PARTICULAR PURPOSE.  See the MIT License for details.
 * You may distribute and modify this code under the terms of the MIT License,
 * which you should have received a copy of along with this program.  If not,
 * please go to <https://opensource.org/license/MIT>.
 */
--- a/src/arl/parser/parser.c
+++ b/src/arl/parser/parser.c
@@ -1,183 +0,0 @@
 /* parser.c: Implementation of parser.
 * Created: 2026-01-22
 * Author: Aryadev Chavali
 * License: See end of file
 * Commentary: See parser.h
 */
 #include <ctype.h>
 #include <string.h>
 #include <arl/lib/sv.h>
 #include <arl/parser/ast.h>
 #include <arl/parser/parser.h>
 /// Expected characters in a symbol
 static const char *SYMBOL_CHARS =
    "ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz!#$%&'()*+,-./"
    ":;<=>?@\\^_`{|}~0123456789";
 const char *parse_err_to_string(parse_err_t err)
 {
  switch (err)
  {
  case PARSE_ERR_OK:
    return "OK";
  case PARSE_ERR_EXPECTED_SPEECH_MARKS:
    return "EXPECTED_SPEECH_MARKS";
  case PARSE_ERR_UNKNOWN_CHAR:
    return "UNKNOWN_CHAR";
  default:
    FAIL("Unexpected parse_err_t value: %d\n", err);
  }
 }
 /// Prototypes for streams
 bool stream_eos(parse_stream_t *stream);
 char stream_peek(parse_stream_t *stream);
 void stream_advance(parse_stream_t *stream, u64 size);
 u64 stream_size(parse_stream_t *stream);
 void parse_stream_get_line_col(parse_stream_t *stream, u64 *line, u64 *col)
 {
  assert(stream && line && col && "Expected valid pointers.");
  for (u64 i = 0; i < stream->byte; ++i)
  {
    char c = stream->contents.data[i];
    if (c == '\n')
    {
      *line += 1;
      *col = 0;
    }
    else
    {
      *col += 1;
    }
  }
 }
 /// Prototypes for parsing subroutines
 parse_err_t parse_string(parse_stream_t *stream, ast_node_t *ret);
 parse_err_t parse_symbol(parse_stream_t *stream, ast_node_t *ret);
 parse_err_t parse(ast_t *out, parse_stream_t *stream)
 {
  assert(out && stream && "Expected valid pointers");
  while (!stream_eos(stream))
  {
    char cur = stream_peek(stream);
    if (isspace(cur))
    {
      while (isspace(cur) && !stream_eos(stream))
      {
        stream_advance(stream, 1);
        cur = stream_peek(stream);
      }
    }
    else if (cur == '"')
    {
      // we make a copy for parse_string to mess with
      ast_node_t ret   = {0};
      parse_err_t perr = parse_string(stream, &ret);
      if (perr)
        return perr;
      vec_append(&out->nodes, &ret, sizeof(ret));
    }
    else if (strchr(SYMBOL_CHARS, cur) && !isdigit(cur))
    {
      // we make a copy for parse_symbol to mess with
      ast_node_t ret   = {0};
      parse_err_t perr = parse_symbol(stream, &ret);
      if (perr)
        return perr;
      vec_append(&out->nodes, &ret, sizeof(ret));
    }
    else
    {
      return PARSE_ERR_UNKNOWN_CHAR;
    }
  }
  return PARSE_ERR_OK;
 }
 parse_err_t parse_string(parse_stream_t *stream, ast_node_t *ret)
 {
  // Increment the cursor just past the first speechmark
  stream_advance(stream, 1);
  sv_t current_contents = sv_chop_left(stream->contents, stream->byte);
  u64 string_size       = sv_till(current_contents, "\"");
  if (string_size + stream->byte == stream_size(stream))
    return PARSE_ERR_EXPECTED_SPEECH_MARKS;
  // Bounds of string are well defined, generate an object and advance the
  // stream
  *ret =
      ast_node_string(stream->byte - 1, SV(current_contents.data, string_size));
  stream_advance(stream, string_size + 1);
  return PARSE_ERR_OK;
 }
 parse_err_t parse_symbol(parse_stream_t *stream, ast_node_t *ret)
 {
  sv_t current_contents = sv_chop_left(stream->contents, stream->byte);
  sv_t symbol =
      SV(current_contents.data, sv_while(current_contents, SYMBOL_CHARS));
  // see if symbol is one of the AST primitives we can parse AOT
  static_assert(NUM_AST_PRIMS == 2, "Expected number of AST primitives");
  for (ast_prim_t i = 0; i < NUM_AST_PRIMS; ++i)
  {
    const char *possible_prim = ast_prim_to_cstr(i);
    if (strlen(possible_prim) == symbol.size &&
        strncmp(possible_prim, symbol.data, symbol.size) == 0)
    {
      // Found a matching primitive
      *ret = ast_node_prim(stream->byte, i);
      goto end;
    }
  }
  // otherwise, it must be a fresh symbol i.e. user defined
  *ret = ast_node_symbol(stream->byte, symbol);
 end:
  stream_advance(stream, symbol.size);
  return PARSE_ERR_OK;
 }
 bool stream_eos(parse_stream_t *stream)
 {
  return stream->byte >= stream->contents.size;
 }
 char stream_peek(parse_stream_t *stream)
 {
  if (stream_eos(stream))
    return '\0';
  else
    return stream->contents.data[stream->byte];
 }
 void stream_advance(parse_stream_t *stream, u64 size)
 {
  if (stream->byte + size >= stream->contents.size)
    stream->byte = stream->contents.size;
  else
    stream->byte += size;
 }
 u64 stream_size(parse_stream_t *stream)
 {
  return stream->contents.size;
 }
 /* Copyright (C) 2026 Aryadev Chavali
 * This program is distributed in the hope that it will be useful, but WITHOUT
 * ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS
 * FOR A PARTICULAR PURPOSE.  See the MIT License for details.
 * You may distribute and modify this code under the terms of the MIT License,
 * which you should have received a copy of along with this program.  If not,
 * please go to <https://opensource.org/license/MIT>.
 */
--- a/src/cli.c
+++ b/src/cli.c
@@ -0,0 +1,83 @@
 /* cli.c:
 * Created: 2026-01-29
 * Author: Aryadev Chavali
 * License: See end of file
 * Commentary: See /include/arl/cli.h
 */
 #include <stdlib.h>
 #include <string.h>
 #include <arl/cli.h>
 #include <arl/lib/vec.h>
 int read_file(const char *filename, sv_t *ret)
 {
  // NOTE: Stupidly simple.  Presumes the file is NOT three pipes in a trench
  // coat.
  FILE *fp = fopen(filename, "rb");
  if (!fp)
    return 1;
  fseek(fp, 0, SEEK_END);
  ret->size = ftell(fp);
  fseek(fp, 0, SEEK_SET);
  ret->data = calloc(1, ret->size + 1);
  fread(ret->data, ret->size, 1, fp);
  fclose(fp);
  ret->data[ret->size] = '\0';
  return 0;
 }
 int read_pipe(FILE *pipe, sv_t *ret)
 {
  // NOTE: We can't read an entire pipe at once like we did for read_file.  So
  // let's read in buffered chunks, with a vector to keep them contiguous.
  vec_t contents = {0};
  char buffer[1024];
  while (!feof(pipe))
  {
    size_t bytes_read = fread(buffer, 1, sizeof(buffer), pipe);
    vec_append(&contents, buffer, bytes_read);
  }
  ret->size = contents.size;
  // Get that null terminator in, but only after we've recorded the actual size
  // of what's been read.
  vec_append_byte(&contents, '\0');
  if (contents.not_inlined)
  {
    // Take the heap pointer from us.
    ret->data = vec_data(&contents);
  }
  else
  {
    // vec_data(&contents) is stack allocated; can't carry that out of this
    // function!
    ret->data = calloc(1, contents.size);
    memcpy(ret->data, vec_data(&contents), contents.size);
  }
  return 0;
 }
 void usage(FILE *fp)
 {
  fprintf(fp, "Usage: arl [FILE]\n"
              "Compiles [FILE] as ARL source code.\n"
              "  [FILE]: File to compile.\n"
              "If FILE is \"--\", then read from stdin.\n");
 }
 /* Copyright (C) 2026 Aryadev Chavali
 * This program is distributed in the hope that it will be useful, but WITHOUT
 * ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS
 * FOR A PARTICULAR PURPOSE.  See the MIT License for details.
 * You may distribute and modify this code under the terms of the MIT License,
 * which you should have received a copy of along with this program.  If not,
 * please go to <https://opensource.org/license/MIT>.
 */
--- a/src/lexer/lexer.c
+++ b/src/lexer/lexer.c
@@ -0,0 +1,184 @@
 /* lexr.c: Implementation of lexr.
 * Created: 2026-01-22
 * Author: Aryadev Chavali
 * License: See end of file
 * Commentary: See /include/arl/lexer/lexer.h
 */
 #include <ctype.h>
 #include <string.h>
 #include <arl/lexer/lexer.h>
 #include <arl/lexer/token.h>
 #include <arl/lib/sv.h>
 /// Expected characters in a symbol
 static const char *SYMBOL_CHARS =
    "ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz!#$%&'()*+,-./"
    ":;<=>?@\\^_`{|}~0123456789";
 const char *lex_err_to_string(lex_err_t err)
 {
  switch (err)
  {
  case LEX_ERR_OK:
    return "OK";
  case LEX_ERR_EXPECTED_SPEECH_MARKS:
    return "EXPECTED_SPEECH_MARKS";
  case LEX_ERR_UNKNOWN_CHAR:
    return "UNKNOWN_CHAR";
  default:
    FAIL("Unexpected lex_err_t value: %d\n", err);
  }
 }
 /// Prototypes for streams
 bool stream_eos(lex_stream_t *stream);
 char stream_peek(lex_stream_t *stream);
 void stream_advance(lex_stream_t *stream, u64 size);
 u64 stream_size(lex_stream_t *stream);
 void lex_stream_get_line_col(lex_stream_t *stream, u64 *line, u64 *col)
 {
  assert(stream && line && col && "Expected valid pointers.");
  for (u64 i = 0; i < stream->byte; ++i)
  {
    char c = stream->contents.data[i];
    if (c == '\n')
    {
      *line += 1;
      *col = 0;
    }
    else
    {
      *col += 1;
    }
  }
 }
 /// Prototypes for lexing subroutines
 lex_err_t lex_string(lex_stream_t *stream, token_t *ret);
 lex_err_t lex_symbol(lex_stream_t *stream, token_t *ret);
 lex_err_t lex_stream(token_stream_t *out, lex_stream_t *stream)
 {
  assert(out && stream && "Expected valid pointers");
  while (!stream_eos(stream))
  {
    char cur = stream_peek(stream);
    if (isspace(cur))
    {
      while (isspace(cur) && !stream_eos(stream))
      {
        stream_advance(stream, 1);
        cur = stream_peek(stream);
      }
    }
    else if (cur == '"')
    {
      // we make a copy for lex_string to mess with
      token_t ret    = {0};
      lex_err_t perr = lex_string(stream, &ret);
      if (perr)
        return perr;
      vec_append(&out->vec, &ret, sizeof(ret));
    }
    else if (strchr(SYMBOL_CHARS, cur) && !isdigit(cur))
    {
      // we make a copy for lex_symbol to mess with
      token_t ret    = {0};
      lex_err_t perr = lex_symbol(stream, &ret);
      if (perr)
        return perr;
      vec_append(&out->vec, &ret, sizeof(ret));
    }
    else
    {
      return LEX_ERR_UNKNOWN_CHAR;
    }
  }
  return LEX_ERR_OK;
 }
 lex_err_t lex_string(lex_stream_t *stream, token_t *ret)
 {
  // Increment the cursor just past the first speechmark
  stream_advance(stream, 1);
  sv_t string = sv_chop_left(stream->contents, stream->byte);
  string.size = sv_till(string, "\"");
  // If we're at the edge of the stream, there must not have been any
  // speechmarks.
  if (string.size + stream->byte == stream_size(stream))
    return LEX_ERR_EXPECTED_SPEECH_MARKS;
  // `string` is well defined, package and throw it back.
  *ret = token_string(stream->byte - 1, string);
  stream_advance(stream, string.size + 1);
  return LEX_ERR_OK;
 }
 lex_err_t lex_symbol(lex_stream_t *stream, token_t *ret)
 {
  sv_t symbol = sv_chop_left(stream->contents, stream->byte);
  symbol.size = sv_while(symbol, SYMBOL_CHARS);
  // see if symbol is one of the already known symbols
  static_assert(NUM_TOKEN_KNOWNS == 1, "Expected number of TOKEN_KNOWNs");
  for (token_known_t i = 0; i < NUM_TOKEN_KNOWNS; ++i)
  {
    const char *possible_known = token_known_to_cstr(i);
    if (strlen(possible_known) == symbol.size &&
        strncmp(possible_known, symbol.data, symbol.size) == 0)
    {
      // Found a matching known symbol
      *ret = token_known(stream->byte, i);
      goto end;
    }
  }
  // otherwise, it must be a fresh symbol i.e. user defined
  *ret = token_symbol(stream->byte, symbol);
 end:
  stream_advance(stream, symbol.size);
  return LEX_ERR_OK;
 }
 bool stream_eos(lex_stream_t *stream)
 {
  return stream->byte >= stream->contents.size;
 }
 char stream_peek(lex_stream_t *stream)
 {
  if (stream_eos(stream))
    return '\0';
  else
    return stream->contents.data[stream->byte];
 }
 void stream_advance(lex_stream_t *stream, u64 size)
 {
  if (stream->byte + size >= stream->contents.size)
    stream->byte = stream->contents.size;
  else
    stream->byte += size;
 }
 u64 stream_size(lex_stream_t *stream)
 {
  return stream->contents.size;
 }
 /* Copyright (C) 2026 Aryadev Chavali
 * This program is distributed in the hope that it will be useful, but WITHOUT
 * ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS
 * FOR A PARTICULAR PURPOSE.  See the MIT License for details.
 * You may distribute and modify this code under the terms of the MIT License,
 * which you should have received a copy of along with this program.  If not,
 * please go to <https://opensource.org/license/MIT>.
 */
--- a/src/lexer/token.c
+++ b/src/lexer/token.c
@@ -0,0 +1,115 @@
 /* token.c: Implementation of TOKEN constructor/destructor functions
 * Created: 2026-01-22
 * Author: Aryadev Chavali
 * License: See end of file
 * Commentary: See /include/arl/lexer/token.h.
 */
 #include <arl/lexer/token.h>
 #include <arl/lib/base.h>
 #include <arl/lib/vec.h>
 const char *token_known_to_cstr(token_known_t known)
 {
  switch (known)
  {
  case TOKEN_KNOWN_PUTS:
    return "puts";
  default:
    FAIL("Unexpected TOKEN_KNOWN value: %d\n", known);
  }
 }
 token_t token_known(u64 byte, token_known_t known)
 {
  return (token_t){
      .byte_location = byte,
      .type          = TOKEN_TYPE_KNOWN,
      .as_known      = known,
  };
 }
 token_t token_string(u64 byte, sv_t string)
 {
  return (token_t){
      .byte_location = byte,
      .type          = TOKEN_TYPE_STRING,
      .as_string     = string,
  };
 }
 token_t token_symbol(u64 byte, sv_t symbol)
 {
  return (token_t){
      .byte_location = byte,
      .type          = TOKEN_TYPE_SYMBOL,
      .as_symbol     = symbol,
  };
 }
 void token_print(FILE *fp, token_t *token)
 {
  if (!token)
  {
    fprintf(fp, "NIL");
    return;
  }
  switch (token->type)
  {
  case TOKEN_TYPE_KNOWN:
    fprintf(fp, "KNOWN(%s)", token_known_to_cstr(token->as_known));
    break;
  case TOKEN_TYPE_SYMBOL:
    fprintf(fp, "SYMBOL(" PR_SV ")", SV_FMT(token->as_symbol));
    break;
  case TOKEN_TYPE_STRING:
    fprintf(fp, "STRING(" PR_SV ")", SV_FMT(token->as_string));
    break;
  case NUM_TOKEN_TYPES:
  default:
    FAIL("Unexpected token type: %d\n", token->type);
  }
 }
 void token_stream_print(FILE *fp, token_stream_t *token)
 {
  if (!token)
  {
    fprintf(fp, "{}");
    return;
  }
  fprintf(fp, "{");
  if (token->vec.size == 0)
  {
    fprintf(fp, "}\n");
    return;
  }
  fprintf(fp, "\n");
  for (u64 i = 0; i < token->vec.size / sizeof(token_t); ++i)
  {
    token_t item = VEC_GET(&token->vec, i, token_t);
    fprintf(fp, "\t[%lu]: ", i);
    token_print(fp, &item);
    fprintf(fp, "\n");
  }
  fprintf(fp, "}");
 }
 void token_stream_free(token_stream_t *stream)
 {
  // we can free the vector itself and we're done
  vec_free(&stream->vec);
 }
 /* Copyright (C) 2026 Aryadev Chavali
 * This program is distributed in the hope that it will be useful, but WITHOUT
 * ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS
 * FOR A PARTICULAR PURPOSE.  See the MIT License for details.
 * You may distribute and modify this code under the terms of the MIT License,
 * which you should have received a copy of along with this program.  If not,
 * please go to <https://opensource.org/license/MIT>.
 */
--- a/src/arl/lib/sv.c
+++ b/src/arl/lib/sv.c
@@ -2,7 +2,7 @@
 * Created: 2026-01-22
 * Author: Aryadev Chavali
 * License: See end of file
- * Commentary: See /include/sv.h
+ * Commentary: See /include/arl/lib/sv.h
 */
 #include <string.h>
--- a/src/arl/lib/vec.c
+++ b/src/arl/lib/vec.c
@@ -2,7 +2,7 @@
 * Created: 2026-01-22
 * Author: Aryadev Chavali
 * License: See end of file
- * Commentary:
+ * Commentary: See /include/arl/lib/vec.h
 Taken from prick_vec.h: see https://github.com/oreodave/prick.
 */
@@ -31,6 +31,14 @@ void vec_append_byte(vec_t *vec, u8 byte)
  ++vec->size;
 }
 u8 *vec_pop(vec_t *vec, u64 size)
 {
  if (!vec || vec->size < size)
    return NULL;
  vec->size -= size;
  return (u8 *)vec_data(vec) + vec->size;
 }
 void *vec_data(vec_t *vec)
 {
  if (!vec)
@@ -89,6 +97,14 @@ void vec_free(vec_t *vec)
  memset(vec, 1, sizeof(*vec));
 }
 void vec_reset(vec_t *vec)
 {
  if (!vec)
    return;
  memset(vec_data(vec), 0, vec->capacity);
  vec->size = 0;
 }
 void vec_clone(vec_t *v2, vec_t *v1)
 {
  if (!v1 || !v2)
--- a/src/main.c
+++ b/src/main.c
@@ -0,0 +1,96 @@
 /* main.c:
 * Created: 2026-01-22
 * Author: Aryadev Chavali
 * License: See end of file
 * Commentary:
 */
 #include <assert.h>
 #include <ctype.h>
 #include <stdbool.h>
 #include <stdio.h>
 #include <stdlib.h>
 #include <string.h>
 #include <arl/lexer/lexer.h>
 #include <arl/lexer/token.h>
 #include <arl/lib/base.h>
 #include <arl/lib/sv.h>
 #include <arl/lib/vec.h>
 #include <arl/cli.h>
 int main(int argc, char *argv[])
 {
  int ret        = 0;
  char *filename = "";
  if (argc == 1)
  {
    usage(stderr);
    ret = 1;
    goto end;
  }
  else
  {
    filename = argv[1];
  }
  int read_err  = 0;
  sv_t contents = {0};
  if (strcmp(filename, "--") == 0)
  {
    filename = "stdin";
    read_err = read_pipe(stdin, &contents);
  }
  else
  {
    read_err = read_file(filename, &contents);
  }
  if (read_err)
  {
    LOG_ERR("ERROR: Reading `%s`: ", filename);
    perror("");
    ret = 1;
    goto end;
  }
  LOG("%s => `" PR_SV "`\n", filename, SV_FMT(contents));
  lex_stream_t stream   = {.byte = 0, .contents = contents};
  token_stream_t tokens = {0};
  lex_err_t perr        = lex_stream(&tokens, &stream);
  if (perr)
  {
    u64 line = 1, col = 0;
    lex_stream_get_line_col(&stream, &line, &col);
    LOG_ERR("%s:%lu:%lu: %s\n", filename, line, col, lex_err_to_string(perr));
    ret = 1;
    goto end;
  }
 #if VERBOSE_LOGS
  LOG("Lexed %lu tokens ", tokens.vec.size / sizeof(token_t));
  token_stream_print(stdout, &tokens);
  printf("\n");
 #endif
 end:
  if (contents.data)
    free(contents.data);
  token_stream_free(&tokens);
  return ret;
 }
 /* Copyright (C) 2026 Aryadev Chavali
 * This program is distributed in the hope that it will be useful, but WITHOUT
 * ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS
 * FOR A PARTICULAR PURPOSE.  See the MIT License for details.
 * You may distribute and modify this code under the terms of the MIT License,
 * which you should have received a copy of along with this program.  If not,
 * please go to <https://opensource.org/license/MIT>.
 */
Author	SHA1	Message	Date
Aryadev Chavali	00786d9cb9	Simple arl-mode plugin for Emacs	2026-02-01 19:25:13 +00:00
Aryadev Chavali	b391fe9a74	lexer/token: update tokeniser to recognise puts	2026-01-29 15:49:20 +00:00
Aryadev Chavali	12e82e64d0	examples/hello-world: putstr -> puts	2026-01-29 15:40:30 +00:00
Aryadev Chavali	9b55b9ec32	arl.org: rewrite parser bit	2026-01-29 05:42:16 +00:00
Aryadev Chavali	6545bd1302	dir-locals: make examples by default	2026-01-29 05:19:51 +00:00
Aryadev Chavali	c11b69092d	arl.org: massive updates	2026-01-29 05:19:40 +00:00
Aryadev Chavali	82b96e23d5	Adjusted README	2026-01-29 04:18:07 +00:00
Aryadev Chavali	0fdfbef1de	Makefile: added recipe to run examples	2026-01-29 04:17:53 +00:00
Aryadev Chavali	6f6b747540	lib/vec \| lexer: Clean up comments	2026-01-29 04:13:41 +00:00
Aryadev Chavali	321b06ca0d	Makefile: modules is now dynamically found	2026-01-29 04:12:46 +00:00
Aryadev Chavali	d46ee32775	main: split off reading and usage function to its own unit	2026-01-29 04:12:33 +00:00
Aryadev Chavali	dc96e12145	parser -> lexer That's the real purpose of this module; it's not really generating an AST since ARL's syntax isn't tree like whatsoever. The next stage will be something closer to an AST, in the sense we'll be introducing: - Syntactical analysis - Type Checking	2026-01-29 03:43:04 +00:00
Aryadev Chavali	42ac4f6bbb	lib/vec: vec_reset and vec_pop	2026-01-29 03:23:43 +00:00
Aryadev Chavali	7e3dd2679d	parser/ast: Remove union name in ast_node_t Destructures the names of the union into the namespace of the structure itself; bit easier to use IMO.	2026-01-29 03:15:51 +00:00
Aryadev Chavali	8764b65aff	parser: remove NIL as a known value Not really needed or necessary.	2026-01-29 03:15:12 +00:00
Aryadev Chavali	2e24d3a618	parser/parser: slight tidy up in parse_string/parse_symbol	2026-01-29 03:14:36 +00:00
Aryadev Chavali	645ea5a04e	main: Fix trivial pipes error in read_pipe, better comments overall.	2026-01-28 09:49:42 +00:00
Aryadev Chavali	9d8a0c1e22	examples: Remove newline from end of hello-world	2026-01-28 09:07:22 +00:00
Aryadev Chavali	65e4dc0b29	main: command line arguments for filename, and allow stdin parsing - Now take a single command line argument for the filename to read and compile. - If filename is "--", then read stdin until EOF using a different read handler (using ~vec_t~ along with buffered reading).	2026-01-28 09:06:00 +00:00
Aryadev Chavali	afc0f9c034	main: deal with file read errors more appropriately, unify error interface - ~read_file~ now returns an error code and takes the ~sv_t~ (which contains the file contents) by pointer. We can now deal with the error in ~main~ directly. - Make the return code of ~main~ a variable which error branches can set. Unify the error branch and normal branch code. Pattern for error handling is now unified.	2026-01-28 09:02:46 +00:00
Aryadev Chavali	84996130b7	base: Added some extra logging macros LOG, LOG_ERR. LOG_ERR will always compile to a /stderr/ print. LOG, on the other hand, may not actually do anything if VERBOSE_LOGS is not 1. By default it is 0, so it must be defined when compiling to enable - hence the adjustment of the Makefile.	2026-01-28 08:59:29 +00:00
Aryadev Chavali	0beab4e11d	*: small changes	2026-01-28 07:35:45 +00:00
Aryadev Chavali	2dc0de78b5	parser/ast\|parser: PRIMITIVE -> KNOWN Primitive is a bit of a word conflict here; primitives are what we'd expect our callables to be named eventually. However, these parser "primitives" are just well known symbols that we want to optimise the representation of for later stages. Thus, KNOWN is a bit better for signalling intent then PRIMITIVE is.	2026-01-28 07:35:39 +00:00
Aryadev Chavali	947e05cdc8	parser/ast: update println -> putstr	2026-01-28 07:13:41 +00:00
Aryadev Chavali	e5152aac67	examples: Adjusted hello world example for new ideas in arl.org	2026-01-28 07:03:19 +00:00
Aryadev Chavali	42447e5bd8	arl.org: Lots of thinking	2026-01-28 07:03:12 +00:00
Aryadev Chavali	85f5502681	Makefile: Added recipe for generating compile_commands.json	2026-01-24 14:17:40 +00:00
Aryadev Chavali	fda4a63732	main: Fix comments referencing main.cpp	2026-01-24 03:06:37 +00:00
Aryadev Chavali	64fe3fc112	lib\|parser: Fix references to headers	2026-01-24 03:06:19 +00:00
Aryadev Chavali	76872179f9	*: Split off headers into their own folder Main reason is so we don't have that stupid arl prefix directory in our source code. Now our source code is flat, and we can still reference headers by linking from root.	2026-01-24 03:02:54 +00:00
		`@@ -1 +1 @@`
			`"Hello, world!" println`				`"Hello, world!\n" puts`