ovm - ARCHIVED - A stack based virtual machine to act as a target for other programming languages

Age	Commit message (Collapse)	Author
2024-04-14	Implemented tokenise_buffer	Aryadev Chavali
	Note that this is basically the same as the previous version, excluding the fact that it uses C++ idioms more and does a bit better in error checking.
2024-04-14	Implemented tokenise_literal_string	Aryadev Chavali
	One thing I've realised is that even methods such as this require error tracking. I won't implement it in the tokenise method as it's not related to consuming the string per se but instead in the main method.
2024-04-14	Implemented tokenise_literal_char (tokenise_char_literal)	Aryadev Chavali
	I made the escape sequence parsing occur here instead of leaving it to the main tokenise_buffer function as I think it's better suited here.
2024-04-14	Implemented tokenise_literal_hex	Aryadev Chavali
	Note the overall size of this function in comparison to the C version, as well as its clarity. Of course, it is doing allocations in the background through std::string which requires more profiling if I want to make this super efficient™ but honestly the assembler just needs to work, whereas the runtime needs to be fast.
2024-04-14	Implemented tokenise_literal_number (tokenise_number)	Aryadev Chavali

2024-04-14	Started implementing lexer in lexer.cpp	Aryadev Chavali
	The implementation for tokenise_symbol is already a lot nicer to look at and add to due to string/string_view operator overloading of ==. Furthermore, error handling through pair<> instead of making some custom structure which essentially does the same thing is already making me happy for this rewrite.
2024-04-14	Wrote a new lexer API in C++	Aryadev Chavali
	Essentially a refactor of the C formed lexer into C++ style. I can already see some benefits from doing this, in particular speed of prototyping.
2024-04-14	Added C++ dir locals	Aryadev Chavali

2024-04-14	Created custom functions to convert (h)words to and from bytecode format	Aryadev Chavali
	Instead of using endian.h that is not portable AND doesn't work with C++, I'll just write my own using a forced union based type punning trick. I've decided to use little endian for the format as well: it seems to be used by most desktop computers so it should make these functions faster to run for most CPUs.
2024-04-14	Merge branch 'master' into asm-rewrite-cpp	Aryadev Chavali

2024-04-14	Start writing assembler in C++	Aryadev Chavali
	Best language to use as it's already compatible with the headers I'm using and can pretty neatly enter the build system while also using the functions I've built for converting to and from bytecode!
2024-04-14	Documented lib/darr.h	Aryadev Chavali

2024-04-14	Moved struct definitions lib/inst.h -> lib/prog.h	Aryadev Chavali
	This means if I write the new assembler in another language I only need to FFI this header rather than all the functions as well which may not be as useful.
2024-04-14	Documented lib/darr.h	Aryadev Chavali

2024-04-14	Moved struct definitions lib/inst.h -> lib/prog.h	Aryadev Chavali
	This means if I write the new assembler in another language I only need to FFI this header rather than all the functions as well which may not be as useful.
2024-04-14	Added todo to rewrite assembler in a different language	Aryadev Chavali

2024-04-14	Finished todo on importing another file	Aryadev Chavali

2024-04-14	fix! loops in preprocess_use_blocks iterate to the wrong bound0.0.1	Aryadev Chavali
	A token_stream being constructed on the spot has different used/available properties to a fully constructed one: a fully constructed token stream uses available to hold the total number of tokens and used as an internal iterator, while one that is still being constructed uses the semantics of a standard darr. Furthermore, some loops didn't divide by ~sizeof(token_t)~ which lead to iteration over bound errors.
2024-04-12	Fix problems with running programs due to mismatched endian	Aryadev Chavali
	Basically ensure we're converting to big endian when writing bytecode and converting from big endian when reading bytecode.
2024-04-12	Fixing build problems due to endian.h	Aryadev Chavali
	Have to define _DEFAULT_SOURCE before you can use the endian conversion functions. As most standard library headers use features.h, and _DEFAULT_SOURCE must be defined before features.h is included, we have to include base.h before other headers.
2024-04-09	Reworking todos on library linking	Aryadev Chavali

2024-04-09	Some rewording of spec.org	Aryadev Chavali

2024-04-09	Added some TODOs to lib/inst.c to enforce endian	Aryadev Chavali

2024-04-09	Mid-work through documenting darr.h	Aryadev Chavali

2024-04-09	Done TODO: Comment coverage > lib > base.h	Aryadev Chavali
	Pretty simple
2024-04-09	Fixed code in vm_pop_hword DWORD -> DHWORD	Aryadev Chavali
	Though practically this would work, as the storage for the half word is not limited in any way, nevertheless it isn't syntactically right and it's better to fix now.
2024-04-09	Completed TODO: Rigid Endian	Aryadev Chavali
	Just used the endian.h functions to convert host endian to and from big endian.
2024-04-09	Added todo to force an endian convention	Aryadev Chavali
	I've flip flopped a bit on this but I believe the virtual machine bytecode format must have a convention on endianness. This is because of the issue stated in the TODO which may very well happen.
2024-04-08	Added better documentation to TODO list	Aryadev Chavali

2024-04-07	Changed limit for examples/factorial.asm	Aryadev Chavali
	Did some analysis and found that 21! takes above 64 bit integers to store hence set the limit to 20 instead.
2023-11-29	Use a limit on $I rather than on $B for examples/fib.asm	Aryadev Chavali

2023-11-29	Fixed issues with getting and setting words for heap pages	Aryadev Chavali
	Because I was using the hword macros instead of word macros, this causes truncation of bytes when I didn't want it.
2023-11-29	Fixed logs in vm/runtime	Aryadev Chavali
	Just changing some messages and the format of heap printing
2023-11-29	Cleaned up logs in assembler/parser	Aryadev Chavali

2023-11-29	Easier to read documentation in examples	Aryadev Chavali

2023-11-29	Fixed incorrect free of tokens in error for preprocess_use_blocks	Aryadev Chavali
	Also error now points to the correct place in the file.
2023-11-29	Report some stats of the actual program when working	Aryadev Chavali

2023-11-29	Refactored preprocessor to preprocess_(use\|macro)_blocks and process_presults	Aryadev Chavali
	We have distinct functions for the use blocks and the macro blocks, which each generate wholesale new token streams via `token_copy` so we don't run into weird errors around ownership of the internal strings of each token. Furthermore, process_presults now uses the stream index in each presult to report errors when stuff goes wrong.
2023-11-29	Refactored presult_t to include a stream pointer	Aryadev Chavali
	So when a presult_t is constructed it holds an index to where it was constructed in terms of the token stream. This will be useful when implementing an error checker in the preprocessing or result parsing stages.
2023-11-29	Added parse errors for %USE calls	Aryadev Chavali
	So %USE <STRING> is the expected call pattern, so there's an error if there isn't a string after %USE. The other two errors are file I/O errors i.e. nonexistent files or errors in parsing the other file. We don't report specifics about the other file, that should be up to the user to check themselves.
2023-11-29	Fixed tokenise_string_literal	Aryadev Chavali
	Forgot to increment buffer->used and memcpy call was just incorrect.
2023-11-29	Added function to copy tokens	Aryadev Chavali
	This essentially just copies the internal string of the token into a new buffer.
2023-11-29	Added TOKEN_PP_USE to lexer with implementation	Aryadev Chavali

2023-11-29	Moved preprocessor>Constants to Completed and started work on %USE	Aryadev Chavali

2023-11-29	Added todo for preprocessor "%MACRO"	Aryadev Chavali
	This is different to "%CONST" in that it can take token parameters and use them. This allows the construction of user code at compile time, which can be very useful for a variety of use cases.
2023-11-29	Added todo for preprocessor "%USE" blocks	Aryadev Chavali
	Essentially importing another file literally into the file. This would happen before parse results are gathered, similar to how "%CONST" is implemented currently.
2023-11-29	Cleaned up todos standard library a bit more	Aryadev Chavali

2023-11-11	Added string literals in tokeniser	Aryadev Chavali
	Doesn't do much, invalid for most operations.
2023-11-09	Use constants in examples where possible	Aryadev Chavali
	Stuff like numeric limits can be codified in constants which act self documenting.
2023-11-09	Mark off constants as done in TODO.org	Aryadev Chavali