Use String, Dict, and read_bytes to shorten and simplify #91

mikowals · 2024-04-22T03:49:07Z

This is based on the current nightly branch (mojo 2024.4.161). It is a demo of some cleanups that are possible now that Mojo and its stdlib have added a lot of functionality that was missing when this was originally released. There could probably also be another round to remove TensorSlice and just use List[TensorF32] for each layer of weights.

The main changes are:

Replace PointerString and PointerStrings with String and List[String] respectively.
Use a Dict to look up token indices. This allows removing the quicksort and binary search code.
Small changes like using Tensor's own SIMD operations and argmax.
Use read_bytes to handle pointers without copying. That means FileBuf is no longer used.
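
To illustrate the Dict change, here is a minimal sketch in Python rather than Mojo, so it is not the code from this PR. The toy vocabulary and the names `lookup_sorted`, `lookup_dict`, and `vocab_to_index` are hypothetical. It contrasts the old shape (sort the vocabulary, then binary search) with a hash-map lookup that needs no sort pass.

```python
from bisect import bisect_left

# Hypothetical toy vocabulary; the real one is loaded from tokenizer.bin.
vocab = ["<unk>", "hello", " world", "a", "b"]

# Old shape: sort (token, index) pairs once, then binary search per lookup.
sorted_pairs = sorted((tok, i) for i, tok in enumerate(vocab))
sorted_keys = [tok for tok, _ in sorted_pairs]


def lookup_sorted(token):
    """Binary search over the sorted vocabulary; -1 if the token is absent."""
    pos = bisect_left(sorted_keys, token)
    if pos < len(sorted_keys) and sorted_keys[pos] == token:
        return sorted_pairs[pos][1]
    return -1


# New shape: build a hash map once; no sort and no search code to maintain.
vocab_to_index = {tok: i for i, tok in enumerate(vocab)}


def lookup_dict(token):
    """Hash-map lookup; -1 if the token is absent."""
    return vocab_to_index.get(token, -1)
```

Both functions return the same index for every token, so the swap should not change tokenizer output, only the amount of supporting code.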

I am not sure this is ready to merge, mostly because of the handling of special bytes in wrap and the old print function. I tried to preserve the functionality but haven't tested extensively. Ideally we could get proper handling from String and, if not, fix it in stdlib.

Also, I think the stdlib is going to shift to List[UInt8] for all bytes representations, including in String. So this change could also wait until after that has happened and is incorporated.

I didn't mess with Llamatune since this goes across Mojo versions, but locally there was no change in tokens/sec. It is probably loading faster and more memory-efficiently since this avoids the vocab sort and no longer reads the entire tokenizer.bin.

toffaletti · 2024-05-17T23:13:58Z

llama2.mojo

-                right = mid - 1
+        var index = self.map_vocab_to_index.find(token)
+        if index:
+            return index.value()


return index.or_else(-1)
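
The suggestion collapses the explicit presence check into a single call that supplies a default. A rough Python analogue follows; it is a sketch and not the Mojo code. The names and data are hypothetical, and it assumes the original function falls through to return -1 when the token is not found, which the suggested `-1` default implies.

```python
from typing import Optional

# Hypothetical mapping standing in for map_vocab_to_index.
vocab_to_index = {"<unk>": 0, "hello": 1, " world": 2}


def find(token: str) -> Optional[int]:
    """Analogue of a find that yields an optional index."""
    return vocab_to_index.get(token)


def lookup_explicit(token: str) -> int:
    """Check for presence, unwrap, otherwise fall through to -1."""
    index = find(token)
    # Compare against None explicitly: in Python, index 0 is falsy.
    if index is not None:
        return index
    return -1


def lookup_default(token: str) -> int:
    """Single call with a default, mirroring the or_else(-1) suggestion."""
    return vocab_to_index.get(token, -1)
```

The two forms agree for present and absent tokens alike; the second simply leaves less room for a forgotten fallback branch.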

mikowals added 3 commits April 22, 2024 08:19

remove .to_int() and DTypePointer casts

44f76b8

PointerString -> String

feadc98

remove FileBuf

c09de0c

mikowals mentioned this pull request Apr 24, 2024

str_concat memory leak #92

Open

toffaletti reviewed May 17, 2024


Fix optional value retrieve

6cb1a3b

tairov marked this pull request as ready for review May 21, 2024 10:33

Supported version -> 24.3

625a965

tairov merged commit f41dde4 into tairov:master May 21, 2024
