25% of Reddit comments are ChatGPT trash, if not worse. It used to be an excellent open-source intelligence (OSINT) tool, but now it’s just a bunch of fake-supportive and/or politically biased bots
I will miss Reddit’s extremely niche communities, but I believe Lemmy has reached the inflection point where it can eventually build up the same level of niche communities
Thanks for the feedback! I also asked a similar question on the AI Stack Exchange and got some helpful answers there
It was a great project for brushing up on seq2seq modeling, but I decided to shelve it since someone released a polished website doing the same thing.
The idea was that chords are the vocabulary of music composition; measures are like sentences (sequences of chords), and progressions are like paragraphs (sequences of measures)
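To make that concrete, here’s a minimal sketch of the chord-as-token idea (my own illustration, not code from the repo — the token names and chord symbols are hypothetical):

```python
# Chord symbols act as the vocabulary; a progression is a sequence of
# chord tokens wrapped in BOS/EOS, just like a sentence in seq2seq work.
BOS, EOS, PAD = "<bos>", "<eos>", "<pad>"

def build_vocab(progressions):
    """Map every chord symbol (plus special tokens) to an integer id."""
    symbols = sorted({chord for prog in progressions for chord in prog})
    return {tok: i for i, tok in enumerate([PAD, BOS, EOS] + symbols)}

def encode(progression, vocab):
    """Wrap a chord sequence in BOS/EOS and convert to ids."""
    return [vocab[BOS]] + [vocab[c] for c in progression] + [vocab[EOS]]

# Two toy ii-V-I style progressions:
progressions = [["Cmaj7", "Am7", "Dm7", "G7"],
                ["Dm7", "G7", "Cmaj7", "Cmaj7"]]
vocab = build_vocab(progressions)
ids = encode(progressions[0], vocab)
```

The point is just how tiny the vocabulary ends up being compared to a text tokenizer: a handful of special tokens plus however many distinct chord symbols appear in the corpus.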
I think it’s a great project because the limited vocab size and max sequence length are much smaller than what’s typical for transformers applied to LLM tasks, like digesting novels. So on consumer-grade hardware (12 GB VRAM) it’s feasible to train a couple of different model architectures in tandem
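A back-of-envelope parameter count shows why this fits easily on a 12 GB card (the sizes below are my own illustrative assumptions, not numbers from the project):

```python
# Rough transformer parameter count: embedding table plus per-layer
# attention and feed-forward weights (ignoring norms, biases, heads).
def transformer_params(vocab_size, d_model, n_layers, d_ff):
    embed = vocab_size * d_model
    attn = 4 * d_model * d_model   # Q, K, V, and output projections
    ffn = 2 * d_model * d_ff       # two feed-forward linear layers
    return embed + n_layers * (attn + ffn)

# e.g. ~500 chord symbols, d_model=256, 6 layers, d_ff=1024:
n = transformer_params(500, 256, 6, 1024)  # ~4.8M parameters
```

At fp32 that’s on the order of tens of megabytes of weights, so even with optimizer state and activations you could train several such models side by side in 12 GB.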
Additionally, nothing sounds inherently bad in music composition; it’s up to the musician to find a creative way to make it sound good. So even if the model is poorly trained, as long as it doesn’t output EOS immediately after BOS and the sequences are unique enough, it’s hard to get output that can’t be made to work.
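The decoding loop that property depends on is simple; here’s a toy version with a stand-in for the trained model (everything here is illustrative — the real project presumably samples from a trained seq2seq transformer):

```python
import random

def generate(step_fn, bos, eos, max_len=32):
    """Autoregressively extend the sequence until EOS or max_len."""
    seq = [bos]
    while len(seq) < max_len:
        nxt = step_fn(seq)
        if nxt == eos:  # degenerate case: EOS right after BOS yields nothing
            break
        seq.append(nxt)
    return seq[1:]  # drop BOS

# Stand-in for a model: random chords, then EOS after 8 tokens.
def toy_step(seq):
    if len(seq) > 8:
        return "<eos>"
    return random.choice(["Cmaj7", "Am7", "Dm7", "G7"])

chords = generate(toy_step, "<bos>", "<eos>")
```

As long as `step_fn` doesn’t return EOS on the very first call, you always get some chord sequence back to hand to a musician.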
It’s also fairly easy to gather data from a site like iRealPro
The repo is still disorganized, but if you’re curious, the main script is scrape.py
https://github.com/Yanall-Boutros/pyRealFakeProducer