Original Link: https://musicalai.substack.com/p/musing-2-ai-powered-lofi-bollywood
Musing #2: AI-powered Lofi Bollywood Music?
Will AI help or hurt the creative process?
July 30, 2023
Lofi Music Generation - Can AI really make music better? I decided to combine my passions for music and ML to explore this in 4 steps. You can check out the results below, peruse my video of the experiment flow, and take a look at the Google Colab for code details and to try it yourself!
Loom Video Link (Attached Video Below)
My goal was to create a lofi Bollywood sample, so I started where you might expect… with the data.
- _Data Gathering & Finetuning_: I gathered ~150 lofi Bollywood tracks, yielding ~2.5k 30-second clips with simple descriptions, to finetune the decoder of the MusicGen model for 5 epochs. The core idea behind MusicGen (vs. previous generators) is that it added text and melody conditioning. (Sketches of the dataset prep and of the prompting comparison follow the audio samples below.)
The original model was trained on 20k hours of audio across all genres, but I'm doubtful lofi Bollywood played much of a role (see the Original Model link below). The original 300M-param model did a great job creating Indian instrumentals like the sitar and tabla, but did not create a lofi vibe. To help it along, I collected my own dataset of 20.8 hours (~1/1000x of the original) and finetuned the 300M-param model (chosen due to limited GPU compute). When prompting both the original and the finetuned model with "indian lofi beat with eastern instrumentals", I did see a noticeable improvement from the finetuned one in terms of outputting a lofi vibe.
Original Model (AI): https://voca.ro/1dRCve3mCCg1
Finetuned Model (AI): https://voca.ro/1fbsdfT1dObF
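For reference, here is a minimal sketch of the dataset prep step: slicing each full track into 30-second clips and writing a short text description next to each clip. The folder names and the .wav/.txt pairing convention are my assumptions about what the finetuning script expects, not a copy of my exact pipeline.

```python
# Hedged sketch of dataset prep: slice each track into 30-second clips and
# pair every clip with a simple text description. Paths and the .wav/.txt
# pairing convention are assumptions, not the exact pipeline used here.
from pathlib import Path
import torchaudio

SRC = Path('lofi_bollywood_tracks')  # hypothetical folder of full tracks
DST = Path('dataset')
DST.mkdir(exist_ok=True)
CLIP_SECS = 30

for track in sorted(SRC.glob('*.wav')):
    wav, sr = torchaudio.load(str(track))
    clip_len = CLIP_SECS * sr
    for i in range(wav.shape[-1] // clip_len):
        clip = wav[..., i * clip_len:(i + 1) * clip_len]
        stem = f'{track.stem}_{i:03d}'
        torchaudio.save(str(DST / f'{stem}.wav'), clip, sr)
        # One short description per clip, since the trainer conditions on text
        (DST / f'{stem}.txt').write_text('indian lofi beat with eastern instrumentals')
```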
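And here is roughly how the two samples above were prompted, using audiocraft's MusicGen API. The finetuned checkpoint filename is hypothetical (the actual finetuning used the musicgen_trainer repo linked in the references below).

```python
# Minimal sketch of the prompting comparison with audiocraft's MusicGen.
# The finetuned weights path is hypothetical (an output of musicgen_trainer).
import torch
from audiocraft.models import MusicGen
from audiocraft.data.audio import audio_write

model = MusicGen.get_pretrained('small')  # the 300M-param checkpoint
model.set_generation_params(duration=20)  # 20-second samples, as above

prompt = ['indian lofi beat with eastern instrumentals']
wav = model.generate(prompt)  # -> tensor of shape (batch, channels, samples)
audio_write('original_model', wav[0].cpu(), model.sample_rate, strategy='loudness')

# Swap in the finetuned decoder (language model) weights and regenerate
model.lm.load_state_dict(torch.load('lofi_bollywood_lm.pt'))
wav = model.generate(prompt)
audio_write('finetuned_model', wav[0].cpu(), model.sample_rate, strategy='loudness')
```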
- _Manual Intervention_: To create a somewhat fair comparison, I aimed to use the AI the way a human collaborator would and focus on music continuation: give it a bit of music and see how it extends it. But first… I made a few modifications in Ableton, especially around the speed, the crackle, and adding Indian instruments like a sitar and percussion. (A rough programmatic analog of these edits follows the clip below.)
My Edited Clip (AI + Human): https://voca.ro/1fRBVK6utNT3
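The edits themselves were done by hand in Ableton, but for readers without a DAW, here is a rough programmatic stand-in for two of them: a turntable-style slowdown and vinyl-style crackle. This is purely illustrative; the filename and parameters are made up.

```python
# Illustrative analog of two of the Ableton edits: a ~5% turntable-style
# slowdown (pitch drops with tempo) and sparse vinyl-style crackle.
import torch
import torchaudio

wav, sr = torchaudio.load('finetuned_sample.wav')  # hypothetical filename

# Upsampling by ~5% and playing back at the original rate slows the audio
# down and lowers the pitch, like slowing a record
slow = torchaudio.functional.resample(wav, orig_freq=sr, new_freq=int(sr * 1.05))

# Crackle: low-amplitude random impulses on ~0.1% of samples
mask = torch.rand_like(slow) < 0.001
crackle = torch.where(mask, 0.05 * torch.randn_like(slow), torch.zeros_like(slow))

lofi = (slow + crackle).clamp(-1.0, 1.0)
torchaudio.save('edited_clip.wav', lofi, sr)
```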
- _Music Continuation_: Prompted the finetuned model with a 10-second clip of my edited version to see what the last 10 seconds might sound like (a continuation sketch follows the sample below). Needless to say… it left much to be desired.
Audio-only Continuation (AI): https://voca.ro/1eciGnvUcyle
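Here is a minimal sketch of that step using audiocraft's continuation API; the clip filename is hypothetical, and the finetuned weights would be loaded as in the earlier sketch.

```python
# Hedged sketch of audio-only continuation: feed the first 10 seconds of the
# edited clip and let the model generate the rest (filename is hypothetical).
import torchaudio
from audiocraft.models import MusicGen
from audiocraft.data.audio import audio_write

model = MusicGen.get_pretrained('small')  # load finetuned weights as above
model.set_generation_params(duration=20)  # 20s total, prompt included

clip, sr = torchaudio.load('edited_clip.wav')
prompt = clip[..., :10 * sr]  # first 10 seconds as the seed

# The output contains the prompt followed by the generated continuation
wav = model.generate_continuation(prompt, prompt_sample_rate=sr)
audio_write('continuation', wav[0].cpu(), model.sample_rate, strategy='loudness')
```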
- _Music Continuation (w/ text)_: Prompted the finetuned model with a 5-second clip of my edited version and the text prompt "chill relaxed indian lofi with soft vocals and a tabla" (see the text-conditioned variant sketch below). After the first 5 seconds it adds an interesting twist to the song.
Audio + Text Continuation (AI): https://voca.ro/170HqVp3eHDW
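The text-conditioned variant differs only in passing a description alongside the audio prompt:

```python
# Variant of the sketch above: a text description steers the continuation.
import torchaudio
from audiocraft.models import MusicGen
from audiocraft.data.audio import audio_write

model = MusicGen.get_pretrained('small')  # finetuned weights loaded as above
model.set_generation_params(duration=20)

clip, sr = torchaudio.load('edited_clip.wav')
prompt = clip[..., :5 * sr]  # only the first 5 seconds this time
wav = model.generate_continuation(
    prompt, prompt_sample_rate=sr,
    descriptions=['chill relaxed indian lofi with soft vocals and a tabla'])
audio_write('continuation_text', wav[0].cpu(), model.sample_rate, strategy='loudness')
```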
The human ear is the best discriminator of consonance vs. dissonance, and well… the music continuation wasn't great without text prompting; even with it, the result was interesting, but the quality could still be improved.
MusicGen was helpful in seeding an initial musical idea, extending that idea, transitioning to a new beat, and ending a song (albeit not my initial intention). However, for most musicians the work needed to get there would not be worth the effort, so a few things could be improved:
- Reducing friction is key to making this a viable sample generator during the creative process.
- For longer-form generation, stringing outputs together is likely the most computationally viable way to make generative AI a useful tool (see the sketch after this list).
- Audio quality is still not great from many of these outputs; for production-level audio, a sound engineering pipeline will need to accompany them.
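As a sketch of what "stringing outputs together" could look like: repeatedly re-seed the model with the tail of what it has produced so far. This is my own illustration of the idea, not code from the experiment.

```python
# Hedged sketch of longer-form generation by chaining continuations:
# each pass re-seeds the model with the last 5 seconds and appends ~15s.
import torch
from audiocraft.models import MusicGen

model = MusicGen.get_pretrained('small')
model.set_generation_params(duration=20)  # 5s seed + ~15s of new audio per pass

sr = model.sample_rate
song = model.generate(['chill relaxed indian lofi with soft vocals and a tabla'])

for _ in range(3):  # extend the piece three times
    tail = song[..., -5 * sr:]  # seed with the final 5 seconds
    cont = model.generate_continuation(tail, prompt_sample_rate=sr)
    # The output repeats the seed, so drop it from the running song first
    song = torch.cat([song[..., :-5 * sr], cont], dim=-1)
```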
This was a very simple experiment, but musicians rarely use language to describe music in the development process; instead we "jam" with other musicians, going back and forth with samples. Given all of this, I have some ideas for where to go from here.
Next Steps
- Share code and notebooks from this experiment to see what others get
- Test other transformer-based music encoder-decoder architectures (e.g. MusicLM) and diffusion models (Riffusion)
- Extend the lofi dataset and acquire more GPU resources to train deeper models.
- String together models to create an organic "jam" session and longer samples (especially using language as an intermediate representation)
I have no idea where this is going, but I'm excited to see where it takes me; if nothing else, I'll hopefully create my own "jam buddy" to expand my creative process.
References and Credits:
- MusicGen HF: https://huggingface.co/spaces/facebook/MusicGen
- MusicGen Github: https://github.com/facebookresearch/audiocraft
- MusicGen Trainer (c/o neverix): https://github.com/neverix/musicgen_trainer/tree/main
Stay tuned for more…