/co/ - Comics & Cartoons

Where cartoons and comics collide!


15.ai Anonymous 01/09/2021 (Sat) 20:47:40 No. 8738
>15.-what?
Basically some dev decided to make a program to mimic popular voices; just feed it enough of one voice and it'll slowly learn how to mimic it. It's mostly being used by a bunch of horsefuckers after their show ended, which explains why there are so many MLP voices, but it's been getting popular as the dev adds more non-horse shit. It recently came back online after the dev did some work making the more popular voices sound even better, while adding around 100 more voices. At certain points 15.ai is a flat-out dead ringer and it's only going to get better from here. At this rate the typical "voice actor" may go the way of the dinosaur.
(41.22 KB 380x338 not_twilight_sparkle.jpg)
>Special shoutouts go to cuckchannel's /mlp/ and its anons who have collectively spent hundreds of hours collecting, cleaning, and organizing clips of dialogue taken from the show My Little Pony: Friendship Is Magic. Thanks to everyone else who helped curate datasets for their favorite characters. Further thanks to Cookie for entertaining a couple illuminating discussions that contributed to the latest iteration of the model, and for proposing the idea of using DeepMoji embeddings to aid in context-based emotive generations; thanks to persona nerd for suggesting using a GAN vocoder as an alternative, leading to a significant improvement in audio quality. Finally, thanks to everyone with whom I've corresponded with regarding related issues within the past two months. (And as always, thanks to /v/ for the excellent content.) I don't remember any anon thanking an imageboard while working with machine learning tech, let alone cuckchan; it's usually disowned. Also, I've yet to see anything similar that relies on seiyuu clips in the animu side. Perhaps it's the way speech synthesis works for the Japs?
>>8740 Kinda, they have the vocaloid stuff which is pretty advanced but English Miku sounds like shit. And they have no fair-use laws so copyright will bite them in the ass.
(187.35 KB 500x448 spongebob.webm)
if you can't get the inflection right, try adding a second sentence with the pipe symbol (it's that vertical line above backslash) for altered deepmoji analysis (I cannot believe I'm unironically using the word deepmoji). to wit, I got the inflection on webm related by adding "|Fuck You!" to the original line
>>8738
every time I work with Carl the voice always sounds like they missed the last 1/8 second of each word. He talks way too fast. Scout too
>>8743 What are you trying to make him say?
>>8743
>if you can't get the inflection right
Have you tried using other words like "buttox" or "behind"?
(553.13 KB 640x360 EQXC1AfgmTyhL9ub.mp4)
There's been some good ones floating around.
Oh yeah I tried it but it's shit, it doesn't let you input your own data. There are only so many comedic situations you can put Carl in.
(83.50 KB 477x559 cakebutt.jpg)
>>8744
various memetic expressions
>>8746
the way the analyzer works with the second sentence is it uses it for deepmoji analysis and ignores the first sentence. just changing it to buttocks wouldn't provide the analyzer with an emotional impact
if you're suggesting changing the original sentence itself, surely you've at least seen pic related
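A minimal sketch of the pipe trick described above, assuming the site treats the text before the first pipe as the line to speak and the text after it as the sentence used for emotion analysis. The function name `split_emotional_context` and the example strings are hypothetical; 15.ai's source is not public, so this only illustrates the input convention and is not the site's actual code.

```python
def split_emotional_context(raw: str) -> tuple[str, str]:
    """Split 'spoken|context' input into (spoken, context).

    Assumption taken from the posts above: text before the first pipe is
    what gets voiced, text after it is what the emotion analyzer reads.
    With no pipe (or an empty context), the spoken text is its own context.
    """
    spoken, _, context = raw.partition("|")
    spoken = spoken.strip()
    context = context.strip() or spoken
    return spoken, context


if __name__ == "__main__":
    # The voiced line stays the same; only the analyzed sentence changes.
    print(split_emotional_context("Hello there.|I am furious!"))
```

Swapping a word inside the spoken half changes what is said, while changing the half after the pipe only changes what the analyzer sees, which matches the distinction the post draws.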
>>8759 But Carl works in any situation.
>>8760 I mean if it helps smooth out how the Sponge voice says it while keeping it funny, then it couldn't be too bad. The sharp "butt" sound really breaks up the flow.
>>8753 This is great. Got any more to share? By the way, considering Rick May is dead, how good is the synthesis for soldier?
(449.60 KB 480x270 e4LX_RJNSiTibYAF.mp4)
>>8766 I've been getting them from twitter.
>422 server error
>422 server error
>422 server error
>422 server error
this is getting tiresome
>>8767
>twitter
Well at least you're honest.
>>8780 It was either that or going to discord, tiktok or cuckchannel.
(7.11 MB 1024x580 My Little WMAF.webm)
(726.28 KB 1920x1080 Spongebob WMAF.webm)
(661.54 KB 1920x1080 white Male Asian Female Force.webm)
(1.61 MB 1920x1080 Steven WMAFverse.webm)
(1.24 MB 1920x1080 Equestrian WMAF.webm)
>>8781 Or you could have made your own
>>8763 the sharp "butt" was supposed to express shock. As sentences get longer they tend to lose energy without finagling it
>>8784 Steven Universe one needs the star iris zoom in at the end instead of a fade to black
>>8740 Japs had a meme on NicoNico for years where they would sentence mix the dialog of Broly and Paragus from the first Broly movie, all because they said every letter of Japanese alphabet at least once, the people making these videos could essentially make them say anything they wanted. Essentially making Japanese Youtube Poops.
>>8787 That and Touhou cookie along with the Homo men edits.
>>8784 True but that would require editing.
>like oldish PC game I played as a kid
>it's so obscure it's not even on several lists of all video games (GameFAQs, MobyGames, etc.) despite being commercially released and its two prequels showing up fine
>filled with speech since it was a CD-ROM game from when developers wanted to show how much data they could throw onto one
>almost all the voices happen to be in unobfuscated .aif that's already mostly organized and clear of anything but the snippets of speech
>submissions are open and want audio submitted in pretty much exactly the way it already is
I'm quite tempted to spend a few hours transcribing this game's dialog just so I can make one of the characters say funny things that will only amuse me.
>>8823 Give it a shot, there's no real barrier on who's allowed in or not. Just whether the reference audio is well prepared.
Of all the work people are doing to upload voices, the focus should honestly go to characters that'll never be heard again.
>>8824 OK, I've got over a third of it done. Should provide 7+ minutes of dialog when done, and I think even more could be obtained from the parts that aren't so easily extracted and the previous game the character was in.
I couldn't figure out how to add audio tracks
>>9009 I think you need to send it directly to the developer.
>>9009 Yes. Read the contribute section: you have to transcribe all the audio and send a zip of the transcript plus the audio files to the dev, with some requirements. The contact is in the about.
>>8845 Finished. It's ultimately around 10 mins in length (I know of no way to fully tally the exact length, but an average length of 2 seconds times the number of files gives me over 10 mins)
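The estimate above (average clip length times file count) can be replaced by an exact tally. A minimal sketch, assuming the clips have first been converted from .aif to WAV, since Python's standard `wave` module only reads WAV; the function name `total_seconds` and the `clips` folder are hypothetical.

```python
import wave
from pathlib import Path


def total_seconds(folder: str) -> float:
    """Sum the playing time of every .wav file directly inside folder."""
    total = 0.0
    for path in sorted(Path(folder).glob("*.wav")):
        with wave.open(str(path), "rb") as clip:
            # Duration of one clip = number of frames / frames per second.
            total += clip.getnframes() / clip.getframerate()
    return total


if __name__ == "__main__":
    seconds = total_seconds("clips")  # hypothetical folder of converted clips
    print(f"{seconds / 60:.1f} minutes of dialog")
```

For scale, the post's own estimate of 2 seconds per clip reaching 10 minutes (600 seconds) implies at least 300 files.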
>>9009 >>9012 Are the devs FINALLY accepting new characters? Seriously it needs more if it ever wants to stay alive.
>>9013 I doubt 10 minutes is long enough to train the neural network decently, but tell me if your character gets accepted so we have a reference.
(20.34 KB 153x203 155728911069.png)
/v/ related, but I just realized I can send them WC3 audio tracks and make Arthas memes for the rest of my life.
>>9017 When you're done mod the game and change all dialogue to https://invidious.048596.xyz/watch?v=7gCo8fajC7o
>>9018 There's an Argentinian guy working on a remake of the original WC3 storyline with WoW models, this would come in really handy for him.
>>9016 The majority of characters have under 11 minutes of audio. https://15.ai/characters
(121.38 KB 604x360 Carl's Bizarre Adventure.mp4)
(103.43 KB 828x813 Ert8mWKVkAE11Fk.jpg)
Some of these are really good.
>>9028 Those sound terrible...
Any idea of what kind of override sentences I need to make the AI sing?
>>9031 If it's anything like Moonbase Alpha then something like this. Space odyssey: [dah<2000,17>][dah<2000,24>][dah<2500,29>][dah<200,35>][dah<2000,34>]
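A minimal sketch of how a string in that bracket format could be generated, assuming each token reads as `[phoneme<duration in ms,pitch number>]` as in the Moonbase Alpha example. The function name `dectalk_notes` is hypothetical, and this covers only that game's text-to-speech syntax; nothing in the thread indicates 15.ai accepts it.

```python
def dectalk_notes(notes, phoneme="dah"):
    """Build a sung line from (duration_ms, pitch) pairs.

    Assumption: every note reuses the same phoneme, as in the example.
    """
    return "".join(f"[{phoneme}<{ms},{pitch}>]" for ms, pitch in notes)


if __name__ == "__main__":
    # The five notes from the post above.
    print(dectalk_notes([(2000, 17), (2000, 24), (2500, 29), (200, 35), (2000, 34)]))
```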
>>9032 This isn't a normal text to speech layout.
>>9033 I mean 15.ai
>>8835 I agree with you Anon, on the bright side if you can find clean voice tracks the guy is apparently willing to do it. but you may have to venture to cuckchannel's /mlp/
>>8835 Strongly agree, no one could contest the validity because while you could copyright strike shit like pre-recorded messages on movies and cartoons, how can you justify owning the right to mimic someone's voice when they can't use it anymore? All reenactors would have to be shot on sight. Anyhow, does anyone have any idea if this guy ever released the source code, or has anyone found his original papers?
>>9031 Have you tried just having them play the lines with a |happy deepmoji encoding and then just using something like Audacity to give it higher or lower pitch?
>>9046 He hasn't released the source, I think it's some sort of patent or something.
(1.06 MB 712x480 BHn1JvCm0fEE0OTn.mp4)
(818.98 KB 640x360 video0(5).mp4)
(1.35 MB 640x360 video0(4).mp4)
(2.49 MB 1058x720 TFchungus.mp4)
(539.18 KB 640x360 Heavy_cub.mp4)
(1.59 MB 1080x608 6bC396owXHge-JeY.mp4)
(53.89 KB 112x112 1601522705249.gif)
>>9084 15.ai truly has unlimited potential.
(862.43 KB 1920x1080 Daria's opinion on WMAF.webm)
>>9084 HOLY FUCK I couldn't breathe for a minute straight
>>9086
>that ending
Nice lol
When is someone going to make a moonrap feat. one or more of these characters?

