Skip Navigation

Cr4yfish

@ Cr4yfish @lemmy.world

Posts

3
Comments

58
Joined

2 yr. ago

Developer fighting 502s from Lemmys Servers.

1y ago

I'm building a Community powered Duolingo-like App
Jump
1

Cr4yfish @lemmy.world 1y ago
Thanks :)

1y ago

I'm building a Community powered Duolingo-like App

Jump

Cr4yfish @lemmy.world 1y ago

Hm that's very weird. I can't replicate it and I used some random SSL checker website and it checks out as well.

Really not sure why that's happening.

1y ago

I'm building a Community powered Duolingo-like App

Jump

Cr4yfish @lemmy.world 1y ago

it’s implied it’s licensed under "all rights reserved", so not open source!

Oh, I actually did not know that. I'll try to remember adding a License right from the get-go from now on, thanks :)

1y ago

I'm building a Community powered Duolingo-like App

Jump

Cr4yfish @lemmy.world 1y ago

It's GPLv3 now.

1y ago

I'm building a Community powered Duolingo-like App

Jump

Cr4yfish @lemmy.world 1y ago

oh, right. Forget that every time. I'll add one.

1y ago

I'm building a Community powered Duolingo-like App

Jump

Cr4yfish @lemmy.world 1y ago

I use Gemini, which supports PDF File uploads, combined with structured outputs to generate Course Sections, Levels & Question JSON.

When you upload a PDF, it first gets uploaded to a S3 Database directly from the Browser, which then sends the Filename and other data to the Server. The Server then downloads that Document from the S3 and sends it to Gemini, which then streams JSON back to the Browser. After that, the PDF is permanently deleted from the S3.

Data Privacy wise, I wouldn't upload anything sensitive since idk what Google does with PDFs uploaded to Gemini.

The Prompts are in English, so the output language is English as well. However, I actually only tested it with German Lecture PDFs myself.

So, yes, it probably works with any language that Gemini supports.

Here is the Source Code for the core function for this feature:

js

    
export async function createLevelFromDocument(
    { docName, apiKey, numLevels, courseSectionTitle, courseSectionDescription }: 
    { docName: string, apiKey: string, numLevels: number, courseSectionTitle: string, courseSectionDescription: string }) 
    {
    
    const hasCourseSection = courseSectionTitle.length > 0 && courseSectionDescription.length > 0;

    // Step 1: Download the PDF and get a buffer from it
    const blob = await downloadObject({ filename: docName, path: "/", bucketName: "documents" });
    const arrayBuffer = await blob.arrayBuffer();
    
    // Step 2: call the model and pass the PDF
    //const openai = createOpenAI({ apiKey: apiKey });
    const gooogle = createGoogleGenerativeAI({ apiKey: apiKey });

    const courseSectionsPrompt = createLevelPrompt({ hasCourseSection, title: courseSectionTitle, description: courseSectionDescription });
    
    const isPDF = docName.endsWith(".pdf");

    const content: UserContent = [];

    if(isPDF) {
        content.push(pdfUserMessage(numLevels, courseSectionsPrompt) as any);
        content.push(pdfAttatchment(arrayBuffer) as any);
    } else {
        const html = await blob.text();
        content.push(htmlUserMessage(numLevels, courseSectionsPrompt, html) as any);
    }

    const result = await streamObject({ 
        model: gooogle("gemini-1.5-flash"),
        schema: multipleLevelSchema,
        messages: [
            {
                role: "user",
                content: content
            }
        ]
    })
    

    return result;
}

1y ago

I'm building a Community powered Duolingo-like App

Jump

Cr4yfish @lemmy.world 1y ago

Understandable. I added a proper offline mode back to the Roadmap on github.

1y ago

I'm building a Community powered Duolingo-like App

Jump

Cr4yfish @lemmy.world 1y ago

I added it back to the roadmap :).

1y ago

I'm building a Community powered Duolingo-like App

Jump

Cr4yfish @lemmy.world 1y ago

Thanks :). Yeah, it's publicly accessible: nouv.app/. I use it daily already but it still has tons of bugs.

1y ago

I'm building a Community powered Duolingo-like App

Jump

Cr4yfish @lemmy.world 1y ago

The UI mostly works offline once loaded in due to aggressive caching. Downloading Course Content was on the initial Roadmap but I removed it since I wasn't sure if anyone would like the feature.

Syncing stuff is a real pain in the ass but I'll implement it if at least a couple people want it.

1y ago

I'm building a Community powered Duolingo-like App

Jump

Cr4yfish @lemmy.world 1y ago

Thanks for the suggestion, I’ll definitely try to make the app as language inclusive as possible!

Also, sorry if I might’ve been too vague with the post title. The app is just similar to Duolingo in terms of structure and the idea, however it’s not specific to language learning but supposed to cater to any subject, really.

For example, I personally use it to study for my university subjects.

2y ago

probably my biggest gripe with Lemmy right now. Feels like I'm just stuck in a loop.

Jump

Cr4yfish @lemmy.world 2y ago

I'm actually trying to solve this issue on my own Lemmy app. It automatically switches instances when the requested one is down. Works only in the Feed right now and, of course, accounts are still instance-bound - but I will fix that soon.

2y ago

Which Lemmy android app is the most battery efficient?

Jump

Cr4yfish @lemmy.world 2y ago

I always wonder when people say something like this. I also develop a Lemmy app myself and don't understand this point, like are you afraid people will complain about your code cleanliness or commenting techniques?

I mean what extra work is there really? Moving secrets to environment variables is annoying, I get that at least.

I mean no offense to you at all, really, but when I check out other Lemmy apps I don't even bother with closed source ones since I can't possibly know if you just steal login information. Especially since this is so immensely easy with Lemmy.

Again, I'm not saying you do these things but it's always better being able to check yourself, you know?

2y ago

I just hit Cancel by accident AGAIN.

Jump

Cr4yfish @lemmy.world 2y ago

Which App are you using?

2y ago

An plugin or app store and count my upvote, comment, meme ?

Jump

Cr4yfish @lemmy.world 2y ago

Is this a bot?

2y ago

Why the fuck do cars still have analog speedometers? Surely digital ones would be more accurate and much easier to read without looking away from the road for too long.

Jump

Cr4yfish @lemmy.world 2y ago

You could make like a circular shape on the screen with numbers correlating to the speed on different angles. Then maybe add some rectangle which points at the current speed and effectively changes the angle when the speed changes.

Oh wait..

2y ago

Lemmy enjoys growth as developers pivot from Reddit amid API charging controversy

Jump

Cr4yfish @lemmy.world 2y ago

I just wished the Lemmy API docs were better lol.

2y ago

hmm

Jump

Cr4yfish @lemmy.world 2y ago

You already have 13 on your comment. You're like a lemmy celebrity now.

What is it like being famous?

2y ago

Every time I try to touch grass

Jump

Cr4yfish @lemmy.world 2y ago

Don't trust the dudes telling you to chmod 777 everything.

2y ago

Are you still using AI tools to help with development?

Jump

Cr4yfish @lemmy.world 2y ago

I'm a FSE and I use GitHub copilot and Perplexity. I wouldn't want to code without them anymore.

I want to get things done (especially when I'm at work) and not spent time reading docs or having 20 tabs of stackflow open. I've had enough of that lol.

I think everyone here knows copilot but perplexity is a lot smaller and newer. It's basically like chatgpt but faster and it googles stuff, giving sources for each claim that I can read for myself.

For example, for my latest project I decided to give tailwind a try and instead of having to look through the docs for every little thing I just ask perplexity and it sums it up for me, even giving examples.

And I use copilot a lot for mundane tasks, for example when I write an API that takes an object of type Foo, Copilot auto Fills making variables and checking each for nulls and then I use that API in the frontend copilot already knows what I'm about to do and auto-fills the fetch.