[D] Can MLP layers within GPTs be approximated using KAN layers?

The paper KAN: Kolmogorov–Arnold Networks (https://arxiv.org/pdf/2404.19756, summarized at https://towardsdatascience.com/kolmogorov-arnold-networks-the-latest-advance-in-neural-networks-simply-explained-f083cf994a85 ) suggests that any 2-hidden-layer MLP (like those used in GPTs) can be approximated by KANs: "KANs outperform MLPs in terms of accuracy...

Sat May 18, 2024 09:46
[R] 1:10 Radio Controlled Car autonomous driving

I badly need some advice. I want to create a machine learning model that takes image input from 2 stereo cameras and outputs throttle and steering. I am racing this car on a track, from a platform that overlooks the track. My thinking is to create a depth and disparity map of the racetrack, pinpoint the car, and then track it based on the...

Sat May 18, 2024 09:46
[D] Computer Vision with Transformers and NLP

Hi, my use case is the classification of different types of matter using computer vision. Let's say I have 200s of these matters. I would like to classify them not only from a plain image but also from descriptions, using an LLM. For example: User: pls see this image.jpg The matter glows when it is near heat. The matter is a solid at -2c LLM: the answer...

Sat May 18, 2024 06:46
[D] SFT has higher grad norm but lower loss compared to pre-training, why?

Recently I have been doing continual pre-training of OpenELM-1.1B on a custom corpus with 2B tokens, followed by fine-tuning it on a custom instruction dataset with 4M samples. I found that the loss (both training and eval) in the PT stage is consistently higher than in the SFT stage when trained on a similar number of tokens, but the grad norm for SFT is higher than for PT....

Sat May 18, 2024 06:46
Cross-validation train/validation graphs [D]

A common approach to model evaluation is cross-validation (CV). Authors often report accuracy and other metrics (F-measure, precision, etc.) derived from this approach. Alongside that, they plot training and validation graphs for both loss and accuracy, as well as confusion matrices. My question is about how these graphs are...

Sat May 18, 2024 06:46
[D] Labeling software advice for my use cases

I'm looking for software to help me segment and categorize DICOM datasets (both images and cines (movies)), at first 2D but later probably CT/MRI as well, and that also lets me annotate procedural videos. Because I need to keep track of so many images (including which images from a given series I want to include in or exclude from analysis, and keep track...

Sat May 18, 2024 06:46
