1k followers 90 posts / week
[R] autoencoder’s decoder layers have smallest gradients?

As the title states. I built an autoencoder with ELU activations and a sigmoid as the final activation function of the decoder. Looking at the gradients, my decoder layers have very small gradients. I thought the earlier layers were supposed to have the small gradients, e.g. the encoder layers, since they are the furthest from the output (and backprop...
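
One way to reason about the question in this preview is a hand-worked scalar toy: one encoder weight, an ELU, one decoder weight, a sigmoid, and a squared reconstruction error. The sketch below is only an illustration under those assumptions; the names `w_enc`, `w_dec`, `x` and the chosen values are hypothetical and are not taken from the poster's model. It shows that a layer's gradient size depends on the activation feeding it and on the weights downstream of it, not only on its distance from the output.

```python
import math

def elu(a, alpha=1.0):
    return a if a > 0 else alpha * (math.exp(a) - 1.0)

def elu_grad(a, alpha=1.0):
    return 1.0 if a > 0 else alpha * math.exp(a)

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def loss(w_enc, w_dec, x):
    # Scalar "autoencoder": x -> ELU(w_enc * x) -> sigmoid(w_dec * h) -> squared error
    h = elu(w_enc * x)
    y = sigmoid(w_dec * h)
    return (y - x) ** 2

def grads(w_enc, w_dec, x):
    a = w_enc * x
    h = elu(a)
    y = sigmoid(w_dec * h)
    # dL/dz at the sigmoid pre-activation; y * (1 - y) is the sigmoid derivative
    delta = 2.0 * (y - x) * y * (1.0 - y)
    g_dec = delta * h                         # scaled by the hidden activation h
    g_enc = delta * w_dec * elu_grad(a) * x   # scaled by w_dec and the input x
    return g_enc, g_dec

# Hypothetical values: a small hidden activation and a large decoder weight.
g_enc, g_dec = grads(w_enc=0.1, w_dec=3.0, x=0.5)
print("encoder grad:", g_enc, "decoder grad:", g_dec)
```

With these toy values the decoder gradient comes out smaller than the encoder gradient because the hidden activation is small while the decoder weight is large. Whether anything similar applies to the network described in the post cannot be told from the preview.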

Sat Nov 2, 2024 03:05
[D] problem with dataset?

I have been working with this dataset: https://www.kaggle.com/datasets/parvmodi/automotive-vehicles-engine-health-dataset for quite a while now. I have tried various preprocessing techniques and classification trainers but no matter what I was unable to get over 68% accuracy on the models. I am not sure if I am doing something wrong or the datasets...

Sat Nov 2, 2024 00:05
[R] Very Attentive Tacotron: Robust and Unbounded Length Generalization in Autoregressive Transformer-Based Text-to-Speech

Paper: https://arxiv.org/abs/2410.22179 Audio Examples: https://google.github.io/tacotron/publications/very_attentive_tacotron/index.html Reference implementation (GitHub): https://github.com/google/sequence-layers/blob/main/examples/very_attentive_tacotron.py Tweet containing preview video: https://twitter.com/EricBattenberg/status/1852113437176029419...

Sat Nov 2, 2024 00:05
[D] Thinking LLMs - Instruction following with "Thought Generation"

https://arxiv.org/abs/2410.10630 Greg Schoeninger u/FallMindless3563, Oxen.ai CEO and Master of Plain Speak, has attempted to reproduce the findings in this paper using only model inferencing, datasets, and a fine-tuning API. The call to show results and dive into the paper starts today at 10:00 AM Pacific, 1:00 PM Eastern. https://www.oxen.ai/community/?utm_source=x&utm_content=y...

Fri Nov 1, 2024 21:06
[D] How to identify which layer(s) have been skipped in (res/dense)net in testing phase?

I am new to computer vision and have read about a few model architectures such as ResNet, DenseNet, and EfficientNet. I have trained these networks on a dataset. I am now experimenting with my test set by taking the saliency map of the output (d(output)/d(input)). I am trying to debug the model while generating the saliency. As we know that the above...
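
The saliency quantity named in this preview, d(output)/d(input), can be sketched without any framework by using central finite differences. In the sketch below, `toy_model` is a hypothetical stand-in with one skip path and one nonlinear branch; it is not one of the architectures from the post, and a real network would normally get this gradient from its framework's automatic differentiation instead.

```python
import math

def toy_model(x):
    # Hypothetical residual-style function: skip path on x[0] plus a tanh branch.
    w = [0.5, -1.0, 2.0]
    branch = math.tanh(sum(wi * xi for wi, xi in zip(w, x)))
    return x[0] + branch

def saliency(f, x, eps=1e-5):
    # |d f / d x_i| for each input, estimated by central differences.
    result = []
    for i in range(len(x)):
        up = list(x)
        down = list(x)
        up[i] += eps
        down[i] -= eps
        result.append(abs((f(up) - f(down)) / (2 * eps)))
    return result

print(saliency(toy_model, [0.0, 0.0, 0.0]))
```

In this toy the first input receives gradient from both the skip path and the branch, so its saliency is the sum of the two contributions rather than a sign that either path was bypassed.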

Fri Nov 1, 2024 18:06
[D] What is the current state on getting an "inverse" of a Neural network

To clarify what I mean (my background is more statistical, but I have a problem with a quite nonlinear relationship): say I have inputs (predictor variables), for example [x1,...,x10], which are all inherently numerical (i.e. no dummies), and a continuous numerical output y, and say I fit some NN as y ~ x1 +... x10 (we can assume a relatively simple...
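
One common reading of "inverting" a fitted model is to search for inputs that reproduce a target output. The sketch below does this by gradient descent on the inputs of a fixed function, using finite-difference gradients. Everything in it is an assumption for illustration: `f` is a hypothetical two-input stand-in for a trained network, and `invert`, the learning rate, and the step count are arbitrary choices, not something stated in the post.

```python
import math

def f(x):
    # Hypothetical stand-in for a fitted network with two numerical inputs.
    return math.tanh(0.8 * x[0] - 0.5 * x[1]) + 0.3 * x[0]

def num_grad(fn, x, eps=1e-6):
    # Central-difference estimate of the gradient of fn at x.
    g = []
    for i in range(len(x)):
        up = list(x)
        down = list(x)
        up[i] += eps
        down[i] -= eps
        g.append((fn(up) - fn(down)) / (2 * eps))
    return g

def invert(fn, y_target, x0, lr=0.1, steps=2000):
    # Minimise (fn(x) - y_target)^2 over the inputs x, keeping the model fixed.
    x = list(x0)
    for _ in range(steps):
        r = fn(x) - y_target
        g = num_grad(fn, x)
        x = [xi - lr * 2.0 * r * gi for xi, gi in zip(x, g)]
    return x

xa = invert(f, 0.7, [0.0, 0.0])
xb = invert(f, 0.7, [1.0, 2.0])
print(xa, f(xa))
print(xb, f(xb))
```

The two starting points end at different inputs that give the same output, which illustrates that a many-to-one model has no unique inverse. Any such search returns one candidate among many unless extra constraints are added.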

Fri Nov 1, 2024 18:06
