/sci/ - Science & Math » Thread #12612218

134KiB, 1000x383, 2deep4me.png

View Same Google iqdb SauceNAO

Anonymous Fri 22 Jan 05:05:28 2021 No.12612218 View Reply Original Report

Quoted By: >>12612227 >>12612238 >>12612271 >>12612321 >>12614280 >>12614649 >>12614818 >>12617189

Assume a deep neural network. How does one compute the average processing power spent on each layer? Each node?

Papers or other studies on this are appreciated.

Gary !3bEffhmerM

Gary !3bEffhmerM Fri 22 Jan 2021 05:08:54 No.12612227 Report

Quoted By: >>12612231 >>12612930 >>12614024 >>12617010

>>12612218

Each layer represents an attribute. Each weight represents a percentage of match to the input. It's much simpler then they try and make it out to be. There is an input, that input has attributes, each attribute has a layer, each layer has node of all permutations of that attribute, each node has a percentage match to the input. Its like stupid simple. They obfuscate it to make themselves sounds smarter lol. Which is OK.

Gary !3bEffhmerM

Gary !3bEffhmerM Fri 22 Jan 2021 05:10:03 No.12612231 Report

Quoted By: >>12617010

>>12612227

OPTIMUM THEORY shit of course. Give credit please. Thanks.

Gary !3bEffhmerM

Gary !3bEffhmerM Fri 22 Jan 2021 05:12:38 No.12612238 Report

Quoted By: >>12612250 >>12617010

>>12612218

All intelligence is just pattern matching and recognition. Full stop. So imagine you need to recognize, say, an orange. So the image input goes in and then that filters through various layers, all representing an attribute. Hair, skin, temperament, etc... if the input has fake hair, jaundice skin and infant temperament to a certain percentage then you have a match. Its simple.

Anonymous

Anonymous Fri 22 Jan 2021 05:15:11 No.12612240 Report

Quoted By: >>12612243

Gary, can you please not shit into my thread? Thanks.

Gary !3bEffhmerM

Gary !3bEffhmerM Fri 22 Jan 2021 05:17:16 No.12612243 Report

Quoted By: >>12612246 >>12617010

>>12612240

A simple "thanx u" would suffice Anon. Much love *MUAH*

Anonymous

Anonymous Fri 22 Jan 2021 05:20:13 No.12612246 Report

Quoted By:

>>12612243
Thanx u

Anonymous

Anonymous Fri 22 Jan 2021 05:21:50 No.12612250 Report

Quoted By: >>12612254 >>12612261 >>12612321

>>12612238
Nobody on 4chan says full stop. That is leftist propaganda found only on twitter. They are trying to have it mean something it doesn't actually mean. It is literally putting two periods. It is non-sensical token phrase construction that doesn't fit in with written communication. Everyone who uses it is a retard full stop

Anonymous

Anonymous Fri 22 Jan 2021 05:23:06 No.12612252 Report

Quoted By:

I’m being extorted by schizopost hustlers, just trying to run a thread shop but they have a racket set up.

Anonymous

Anonymous Fri 22 Jan 2021 05:24:06 No.12612254 Report

Quoted By: >>12612318

>>12612250
THANK YOU FOR YOUR INPUT STOP NOTED STOP MOM STOP

Anonymous

Anonymous Fri 22 Jan 2021 05:27:39 No.12612261 Report

Quoted By:

>>12612250
>full stop

OHHH OHHHH YOU SAID IT YOU SAID IT!!!!!!!

Anonymous

View Same Google iqdb SauceNAO MOAR LAYERS.jpg, 80KiB, 947x946

Anonymous Fri 22 Jan 2021 05:33:58 No.12612271 Report

Quoted By: >>12612293 >>12612300

>>12612218

Anonymous

View Same Google iqdb SauceNAO 1606899569860.png, 188KiB, 590x469

Anonymous Fri 22 Jan 2021 05:43:03 No.12612293 Report

Quoted By:

>>12612271

Anonymous

View Same Google iqdb SauceNAO 1606899569860.png, 188KiB, 590x469

Anonymous Fri 22 Jan 2021 05:44:26 No.12612300 Report

Quoted By:

>>12612271
>STACK MORE LAYERS

Anonymous

Anonymous Fri 22 Jan 2021 05:52:58 No.12612318 Report

Quoted By:

>>12612254
But can you shitpost in Morse?

Anonymous

Anonymous Fri 22 Jan 2021 05:55:04 No.12612321 Report

Quoted By: >>12612342

>>12612218
fuck off
>>12612250
full stop, not even wrong.
>tasty irony with the self-immolation it applies here too.
anyone else is satisfied with evenly weighting node variables in the layers.
why would i help you optimize/refine a model for free? this is homework

Anonymous

Anonymous Fri 22 Jan 2021 06:04:22 No.12612342 Report

Quoted By: >>12612347 >>12612394

>>12612321
No fun allowed brigade has arrived.

It’s not for optimization, I want to learn if CPU demand spikes in different stages of learning. If it’s uniform it’s also acceptable.

Anonymous

Anonymous Fri 22 Jan 2021 06:06:25 No.12612347 Report

Quoted By: >>12612361

>>12612342
>uniform

Uniform would make no fucking sense. Lol. He's trolling you. Now go build something fucking awesome and report back. Cheers.

Anonymous

Anonymous Fri 22 Jan 2021 06:12:24 No.12612361 Report

Quoted By:

>>12612347
I am building it man, can you suggest a software to calculate NN CPU demand at least?

Anonymous

Anonymous Fri 22 Jan 2021 06:26:48 No.12612394 Report

Quoted By: >>12612398 >>12612413

>>12612342
sorry, i'll take you at your word.
https://pdf.sciencedirectassets.com/280203/1-s2.0-S1877050919X00125/1-s2.0-S1877050919310440/main.pdf?X-Amz-Security-Token=IQoJb3JpZ2luX2VjEN3%2F%2F%2F%2F%2F%2F%2F%2F%2F%2FwEaCXVzLWVhc3QtMSJGMEQCIEZ5tGgSyOCYcZGMVgNBvW6tPFLXYS6prgrN3q92wydEAiARvP4n3%2BH%2BT6HBssLyE5U1AJbiLj8hcqB6dWbAif6nyiq9Awi2%2F%2F%2F%2F%2F%2F%2F%2F%2F%2F8BEAMaDDA1OTAwMzU0Njg2NSIMBNwZ94ler%2B3e8mCPKpED6BlF4hb4egvJ5eWUSf0oybclPc5ouHciMmulMuuiH0eWpT7RlusVuc5tKS5IjV%2BpkJm0g%2FF9YBpTVEjSOPSxV86WnEHaEKnEHIskUr2BFzWBYpRWAalhSmFJtn4EQuxpTMR7sEHT0Q3YMqRYZ%2FMJJDt1ugmJbAqjgVU40gvTp62Y%2FO%2FhwfOqSfXKHnVhxUUAd4s3at67fImHdqyqPACYXmb62M6pI%2BuLUSJ2k93UgQvjHavjgb%2B86JLqADEjjlLu1i3x3BEIsRUloU7D5VAT0a0aWBrjsj7lyUadodO0vh1HSgMKURiG%2BweDTEmZKnZ3uD6FZ7QLg6HhJp%2BQQY4YVX3%2Bge6wApKBzISnoo0VMh7DGxiO6wxDl2iVTgJL2LpTMRGTvKZaOFhYT%2FWpYmgZod0pRgjVOKqlxMB1Oo8BKeQGJJ0sPMumtCGSKsORiigRw9En2DsGmNg1jjlmsjkn69xNhN8ndO4eSa3K8OK7HYyz8IkAmmNR1iVb6sUm%2FXESGcGkxWogHFY2%2FBrcP9ES%2FwgwsbOpgAY67AGfLPOC0dep%2B6VCQh1Ytf2Peb7FLH5QNWBb71Aqa9xnMlEtMcnNPxFZnCn2ZkvoKqNSDTw1vLYGjCb7soAFNFBAW8CcylewGdZ5pDi4Yp3i9TNqzVEbbqZoiXWHIJZR8Iji1swiX5xxxtiQy44xPUTLhgroygbMwJAvDaBYQRqLsVMAxYvWPiFf5kX4MNHzEG2j9KzhvWPdLvdrQMuhoHzi2ivRjzeThX4ZtajhL2uRNvFHcQ9aKKUa%2FyLFVx65avpqlMa3jN6qx41yTF1kIutFTZyf8kKSSYTDYQypG1wy0A7cmDjwAknE9Lp41A%3D%3D&X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Date=20210122T062509Z&X-Amz-SignedHeaders=host&X-Amz-Expires=299&X-Amz-Credential=ASIAQ3PHCVTYRI6KNZ63%2F20210122%2Fus-east-1%2Fs3%2Faws4_request&X-Amz-Signature=e0b652d3e88d2119b29ce187398bca8f23c7d43aa9aa334607927645ec211088&hash=5f3d1aea39041565676491ed30038ea3235b30fad891d84da61d2767a15eedb6&host=68042c943591013ac2b2430a89b270f6af2c76d8dfd086a07176afe7c76c2c61&pii=S1877050919310440&tid=spdf-b45f52f7-0ccc-4d6b-ab84-a049333c0cbb&sid=9933f429659af6447f4a9481e745aabd58a0gxrqa&type=client
check 2 problem statement

>refine your inputs.
im a node to your layer function in this meta-art.

Anonymous

Anonymous Fri 22 Jan 2021 06:27:50 No.12612398 Report

Quoted By:

>>12612394
>already citing for someone else, not going to bother condensing

Anonymous

Anonymous Fri 22 Jan 2021 06:32:22 No.12612413 Report

Quoted By: >>12612420

>>12612394
Much appreciated, thanks.

Anonymous

Anonymous Fri 22 Jan 2021 06:35:10 No.12612420 Report

Quoted By: >>12612428

>>12612413
love you. happy to grunt some papers your way if you give me details.
>dont even care if you're google at this point kek.

Anonymous

Anonymous Fri 22 Jan 2021 06:37:29 No.12612428 Report

Quoted By: >>12612449

>>12612420
I’m a civil engineer, I’m looking to apply Bernoulli principle to deep learning if it makes sense.

Anonymous

View Same Google iqdb SauceNAO tenor.gif, 3MiB, 498x278

Anonymous Fri 22 Jan 2021 06:43:49 No.12612449 Report

Quoted By: >>12612463

>>12612428
>Bernoulli principle
i feel like this is backflow propagation prevention system of some kind.. or somethingsomething cavitiation.
>thanks for building my world anon.
>do you need citations and proofs or do you need practical tools/models?
are you using TensorFlow currently?

Anonymous

Anonymous Fri 22 Jan 2021 06:50:04 No.12612463 Report

Quoted By: >>12612490

>>12612449
I don’t know I need both? I’m trying to formulate an energy equation for deep neural networks so any lead helps.
Not TensorFlow, I use Python and C++. Could switch to TF if it’s feasable.

Anonymous

Anonymous Fri 22 Jan 2021 07:04:51 No.12612490 Report

Quoted By: >>12612503

>>12612463
sounds like you should check git?
have you tried keyword "workload" in searching for papers?
thats way too general..
i feel like youre trying to get me to find a paper you submitted or something..

https://www.google.com/search?q=neural+network+optimum+theory&oq=neural+network+optimum+theory&aqs=chrome..69i57j0i10i22i30i457.6020j1j7&sourceid=chrome&ie=UTF-8

https://www.google.com/search?ei=IXcKYI75GrbDz7sPuLSZuAw&q=measure+n+node+workload+-cognitive&oq=measure+n+node+workload+-cognitive&gs_lcp=CgZwc3ktYWIQA1DYqAJY_bkCYO2_AmgBcAB4AYAB7AOIAfkQkgEJMC45LjEuMC4xmAEAoAEBqgEHZ3dzLXdpesABAQ&sclient=psy-ab&ved=0ahUKEwjOjaiC-67uAhW24XMBHThaBscQ4dUDCA0&uact=5

what you want them do?

Anonymous

Anonymous Fri 22 Jan 2021 07:09:52 No.12612503 Report

Quoted By: >>12612517

>>12612490
Nope, I’ll search with workload. I found people suggesting a CPU power x runtime calculation but that’s total energy demand and I can get that with a simple power clock.

These will be enough for now, thanks Gary. Say hi to your team from me

Anonymous

Anonymous Fri 22 Jan 2021 07:15:18 No.12612517 Report

Quoted By: >>12612519

>>12612503
im a laborer dude. i fucking wish i was gary.

Anonymous

Anonymous Fri 22 Jan 2021 07:16:35 No.12612519 Report

Quoted By: >>12612527

>>12612517
Labor on labro

Anonymous

Anonymous Fri 22 Jan 2021 07:18:54 No.12612527 Report

Quoted By: >>12612534

>>12612519
any words on a cheat sheet for a cadetship?

Anonymous

Anonymous Fri 22 Jan 2021 07:21:25 No.12612534 Report

Quoted By:

>>12612527
Not following you there, sorry

Anonymous

Anonymous Fri 22 Jan 2021 10:47:04 No.12612930 Report

Quoted By:

>>12612227
>It's much simpler then they try and make it out to be
>they
Who?

Anonymous

Anonymous Fri 22 Jan 2021 15:45:41 No.12613951 Report

Quoted By: >>12614822

Every layer can be thought of as a vector matrix multiplication (or a matrix-matrix multiplication if you are doing batches)

So the computational power spent on each layer can be derived from the size of the vector and matrix (count the adds and multiplies)

You mentioned that you are interested in CPU demand spikes. The computational load of a neural network is constant, so the spiking you will observe will have more to do with cache misses and memory I/O than the actual workload.

Anonymous

Anonymous Fri 22 Jan 2021 16:10:42 No.12614024 Report

Quoted By: >>12614834

>>12612227
https://playground.tensorflow.org/#activation=tanh&regularization=L1&batchSize=10&dataset=xor&regDataset=reg-plane&learningRate=0.1&regularizationRate=0.001&noise=0&networkShape=4,4,4,4&seed=0.69966&showTestData=false&discretize=false&percTrainData=50&x=true&y=true&xTimesY=false&xSquared=false&ySquared=false&cosX=false&sinX=false&cosY=false&sinY=false&collectStats=false&problem=classification&initZero=false&hideText=false

Start this up and let it run for a while. I would say that not each layer corresponds to a property of the input, but each node.

Anonymous

View Same Google iqdb SauceNAO 1611323812571.png, 102KiB, 1667x1667

Anonymous Fri 22 Jan 2021 16:27:27 No.12614093 Report

Quoted By: >>12614156 >>12615783

A better question is: why do some people use hundreds of times more parameters than data points?

Anonymous

Anonymous Fri 22 Jan 2021 16:42:58 No.12614156 Report

Quoted By: >>12614208

>>12614093
This is counterintuitive, but very large neural networks actually generalize better when they have massively more parameters than data. This has been demonstrated experimentally time and time, so we know there is something different happening with these types of models.

I don't think there is a definitive answer on this question, but the current hypothesis seems to be that when you have massively more parameters, it creates "flatter" optima on your parameter landscape. When you have large flat optima, you get good generalization, because the network can't be bouncing around chasing after each data, you just don't have enough gradient to follow.

Anonymous

View Same Google iqdb SauceNAO 4shi.jpg, 114KiB, 1200x857

Anonymous Fri 22 Jan 2021 16:58:07 No.12614208 Report

Quoted By: >>12614225 >>12619145

>>12614156
That does leave a fundamental problem though, without the restrictive parameter space there is no telling that what the network does isn't just a glorified fitting database

Anonymous

Anonymous Fri 22 Jan 2021 17:01:57 No.12614225 Report

Quoted By: >>12614246

>>12614208
There is that potential. But the randomness of stochastic gradient descent is what saves you.

Anonymous

View Same Google iqdb SauceNAO 1577628032082.jpg, 96KiB, 700x509

Anonymous Fri 22 Jan 2021 17:06:24 No.12614246 Report

Quoted By: >>12614267

>>12614225
Coming from polynomial fitting I find that immensely hard to swallow, knowing full well NNs are different beasts.

Anonymous

Anonymous Fri 22 Jan 2021 17:11:42 No.12614267 Report

Quoted By: >>12614322

>>12614246
You aren't alone. This intuition about parameter counts is one of the big reasons why neural networks only saw a major revival in the past 10-15 years. We've known about them for a very long time, but everyone thought they surely would overfit because of the paramaters.

But luckily they don't because of the dynamics of the optimization process. Its not because of the neural network itself. If you could somehow figure out a closed form solution to optimize a neural network, that would certainly overfit.

Anonymous

Anonymous Fri 22 Jan 2021 17:16:15 No.12614280 Report

Quoted By:

>>12612218
>average processing power
it's not that hard to count nodes and the number of operations they perform.

it's more tricky to estimate power/operation, which is really what's important.

i know a startup company trying to make 4f optical processors to do the convolutions/correlations at the speed of light. it was important to first establish such a solution could theoretically beat current digital hardware in terms of power/op

Anonymous

Anonymous Fri 22 Jan 2021 17:27:15 No.12614322 Report

Quoted By: >>12614348

>>12614267
>But luckily they don't because of the dynamics of the optimization process. Its not because of the neural network itself.
That seems rather counterintuitive, given you can use very similar optimization techniques for things other than NNs and it would fuck you up.

Anonymous

Anonymous Fri 22 Jan 2021 17:36:42 No.12614348 Report

Quoted By:

>>12614322
Well usually other kinds of models have vastly fewer parameters than a neural network. The parameter count is a key ingredient here, because that is what produces the flat optima surfaces. A neural network with few parameters has very sharp optima.

I would suspect the other models you are talking about retain their sharp optima even with high parameter counts, preventing them from taking advantage of this effect.

Anonymous

Anonymous Fri 22 Jan 2021 18:40:59 No.12614649 Report

Quoted By:

>>12612218
What do you mean by average processing power? The amount of operations performed at each layer? That is rather trivial.

Anonymous

Anonymous Fri 22 Jan 2021 19:10:23 No.12614818 Report

Quoted By:

>>12612218
That looks like it would suck anon. It's too rigid and too balanced, in an unbalanced sort of way.

Anonymous

Anonymous Fri 22 Jan 2021 19:11:12 No.12614822 Report

Quoted By: >>12615639

>>12613951
>so the spiking you will observe will have more to do with cache misses and memory I/O than the actual workload.

Okay so what if instead of neural networks I work with spiking models of the brain?

Anonymous

Anonymous Fri 22 Jan 2021 19:12:15 No.12614834 Report

Quoted By:

>>12614024
Thanks

Anonymous

Anonymous Fri 22 Jan 2021 22:38:09 No.12615639 Report

Quoted By:

>>12614822
That's probably what you are looking for

Anonymous

Anonymous Fri 22 Jan 2021 23:17:14 No.12615783 Report

Quoted By:

>>12614093
Source your claim. I'm interested to see such people.

Anonymous

Anonymous Sat 23 Jan 2021 07:08:00 No.12617010 Report

Quoted By:

>>12612227
>>12612231
>>12612238
>>12612243
>gary is tripfagging now

Anonymous

Anonymous Sat 23 Jan 2021 07:43:38 No.12617109 Report

Quoted By:

I only know about logistic and linear regression right now,any idea as to when I can get acquainted with advanced neural networking stuff?(keep in mind that I'm great at maths)

Anonymous

Anonymous Sat 23 Jan 2021 08:10:47 No.12617189 Report

Quoted By: >>12619134

>>12612218
If you still want a real answer... God this board is a mess.

Assuming this is a simple MLP or simple Feed Forward, no drop out or skips or not an RNN. Nothing than would make it more complex.
Just take the power consumption (like via nvidia-smi if you are training via GPU), the training time and the number of total nodes and make a simple division.

Anonymous

Anonymous Sat 23 Jan 2021 19:38:09 No.12619134 Report

Quoted By:

>>12617189
But that would give an average distribution of the power consumption. Shouldn't there be a difference between layers?

Anonymous

Anonymous Sat 23 Jan 2021 19:40:32 No.12619145 Report

Quoted By:

>>12614208
Does it matter?
Anyway even if you reduce the network with regularition,you still need to start with as large network aspossible.

Capcode	All Only User Posts Only Moderator Posts Only Admin Posts Only Developer Posts
Show Posts	All Only With Images Only Without Images
Deleted Posts	All Only Deleted Posts Only Non-Deleted Posts
Ghost Posts	All Only Ghost Posts Only Non-Ghost Posts
Post Type	All Only Sticky Threads Only Opening Posts Only Reply Posts
Results	All Grouped By Threads
Order	Latest Posts First Oldest Posts First

Your latest searches