So how long are tasks taking on the new Raspberry Pi2 as apposed to the old Pi's?
Obviously there can be four at a time, but how much faster is the neon ARM7 versus the ARM6 for Einstein@home.
I have i think 20 Arm cores crunching across various projects, Here they get:
My Pi2 running 2 up is coming in at around 68k secs:
My 2012 Nexus 7 on Android 4.4.4 (clocked at 1200MHz) comes in 50k to 65k secs:
(only running three up, running three BRP tasks make it very laggy, so try and keep at least one task from another project running)
My 2012 HTC One S running Android 4.1.1 (clocked at 1500MHz) is fastest at around 27k secs, But produces eithier validate error, or Error while computing:
(I still think there's an api problem hanging around to do with power saving and critical sections)
The EaH client running on the board is a custom build with specific optimization flags for this ARM.
Interesting. How much faster is that version than the stock E@H version on that board? We know we are losing some performance by doing more stuff in-place in our ARM variants (Linux and Android), so that the RAM used is max ca 125 MB, which matters a lot on those tiny devices.
Interesting. How much faster is that version than the stock E@H version on that board? We know we are losing some performance by doing more stuff in-place in our ARM variants (Linux and Android), so that the RAM used is max ca 125 MB, which matters a lot on those tiny devices.
HB
With the default boinc (ubuntu) version 7.0.27 this board is not supported by EaH.
To be clean, I do not change the source code. I only change the optimization flags for the GCC in the build script and Makefiles.
PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
24229 boinc 39 19 130304 128188 5024 R 99.5 12.8 886:47.71 einsteinbi+
24366 boinc 39 19 130304 128188 5024 R 99.5 12.8 875:44.73 einsteinbi+
24066 boinc 39 19 130304 128268 5104 R 98.8 12.9 899:39.99 einsteinbi+
24134 boinc 39 19 130304 128184 5020 R 98.5 12.8 894:21.25 einsteinbi+
[/pre]
As you can see mine has a bit of free memory that it could utilise.
PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
24229 boinc 39 19 130304 128188 5024 R 99.5 12.8 886:47.71 einsteinbi+
24366 boinc 39 19 130304 128188 5024 R 99.5 12.8 875:44.73 einsteinbi+
24066 boinc 39 19 130304 128268 5104 R 98.8 12.9 899:39.99 einsteinbi+
24134 boinc 39 19 130304 128184 5020 R 98.5 12.8 894:21.25 einsteinbi+
[/pre]
As you can see mine has a bit of free memory that it could utilise.
Probably it would gain something. But with memory requirements close to 200 MB or even a bit higher per task IIRC for the out-of-place version, it would be really tight to run 4 of them in parallel even on a device like your's with 1GB RAM so it's no option for our official ARM apps.
We are currently re-unifying our code branches for the regular x86 CPU&GPU apps with the (up until now) somewhat experimental ARM code line, and after that we'll put a new BRP source-code package online so everyone can do experiments on their own.
This give CPU temperature in degrees Celsius x 1000. My Raspi2 is running at around 68deg C with a tiny heat sink installed. [Thermal self-protection of the SoC is triggered at around 85 deg C or some such IIRC.]
ca 50k sec per task, or close to 14 h, at 1GHz, running 4 in parallel. Shortens the time to earn 1Mio credits to 6 years 5 months :-).
Note that for this to work, your RasPi2 needs to execute the NEON enabled app version, and you won't get that one if you are using the outdated BOINC 7.0.x client that is part of Raspbian "wheezy". You can however build your own from recent sources (I haven't tried the Raspbian "jessie" repo yet but that should work as well)
ca 50k sec per task, or close to 14 h, at 1GHz, running 4 in parallel. Shortens the time to earn 1Mio credits to 6 years 5 months :-).
Note that for this to work, your RasPi2 needs to execute the NEON enabled app version, and you won't get that one if you are using the outdated BOINC 7.0.x client that is part of Raspbian "wheezy". You can however build your own from recent sources (I haven't tried the Raspbian "jessie" repo yet but that should work as well)
HB
I've put it onto one of my Pi2's. I have two with copper heat sinks. One in a case is doing 62-63 degrees (stock speed). One still waiting on case to arrive (it got back ordered) so its sitting on top of the box it came in. It was doing 57-58 degrees before I put the fftw plan onto it, 40 degrees at idle (room temp is 30 degrees). After a few minutes running its up to 55 degrees.
As for the BOINC versions, Jessie has 7.4.23 which reports cpu features so it picks up the Neon app. Both of mine are running Jessie.
I presume this fftw plan would also work on the Parallella although probably not as optimal for the cortex A9 cpu (the Pi2 has cortex A7).
RE: So how long are tasks
)
I have i think 20 Arm cores crunching across various projects, Here they get:
My Pi2 running 2 up is coming in at around 68k secs:
http://einsteinathome.org/host/11741356/tasks
One of my Parallellas running 2 up comes in with similar times:
http://einsteinathome.org/host/11381548/tasks
My Samsung S5 mini on Android 4.4.2 (clocked at 1400MHz) comes in with similar times:
http://einsteinathome.org/host/11667599/tasks
My 2012 Nexus 7 on Android 4.4.4 (clocked at 1200MHz) comes in 50k to 65k secs:
(only running three up, running three BRP tasks make it very laggy, so try and keep at least one task from another project running)
http://einsteinathome.org/host/11543282/tasks
My 2012 HTC One S running Android 4.1.1 (clocked at 1500MHz) is fastest at around 27k secs, But produces eithier validate error, or Error while computing:
(I still think there's an api problem hanging around to do with power saving and critical sections)
http://einsteinathome.org/host/9721755/tasks
My single core HTC Desire S at Albert on Android 2.2.2 (clocked at 998MHz) comes in at 82k Secs:
http://albertathome.org/host/12645/tasks
I don't have times for my Model B Pi at present.
Claggy
RE: I think that EaH client
)
Yup, only that the ARM version of the BRP app is using in-place transforms to get a small memory footprint. rof --> rif
Nice performance from that board!
HB
RE: Yup, only that the ARM
)
Yes, you are right it is in-place.
The EaH client running on the board is a custom build with specific optimization flags for this ARM.
RE: The EaH client running
)
Interesting. How much faster is that version than the stock E@H version on that board? We know we are losing some performance by doing more stuff in-place in our ARM variants (Linux and Android), so that the RAM used is max ca 125 MB, which matters a lot on those tiny devices.
HB
RE: Interesting. How much
)
With the default boinc (ubuntu) version 7.0.27 this board is not supported by EaH.
To be clean, I do not change the source code. I only change the optimization flags for the GCC in the build script and Makefiles.
RE: Yup, only that the ARM
)
Would if be worthwhile using a "normal" version app to see if it gains anything for the extra memory usage?
[pre]
KiB Mem: 998096 total, 728616 used, 269480 free, 83984 buffers
KiB Swap: 102396 total, 0 used, 102396 free. 102236 cached Mem
PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
24229 boinc 39 19 130304 128188 5024 R 99.5 12.8 886:47.71 einsteinbi+
24366 boinc 39 19 130304 128188 5024 R 99.5 12.8 875:44.73 einsteinbi+
24066 boinc 39 19 130304 128268 5104 R 98.8 12.9 899:39.99 einsteinbi+
24134 boinc 39 19 130304 128184 5020 R 98.5 12.8 894:21.25 einsteinbi+
[/pre]
As you can see mine has a bit of free memory that it could utilise.
BOINC blog
RE: RE: Yup, only that
)
Probably it would gain something. But with memory requirements close to 200 MB or even a bit higher per task IIRC for the out-of-place version, it would be really tight to run 4 of them in parallel even on a device like your's with 1GB RAM so it's no option for our official ARM apps.
We are currently re-unifying our code branches for the regular x86 CPU&GPU apps with the (up until now) somewhat experimental ARM code line, and after that we'll put a new BRP source-code package online so everyone can do experiments on their own.
Cheers
HB
Time to share some wisdom:
)
Time to share some wisdom: next attempt, tailored to the Raspberry Pi 2 running the neon version of the BRP app.
Put into /etc/fftw/wisdomf
and restart boinc to force a restart of the science apps.
This might stress the SoC quite a bit, you can monitor CPU temperature like this :
cat /sys/devices/virtual/thermal/thermal_zone0/temp
This give CPU temperature in degrees Celsius x 1000. My Raspi2 is running at around 68deg C with a tiny heat sink installed. [Thermal self-protection of the SoC is triggered at around 85 deg C or some such IIRC.]
HB
YES, this wisdom file does
)
YES, this wisdom file does help quite a bit:
http://einsteinathome.org/task/484845661
ca 50k sec per task, or close to 14 h, at 1GHz, running 4 in parallel. Shortens the time to earn 1Mio credits to 6 years 5 months :-).
Note that for this to work, your RasPi2 needs to execute the NEON enabled app version, and you won't get that one if you are using the outdated BOINC 7.0.x client that is part of Raspbian "wheezy". You can however build your own from recent sources (I haven't tried the Raspbian "jessie" repo yet but that should work as well)
HB
RE: YES, this wisdom file
)
I've put it onto one of my Pi2's. I have two with copper heat sinks. One in a case is doing 62-63 degrees (stock speed). One still waiting on case to arrive (it got back ordered) so its sitting on top of the box it came in. It was doing 57-58 degrees before I put the fftw plan onto it, 40 degrees at idle (room temp is 30 degrees). After a few minutes running its up to 55 degrees.
As for the BOINC versions, Jessie has 7.4.23 which reports cpu features so it picks up the Neon app. Both of mine are running Jessie.
I presume this fftw plan would also work on the Parallella although probably not as optimal for the cortex A9 cpu (the Pi2 has cortex A7).
BOINC blog