Samsung Fudging Benchmarks Again On Galaxy Note 3 258
tlhIngan writes "A few months ago, Samsung was caught gaming benchmarks on the Galaxy S4 (International version). They would lock the GPU at a higher-than-normal frequency when certain applications were run, including many popular Android benchmarking programs. These had the expected result of boosting the performance numbers. This time, the Galaxy Note 3 was caught doing the same thing, boosting CPU scores by 20% over the otherwise identical LG G2 (which uses the same SoC at the same clock). Samsung defends these claims by saying the other apps make use of such functionality, but Ars reversed-engineered the relevant code and discovered it applied only to benchmark applications. Even more damning was that the Note 3 was still faster than the G2 when run using 'stealth' (basically renamed) versions of the benchmarking apps which did not get the boost."
If this was Apple... (Score:2, Insightful)
Re:If this was Apple... (Score:5, Insightful)
Re: (Score:3, Insightful)
Why would you even consider synthetic benchmarks when choosing a phone? Planning on trying some bitcoin mining on the go?
Re: (Score:2)
Why do people ever engage in pissing contests? So they can say "my phone's faster than your phone, nah nah," to try and get a "yeah, well, my dad can beat up your dad" response.
Re: (Score:2)
Not that simple. In fact, I've never seen anyone strut up and say "wanna race phones"?
Sure, given two of anything, the natural human thing to do is race them. Turtles, Frogs, even snails.
But when laying out money, people try to get the best buy for the buck, and not knowing how a phone will perform once you get it loaded down with apps, means that they have to turn to something that really stresses the phone over a short period of time hoping to measure the phone's ability to remain future proof a little b
Re: (Score:2)
For most phones this just means filling flash storage with apps. Generally speaking you can't run apps simultaneously and even when you can/do background apps it sucks the battery badly. And on phones the screen is too small to make use of any kind of PC style multi window multitasking even if the platform supports it (AFAIK even Android doesn't really support this).
While I think you're right that it would be nice to compare phone perf
Re: (Score:2)
Because I don't personally have the opportunity to benchmark every phone I'm considering buying - and benchmark data provides a clearer picture than the names and speeds of similar-sounding chips.
We know people game benchmarks - but a least we get in the ballpark.
Re: (Score:2, Flamebait)
Re: (Score:3)
http://androidandme.com/2013/07/devices/last-years-nexus-7-may-get-lag-fix-in-4-3-ota/ [androidandme.com]
http://www.engadget.com/2012/06/27/project-butter-improves-android-4-1s-speed/ [engadget.com]
http://www.engadget.com/2012/06/27/project-butter-improves-android-4-1s-speed/ [engadget.com]
Re: (Score:3)
Ohh BTW the search term "Windows phone 8 sucks" gets me 14 million hits. Obviously going by hits in Google isn't that great of a bench mark is it?
Re: (Score:2)
I'd bet 95% of that is from shit the carriers forced into it...
Re: (Score:2)
True.
I can't tell you how happy I've been buying the Google-direct versions of the Galaxy Nexus (original) and HTC One - both completely free of not only carrier bloat, but vendor homescreens and app-launchers.
Re: (Score:2)
Re: If this was Apple... (Score:2)
That's cause it runs Gears of Halo Theft Auto Mobile
Re:If this was Apple... (Score:4, Informative)
Re: (Score:2, Funny)
one million times each
Re: (Score:2)
Re: (Score:2)
Why?
If you want a closed down system you might as well get an iPhone.
Re: (Score:2)
Re: (Score:2)
Re: (Score:2)
Last I check the Nexus was a rebranded, unlocked, Samsung phone.
Futhermore, most of the common people, when they now say an "Android Phone", what they mean is a Samsung phone. That's the product they now associate with being the "Android Phone/Tablet". You'd be surprised how many don't get that LG, HTC, and others have the same "android" OS.
In fact I think it will be interesting to see what happens in a few years and if a couple players in the Android phone market drop out, after all I don't think the oth
Re:If this was Apple... (Score:5, Informative)
Not so much Samsung in the Nexus dept.
Nexus One was by HTC
Nexus 4 was by LG
Nexus 7 is by Asus
Nexus 10 is by Asus
Anyway, it doesn't matter to me who makes the phone, I look at the features, OS and apps. Samsung has done a good job of marketing the Galaxy series. Some people buy because of good marketing. I still hate it when the manufacturer or the telecoms giant mess with the interface and applications... it's usually not an improvement.
Re: (Score:3)
Nexus 5 is LG as well.
Galaxy Nexus was Samsung.
Re: (Score:2)
The Nexus 4 and up coming Nexus 5 are LG devices.
The Galaxy Nexus was the last Samsung Nexus device and it is already over a year out of date.
Samsung could try that, but then they lose access to the google play market and that ends their game right there.
Re: (Score:2)
Check again, the current Nexus 4 is manufactured by LG Electronics. The first Nexus was made by HTC. The Nexus S and Galaxy Nexus were made by Samsung.
Citation? Nexus
No Implication (Score:5, Insightful)
Are you implying that us android users wouldn't be "up in arms"?
No implication is needed, we can see quite plainly there is very little outcry over this, just as there wasn't before. Android users simply accept this is the way things are, in a way they do not with any Apple problem whatsoever.
Re: (Score:2)
Re: (Score:3)
Apple is held to a higher standard among consumers and industry for their behavior than Android makers are. This is just fact. Good news is that for the most part, they meet that standard.
Re: (Score:3)
Re: (Score:2)
Re: If this was Apple... (Score:2)
Almost as difficult as just hitting you over the head. Then just using your fingers. (Hopefully still attached)
That's probably the strongest case against it right there... You don't have to be AWAKE for your finger to unlock the phone. Thugs and thugs with uniforms don't have to beat a pass code out of you now.
Re: (Score:2)
Then don't use the fingerprint sensor. You can still use a good-old-fashioned 16 character password made up of lowercase, uppercase, numbers and symbols.
But in introducing the feature, apple even said that 50% of users don't create a passcode, even the 4 digits one, because people find it inconvenient. The fingerprint sensor is for THOSE users. To make "good enough" security convenient.
If you need/want something more than "good enough" you can still do that.
Re: (Score:3)
But in introducing the feature, apple even said that 50% of users don't create a passcode, even the 4 digits one, because people find it inconvenient. The fingerprint sensor is for THOSE users. To make "good enough" security convenient.
I like the locking feature, but would like it to be GPS based, or even phone tower / wifi name. When I'm at home I don't want (or need) a lock on my phone, but when I'm away from home I do. Work could be an options for people too, but I work in a school - there's no way I'm leaving that unlocked, even on my desk.
Re: (Score:2)
You'd think they'd stop doing this, since they keep getting caught. However, crime pays. They've made substantially more money off of their phones than the puny fine they had to pay in the Samsung vs Apple trial. They doing it again to Dyson vacuums.
The market is rewarding their bad behavior, and they're going to just keep going.
Samsung's integrity is closely tied to the size of their profits. Once the lack of integrity starts impacting their bottom line, they'll find some. Until then, it's business as usua
Re:If this was Apple... (Score:4, Insightful)
You've obviously never used their Blu-Ray players.
Re: (Score:2)
I've got one, and haven't found it that bad.
Granted, there's no way in hell I'm connecting it to a network so they can decide on a whim to update it, so I haven't seen the full extent of how annoying they can be.
But the device itself I've never had issues with ... what unspecified evil are you alluding to?
Re: (Score:2)
Don't they force you to update to watch the latest discs? Or are you implying that those updates are better-tested
Re: (Score:2)
Count yourself lucky. Both of mine experienced major performance degradation immediately after firmware updates, and they never fixed the bugs.
I'm sorry, but if your Blu-Ray player can't play a DVD without pausing at the layer gap, you really have no business designing hardware. My laptop can spin up the optical drive for a few seconds ever
Re: (Score:2)
I'll take your word for it.
I've never been interested in whole streaming thing, and I think the whole idea of having my DVD player hooked up to the internet means sooner or later someone will disable something/break it for me because it can access the internet and update itself.
I just don't trust the vendors of consumer electronics with direct access to the internet -- because t
Re:If this was Apple... (Score:5, Informative)
Re: (Score:2)
What is the point of a 64-bit phone? Do you really need more than 4gb of RAM allocated to a single process on a phone?
Re: (Score:2)
Re: (Score:2)
Didn't we have PAE to solve that on x86?
Re: (Score:2)
What is the point of a 64-bit phone? Do you really need more than 4gb of RAM allocated to a single process on a phone?
Do a bit of googling about it. 64 bit = 28 general purpose 64 bit registers instead of 13 general purpose 32 bit registers. Significant advantages in Objective-C. Significant advantages in C++, like any std::string up to 22 chars using just three words of storage.
Re: (Score:2)
Re: (Score:2, Funny)
If Apple did this, people would be up in arms!
Both Apple and Android use ARM CPUs.
Now, if it had been happening on the Surface tablets (non-RT, of course) people would be up in Atoms.
Re:If this was Apple... (Score:5, Funny)
If Apple did this, they would be suing Samsung for copying.
Re: (Score:2)
If Apple did this, people would be up in arms!
Both Apple and Samsung use ARM CPUs.
Now, if this had been a Surface (non-RT) tablet, people would be up in Atoms!
Re: (Score:2)
If Apple did this, people would be up in arms!
There's a long history of apple making exaggerated claims of what their devices are going to do for you. People have been up in arms for a long time.
Re: (Score:2)
Re: (Score:2)
If apple *got caught* people would be up in arms.
Re: (Score:2)
By insisting on being the only
Re:If this was Apple... (Score:5, Funny)
Wrong, they DID care about benchmarking once, while Macs were still Power PC based, then they got caught fudging the benchmarks approximately like Samsung did. They suddenly decided that benchmarks didn't matter soon thereafter. They ought to sue Samsung again, because business methods are patentable...
Re:If this was Apple... (Score:5, Informative)
Citation required, because all I can find is: http://www.theregister.co.uk/2003/06/25/apple_denies_fiddling_g5_xeon/ [theregister.co.uk] ... which seems to be refuting the claim...
Simon.
Re: (Score:2)
It's apparently refuting the fudging part, but it's confirming that Apple DID care about benchmarks, you don't deny fudging something you don't care about at all, no?
Re: (Score:2)
Here is your citation. (Score:5, Informative)
Ok, I remember reading the Apple benchmarks myself (in utter disbelief - even for Apple it seemed too much), and this article you linked to does not agree with my memory. So let's go directly to the source. Read that benchmark paper yourself on archive.org : http://web.archive.org/web/20030727103031/http://veritest.com/clients/reports/apple/apple_performance.pdf [archive.org]
I gave it a quick look to refresh my memory and here are some highlights:
- They DISABLE hyper-threading on the SPEC rate test, which is the multi-processor test. Then, they ENABLE hyper-threading on the SPEC base, which is the single-processor test!!! They defend this by saying something like "hyper-threading is slower some times". Well, they sure know that, since they only enable it when it will slow down the Pentium! I would have given them the benefit of doubt if they had disabled (or enabled) it for both tests, but selectively enabling/disabling it means you know what you are doing.
- They use -O3 -fast -ffast when compiling for Apple, which uses fast math non-IEEE optimizations. Of course they had the Intel CPU run accurate/IEEE spec code - there is no equivalent -ffast-math used.
- They go on making some other "crazy" optimizations on the G5 like "modify CPU registers to enable memory Read By-pass", or installing a special malloc library that optimizes for speed by sacrificing memory just for the single-threaded benchmark. This is not how you benchmark for comparison purposes, especially if your optimizations for the competing platform are "turning off update" and "turning off hard drive sleep" (they obviously put that stuff just to pretend they "optimized" there as well).
And I am sure there are other things as well, this was from a quick read. And of course let's not mention that they compare the G5 with an Intel P4 CPU, when, at the time, AMD's Athlons/Opterons (64bit versions were just out as well) were destroying Intel (in performance, not sales - but that is another story).
In general, that paper is so ridiculous that I can't believe Apple had kept promoting it after they had been outed. But then again, given Apple's target audience, the explanation is simple. What was even more ridiculous is that when Apple started selling the Intel-based Mac they had kept for a while the section of their website that showed how much faster the G5 Mac was compared to Intel and then on the Intel Mac pages they had comparisons which showed how the Intel Mac is faster than the G5 Mac. No shame!
Re: (Score:3, Interesting)
Samsung denies also, i guess it's all fine then.
Refutation and denial are not synonymous terms. While the G5 case has decent responses, Samsung's responses have been less than honest.
With the Galaxy S4 GPU, their rationale was that other apps (not benchmarking tools) *can* enjoy the higher speed, yet to avoid heat issues this will be strictly limited in availability and time. Portable computing always has to strike this balance, so that's fine. But why then would benchmarking applications get carte blanche to run at higher speeds if the apps (including g
Re: (Score:2)
I think you are lying. Sources?
Approximately the same way would be deliberately checking for a benchmark and artificially boosting performance to do well on that particular benchmark which is un-producable outside of that benchmark.
That is not the same thing as choosing benchmarks your product is good and and highlighting those benchmarks in your marketing material. That's just highlighting your strengths.
8 years ago? (Score:5, Insightful)
It's hard to find info on it, and it was at least 8 years ago. So you're saying that Samsung's benchmark juicing today is like Apple choosing the Intel compiler with extra options back in the day?
This is what Samsung does, in pseudocode:
if app.name == benchmark speed up
This is what Apple did on its benchmarks:
# for G5
cc test.c -altivec
# for x86
gcc test.c
If you can't tell the difference between the two, you're either stupid, or Samsung.
Re: (Score:2)
Should have done a battery benchmark (Score:5, Interesting)
Re: (Score:3, Insightful)
The benchmarking itself seems to be flawed. Samsung wants to benchmark the devices at their full capacity, to see what they are capable of (the higher setting is reached in normal use of some apps anyway). The testers would probably like to do real world comparison tests (and not rely just on numbers). I don't see Samsung doing anything wrong here, even though the benchmarking apps are specifically chosen.
Re:Should have done a battery benchmark (Score:4, Insightful)
If the hack were to go to full power when plugged into the wall, then I could maybe see a case for this being legitimate, since it would mean you could theoretically achieve the same results by simply playing your game or whatnot while plugged in, but because they're only switching on full power mode for a handful of specific benchmark applications there is just no excuse.
Re: (Score:2)
I'm not sure if they actually inflate anything, because they simply set the device to the highest setting (and it's not higher than where the device sometimes operate even normally).
If you read that Ars Technica article it's evident that the max setting is reached in typical camera use for example. It's not limited to benchmarking apps. For benchmarking apps it's always on. This would be consistent with the idea of testing the device at max capacity. How about if the OS itself is hacked to give 533 MHz cons
Re: (Score:2)
The benchmarking itself seems to be flawed. Samsung wants to benchmark the devices at their full capacity, to see what they are capable of (the higher setting is reached in normal use of some apps anyway). The testers would probably like to do real world comparison tests (and not rely just on numbers). I don't see Samsung doing anything wrong here, even though the benchmarking apps are specifically chosen.
You have a point, that a device should be tested to its highest capacity.
However, if EVERY other application is PREVENTED from achieving that highest capacity the tests are already biased.
And in truth, that is not always a BAD thing. After all, locking the other cores to full power for a test, but preventing that for day to day usage is simply a battery saving feature. Having multiple cores is useful so that other work can be performed at the same time as your phone checks mail or some such.
But on the oth
Re: (Score:2)
4 cores at 2.3Ghz? It might catch on fire. This is using the big.LITTLE chip right? I don't think they're designed with a thermal envelope to stay fully clocked and on for that long.
Comment removed (Score:5, Interesting)
Re: (Score:2)
Re: (Score:2)
Re: (Score:2)
The Note was a couple of percent faster on renamed benchmark apps, and a whopping 20% faster on normally named benchmark apps. The point is that they were already faster so cheating wasn't even necessary.
Not really. A couple of percent faster is not humanly detectable. 20% is.
Any programmer worth his pay can tune code to achieve 2%.
Re: (Score:2)
I wouldn't call it damning, either, but it doesn't prove that the optimizations aren't app specific. The phone might still bench faster with non-renamed tools than with renamed ones (indeed, that's what the summary seems to claim). I would argue, however, that if the Note 3 beats the G2 "naturally", then there wasn't really any need to cheat this way.
Regardless, I'm with others: it's misleading at best, false advertising at worst.
Re:Does not computer (Score:4, Informative)
There is a file containing a list of all the common benchmarking apps, and everything in the list is a benchmarking app - nothing else. When one of those packages is run, the phone locks the frequency of all cores to fMax and also seems to fiddle with the GPU.
The result is a battery-nightmare, but a boost of 20% to *only* benchmark apps. This is despicable - plain and simple.
See http://arstechnica.com/gadgets/2013/10/galaxy-note-3s-benchmarking-adjustments-inflate-scores-by-up-to-20/ [arstechnica.com]
Simon.
Re: (Score:3)
Why did you feed us a link to the exact same Article as the Summary does?
Did you think that would trick us into reading the full article?
This is Slashdot. We are wise to those ploys.
Re:Does not computer (Score:5, Informative)
Here is what the article actually says:
The ironic thing is that even with the benchmark booster disabled, the Note 3 still comes out faster than the G2 in this test. If the intent behind the boosting was simply to ensure that the Note 3 came out ahead in the benchmark race, it doesn't appear to have been necessary in the first place.
Apparently the "damning" part was completely fabricated by the submitter.
Re: (Score:2)
Here is what the damning article actually says:
Apparently the "damning" part was completely fabricated by the submitter.
This kind of stuff is done by everybody. I was forced to go to COMDEX once and marketing wanted something to draw people at the show and so somebody :) created a " matching tweak" because they looked in the competitors code and saw a cheat. I have seen some really good bench mark cheats and what I saw most at COMDEX was engineers upset because somebody else's cheat was so flagrant and over the
Re: (Score:2)
Looks like the submitter confused irony with "damning". Not exactly great journalistic writing in the summaries provided by submitters lately. Meh, that's what we get for paying them absolutely nothing I guess.
Re: (Score:2)
Re: (Score:3, Insightful)
Not sure how this is "damning". I'd have thought it would prove the principle that the optimizations aren't app specific.
What am I missing?
It's not app-specific, it's app *name* specific. It's analogous to the Quake/Quack benchmark scandal [hardocp.com] years (OMG, more than a decade...time flies) ago. Samsung wrote this boosting protocol to enable itself when running benchmarks and *only* when running benchmarks. There is no legitimate way to invoke it, so no user will ever see the benefit of it when running any app *other than* the benchmark itself.
For the inevitable car analogy: you take a Samsung car for a test drive, and when you floor it you feel 200h
Re: (Score:2)
I'd have thought it would prove the principle that the optimizations aren't app specific.
The optimizations ARE app specific, in that they are specific to all apps except benchmarks.
Granted, you can twist your mind around to see the reasoning behind it, but in doing so you must come to the conclusion that using all 4 cores is so expensive in terms of battery power that running 4 cores is for the most part forbidden. And if forbidden, why have 4 cores?
It locking fore cores to high power mode yielded a 2% advantage in performance this wouldn't be a big deal.
But it yields a 20% performance boost in an app that is already stressing the processor.
Doesn't that mean that the penalty for allowing apps to spin up 4 cores to handle peak load is SO BATTERY INTENSIVE, that Samsung won't allow it?
Isn't that an argument for two cores?
Yawn (Score:4, Insightful)
Samsung != Android (Score:3)
Take it out on Samsung for doing evil, or at the very least getting caught at it.
Re: (Score:3)
No shock ... (Score:5, Insightful)
There's lies, damned lies, statistics, and vendor performance numbers.
I'm a little disappointed that there isn't actually any penalties for fudging your benchmarks -- it's blatantly lying to consumers about your product.
And to me, that seems like it's bordering on fraud.
Re: (Score:2)
I'm a little disappointed that there isn't actually any penalties for fudging your benchmarks
But there are. For example here in Finland you could release the hounds of KKV [www.kkv.fi] (Competition and Consumer Authority).
Huh? (Score:4, Insightful)
Wait, what? How is that more damning? It sounds like that means the benchmark is faster even without cheating, which means that they've changed the kernel scheduler/idle timers/clock stepping in a way that, at least for the sorts of tests performed in the benchmark, improves performance—presumably because their case design and/or battery capacity is better, allowing them to get away with less processor throttling. That sounds like it is almost inarguably a good thing. And that's coming from somebody who has dealt with several of Samsung's products and hated almost all of them. What's with the hate?
Unless, of course, they're being too aggressive about keeping the clock speed high, in which case you might argue that their battery life isn't what it should be... but that's pretty subjective.
Re: (Score:2, Informative)
I was perplexed too. After reading the article, I figured it out:
It's damning in the sense that it's meant to skew comparisons with other Android devices, not to make it look better than their own previous offerings.
Some People just have to cheat (Score:2)
All of this and the benchmarks are still subpar (Score:2)
Browser speed still behind the iPhone: http://images.anandtech.com/graphs/graph7376/58440.png [anandtech.com]
Graphics performance still behind the iPhone: http://images.anandtech.com/graphs/graph7376/58425.png [anandtech.com]
For a device this much larger and heavier, you would expect they could overclock the chipset and get more performance than this.
Re: (Score:2)
Re:All of this and the benchmarks are still subpar (Score:4, Insightful)
Battery life still behind the iPhone: http://images.anandtech.com/graphs/graph7376/58409.png [anandtech.com]
You are comparing a phone with a 4 inch screen, with a "phone" that has a 5.7 inch screen. You can't compare battery life when the screen is what uses up most of the power. If you want a huge screen you have to compromise on battery life (and many other things - seriously, the note is ridiculously big to use as an every-day phone).
Browser speed still behind the iPhone: http://images.anandtech.com/graphs/graph7376/58440.png [anandtech.com]
I don't suppose Samsung can do much about that. It is quite possible that with the same CPU, an Android would still be slower than an iOS device. Sure, Google has made a fast Java VM, but it still is a Java VM, right? For example, I had a Nokia N9 running Meego/Maemo. It could run circles around Android phones with the same CPU.
Graphics performance still behind the iPhone: http://images.anandtech.com/graphs/graph7376/58425.png [anandtech.com]
Ehm, this result (to which you cleverly linked directly - hiding the context) is ran in native resolution. The Note has almost 3x the iphone's resolution, so it would be pretty strange to come on top in fps. But in all the other GPU benchmarks which are ran at 1080p it does come on top of the iphone.
But in any case I personally prefer a phone that has a good battery life, it can fit in my hand and lets me do whatever I want with it. So that rules out the note and the iphone ;)
Benchmarks are bad metrics (Score:2)
Benchmarks are problematic by their very nature in that they are typically predictable and a manufacturer can simply say they have tuned their product for a given application. Let me give a good example of this from a product that isn't made for consumer use just to make my point:
http://www.anandtech.com/show/7165/ocz-announces-zdxl-pcie-sql-accelerator-ssd-solution [anandtech.com]
This is a PCIE SSD product designed to boot server performance explicitly for Microsoft MS SQL Server. This product has been explicitly designed
Re: (Score:2)
The difference is whether they admit up-front that it's been optimized for a specific application, or act like it handles *everything* better when that is not in fact the case.
Does anyone really look at Manufacturer's numbers? (Score:2)
Really? That's like using Slashdot polls for something important. Non-story.
The only meaningful benchmarks... (Score:2)
Anything else is just a made up number.
Re:Humans will be Humans (Score:5, Insightful)
If someone is surprised Humans are willing to cheat, rip off, etc to get ahead... well you haven't really been paying attention.
Fixed that for you.
I wouldn't want people to unfairly categorize you as a racist moron.
Re:Humans will be Humans (Score:4, Insightful)
That's fair comment on the original post, but let's narrow it down a bit...
"If someone is surprised that a manufacturer with a track-record of fudging benchmarks is willing to cheat, rip off, etc to get ahead... well you haven't really been paying attention"
Not all humans are morally and ethically bankrupt. Samsung (as a corporate entity) is though.
Simon
Re: (Score:2)
If someone is surprised Humans are willing to cheat, rip off, etc to get ahead... well you haven't really been paying attention.
Fixed that for you.
I wouldn't want people to unfairly categorize you as a racist moron.
Samsung represents a very big chunk of Korean electronics industry and they have the responsibility to choose what kind of image they want to give about the practices of that particular industry.
On individual level, I believe there are many honest Korean people too.
Re: (Score:2)
Re:Failed to note (Score:4, Informative)