What Makes the Web Slow? Really Slow?
January 28, 2021
I read “We Rendered a Million Web Pages to Find Out What Makes the Web Slow.” My first reaction was the East Coast Internet outage which ruined some Type A workers’ day. I can hear the howls, “Mommy, I can’t attend class, our Internet is broken again.”
Here’s a passage from the “Rendered a Million Web Pages” which I found interesting:
Internet commentators are fond of saying that correlation does not equal causation, and indeed we can’t get at causality directly with these models. Great caution should be exercised when interpreting the coefficients, particularly because a lot confounding factors may be involved. However, there’s certainly enough there to make you go “hmm”.
Yep, I went “hmm.” But for these reasons:
- Ad load times slow down my Web experiences. Don’t you love those white page hung ads on the YouTube or the wonky baloney on the Daily Mail?
- How about crappy Internet service providers?
- Are you thrilled with cache misses?
- Pages stuffed full of trackers, bugs, codes, and spammy SEO stuff.
Hmm, indeed.
Stephen E Arnold, January 28, 2021
Oracle: Dons White Hat and Tries to Lasso the StreamScam Stallion
January 6, 2021
Oracle is only of passing interest to me. I paid attention to Oracle’s search efforts which I think once involved Applied Linguistics, TripleHop, Endeca, and a natural language processing company crafting smart software to replace trained human customer support types. Oh, I did check out Oracle SES. As I recall, the first “s” meant secure. The idea was that Oracle had discovered that most vendors of enterprise search provided systems which were not focused on security. Just getting these overhyped and jargon infused puppies to behave consumed quite a bit of developer, MBA, and ultimately CPA energy.
I read “Oracle Exposes largest CTV Ad Fraud Operation Ever” reminded me of the Oracle Secure Enterprise Search hook: Enterprise search vendors did not provide Oracle-grade security. That was, as it turned out, mostly true. The problem was that organizations purchasing enterprise search were not buying security. The organizations were trying to find a way to deal with the increasing flows of digital information. Prior to the implosion of Delphes, Entopia, and Fast Search & Transfer — tools were not widely available. Today, of course, an enterprising and enthusiastic developer can download or just use a variant of Lucene/Solr. Amazon and IBM support these open source solutions, and developers of proprietary systems like Coveo, Mindbreeze, and others have to put in extra hours on their sales Peloton’s to generate sustainable revenues. Oracle’s run at enterprise search as security disasters went nowhere.
Now Oracle has donned its white hat and is now lassoing or trying to lasso the online advertising sector. In my research, we have encountered numerous reports of online advertising fraud. Making the charges stick to the Google, its subsidiaries, outfits like Facebook, and the third party intermediaries has been difficult. Gobbledygook explanations and intentional complexity are designed to keep those advertising dollars flowing.
The StreamScam is probably the first in a series of oracular pronouncements about alleged fraud: Click, view time, reach, etc., etc. The write up states:
StreamScam perpetrators capitalized on vulnerabilities in the technology used to improve the video viewing experience in CTV. Known as Server-Side Ad Insertion (SSAI), the technology combines content and ads into a single video stream that can play seamlessly with no delays on end-user devices, such as Roku, AppleTV, and FireTV.
The idea is that advertisers don’t know if online advertising generates sales. Marketers emphasize misty notions of brand and reach. But what’s an advertiser to do? The answer is that newspaper, television, and radio have been replaced among certain desirable market segments by streaming, Twitter, and podcasts. Therefore, a device like a smart TV seems to straddle some of these technology. Put money into connected televisions, YouTube, Facebook, and maybe — just maybe — TikTok.
I agree with “CTV Ad Fraud Schemes Like the One Oracle Exposed Will Become More Common But That Won’t Affect Marketers’ Spend.” I would, however, add a caveat to this write up’s assertion about not impacting what marketers spend and where; to wit: What choice does an advertiser have? Old fashioned direct mail? When the post office worked, that was an option. How about telemarketing? Some desirable demographics don’t answer their mobile’s chirps. What about billboard advertising? Covid and work from home may have reduced the impact of these view enhancing objects of art.
With more and more GenX and millennials positioning themselves as experts in social media, online ads are necessary complements to influencer campaigns. The Google ad purveyors are reassuring and armed with data illustrating that online ads really do work. Don’t like the Google rep, check out the Facebook pitch with micro and nano targeting that really works better than Googzilla’s approach. Amazon, Apple, yes. Options.
Net net: Ad fraud is endemic. No one survives who documents it. But Oracle has a white hat and maybe will own TikTok some day. There’s gold at the end of the digital rainbow even though one end is in Beijing.
My take: “Whoa, StreamScam! The Lone Oracle is gonna break you down. Right, Don Quixote.” (Tonto is in Covid lockdown.)
Stephen E Arnold, January 6, 2021
Another Xoogler Explains Algorithmic Manipulation
December 25, 2020
I don’t want to make a big deal about a former Google engineer talking about algorithmic manipulation. I know what happened to the Google AI expert who pointed out that training data can and does bias how numerical recipes make decisions.
I spotted this 2019 statement in an ancient write up called “‘YouTube Recommendations Are Toxic,’ Says Dev Who Worked on the Algorithm.” The speaker is a Xoogler (the semi official name for someone who worked at the Google) who allegedly worked on the YouTube recommendation algorithm. Now keep in mind that the Google is a pretty chaotic outfit, and it is possible for people to “work on” something and the outside world will have zero idea whether the contribution was a quality test or something substantive like fiddling thresholds to meet a harried superior’s goal. (Bonus time causes some interesting activities I have heard.)
Here’s the quote, and I have put in bold face the important statement which I found important and possibly accurate:
“It isn’t inherently awful that YouTube uses AI to recommend video for you, because if the AI is well tuned it can help you get what you want. This would be amazing,” Chaslot told TNW. “But the problem is that the AI isn’t built to help you get what you want — it’s built to get you addicted to YouTube. Recommendations were designed to waste your time.”
The write up does not dig into wasting time. I want to share my perception of the time wasting angle. In the good old days, Web sites wanted to be sticky. That’s why mere search engines became portals and eventually massive one stop shops with everything in one “experience.”
For YouTube, the more time a person invests in watching videos on YouTube, the more ads Google can slam into the video. If you think there are a lot of ads for a video now, just wait until the “game plan” is rolled out to the Googlers in the Spring of 2021.
Therefore, the purpose of the YouTube algorithm is to create opportunities to display ads. Are these relevant or irrelevant. I must say that I am quite adept at clicking past blandishments for Grammarly, Liberty Mutual, and many other hapless companies dumping cash into the coffers of the world’s most wonderfulest Web search system. Grammarly, isn’t “wonderfulest” a real word when used with “Google”? Maybe I should as DeepMind? Oh, right. DeepMind is busy doing healthy things and losing hundreds of millions of dollars every year.
Burn that ad inventory. Absolutely.
Stephen E Arnold, December 25, 2020
A Facebook Promise: Good As Gold
December 3, 2020
Oops. A Facebook algorithm’s mistake is causing the company to offer apologies and refunds, we learn from CNBC’s article, “Facebook to Reimburse Some Advertisers After Miscalculating Effectiveness Data.” Citing a report from Ad Exchanger, writer Lucy Handley informs us:
“The company’s ‘conversion lift’ tool suffered a glitch that reportedly affected thousands of ads between August 2019 and August 2020. Facebook fixed the error in September and is now offering a credit to clients ‘meaningfully affected’ by the bug. Conversion lift helps brands understand how ads lead to sales, using a ‘gold-standard methodology’ that links ads on Facebook’s platforms, including Instagram, to business performance, according to an explanation of the tool on Facebook’s website. The free tool shows ads to separate test and control groups and then compares sales conversions for each. Then, based on the results of the study, an advertiser can decide how much to spend on the social network.”
Though the error was discovered and fixed in September, the company is just now getting around to informing clients. According to Facebook, a “small number” of advertisers were affected, though what that behemoth considers a small number is unclear. Handley reminds us:
“This isn’t the first time Facebook has admitted mistakes in reporting. In September 2016, it said it overestimated the average time people spent viewing video ads over a two-year period, and in 2017 a report found that Facebook claimed to reach more people in some U.S. states and cities than official population data said existed in those areas.”
Yep, Facebook is starting 2021 with its true colors flying. I suppose it is nice to see some things remain consistent.
Cynthia Murrell, December 3, 2020
The European Competition Commission Goes for the Throat
November 3, 2020
I wanted to note the October 30, 3030, Reuters’ story “Online Giants Will Have to Open Ad Archives to EU Antitrust Regulators.” At last regulators are taking steps to gain access to the systems and methods used by Google and other online ad giants. The news story helps cement Margrethe Vestager as someone who uses her position to do more than posture. Also, the news story points out that there is a research agency called Algorithm Watch.
The problem is that the companies asked to provide information have legal options. The delays are likely to slow the regulators’ quest for data. If sufficient time goes by, the landscape can be reworked. Internet time is different from regulators’ time.
There is a counter point. Navigate to “Monopoly Power Is Less Dangerous Today Than in Past.” The argument set forth in this Telegraph Herald write up is unlikely to have a significant impact on the good ship SS Margrethe.
Stephen E Arnold, November 3, 2020
Google Filtering: How Smart Is Software?
October 27, 2020
I included a screenshot illustrating YouTube search results which make it clear how to obtain without charge copy-protected commercial software. You can read that story and see the screen shot at this link. I want to document a Reuters’ report called “Italy’s Communications Watchdog Fines Google for Betting Ad.” The news item documents that Google was fined for running gambling ads. The DarkCyber research team has been monitoring some of the questionable video streaming sites. Advertisements are appearing on these sites in greater numbers. What vendors are providing these paid messages? At this time, there’s no open source information about the intermediaries involved.
Questions:
- Why doesn’t filtering by key word work for Google advertisements? Gambling seems to be a no brainer.
- Why are Google YouTube search results providing recently updated links to video content which appears to violate a number of rules and regulations? The word “crack” is like gambling a seemingly obvious yellow caution light.
- What are the names of the ad agency intermediaries providing advertisements to what appear to be illegal video streaming sites?
Interesting? The DarkCyber research team finds the subject engaging. Smart software seems to have some blind spots.
Stephen E Arnold, October 27, 2020
Streaming Data: Does the Information Presage the Future for Google Advertising?
October 12, 2020
DarkCyber is not populated with work hour gamers. (Tibby is the exception.) One of the research team spotted “Streamlabs & Stream Hatchet Q3 Live Streaming Industry Report.” The summary contained an interesting factoid, which we assume is spot on. Here it is:
Twitch now represents 91.1% of the market share for hours streamed, up 14.5% from last quarter. This massive increase can be attributed to Mixer’s shutdown, which captured 14.2% of all content live-streamed last quarter. That is compared to Facebook Gaming, which now represents 3.4% of the market share, and increased by 1% since last quarter, and YouTube Gaming, which now represents 5.5% of the market share, and decreased by 1.2% since last quarter.
The data prompted a question from one of the DarkCyber researchers:
What is the likelihood that Amazon’s online streaming advertising follows a similar path?
At lunch on October 8, a number of ideas floated above the miasma of take out Chinese:
- No way, José. Google will find a way to get into the online streaming money flow.
- Yikes. Google may be too distracted by removing features from its lackluster mobile devices, fending off regulators, and dealing with its “human resource” issues to respond in an effective manner.
- The Bezos bulldozer grinds forward. The effectiveness of Amazon in multiple market sectors may push Google and others aside. Product searches and product advertising are likely to be more important as the retail sector in the US erodes.
Which is it? Worth watching.
Stephen E Arnold, October 12, 2020
After 20 Plus Years, Whoa! Surveillance by Big Tech
August 10, 2020
DarkCyber has noted a flurry of write ups expressing surprise, rage, indignation, and blusterification at the idea of a commercial company collecting data. Hello, services are free for a basic reason: Making money. Part of making money is to have something that other companies and organizations will purchase. A good example is personal information about users of free services. The way big companies work is that there is a constant pressure to find new ways to generate money. Thus, there are data sucking apps; there are advertisements and more advertisements; there are subscriptions which lock in revenue while providing an Amazon-style we know a lot about those who shop on Amazon; and there are many ornaments on these methods.
I got a kick out of “Silicon Valley’s Vast Data Collection Should Worry You More Than TikTok.” We know the story well. Commercial firms in the US gather data and license it, often to marketing firms and to other organizations. After two decades of blissful ignorance a devoted band of “real” journalists are now probing the core business model of many technology centric companies.
Give me a break. We are talking decades of business processes designed to generate useful reports from flows of actions by individuals. In some countries, the government performs this task. In others, commercial enterprises do the work and license the normalized data to governments.
This passage from the write up tickled my funny bone:
And none of this is unreasonable. We should be worried about private companies and governments potentially collecting data on millions of unsuspecting people and censoring content they don’t like. But those based in China represent just a sliver of that threat.
Yep, the old “woulda, coulda, shoulda” ploy. May I remind you, gentle reader, that we are decades into the automation of data about the actions of individuals. These are the happy and often ignorant humanoids who download apps, run queries, click on videos, and send personal message while leaving a data trail a foot deep and a mile wide.
And now the need for something?
And data collection is not a technical and economic issue. Nope. Data collection is politics; for example:
TikTok’s critics might point to the increasingly scary behavior of China’s government as to why Chinese control of information is particularly alarming. They’re right about the behavior, but they curiously ignore the fact that the United States itself is currently governed by a far-right demagogue with his own concentration camps and authoritarian repression, and that the party behind him, which aligns entirely with his politics, reliably cycles into power at least once every eight years.
What’s the fix? Well, “oppose it all.”
Where were the regulators, the users, and the competitors 20 years ago? Probably in grade school, blissfully unaware that those handheld gadgets would become more important than other activities. Okay, adult thumbtypers, your outrage is interesting. Step back, and perhaps you can see why the howls of outrage, the references to evil forms of government, and the horrors of toting around a device that usually provides real time documentation of one’s actions as a bad thing.
But after 20 years, is it surprising that personal data actions are captured, analyzed, and used to provide more data “stuff” to consume? As I said, its been 20 years with no lessening of the processes. Complain to your parents. Maybe they dropped the ball? Commercial enterprises and governments are like beavers. And beavers do what beavers do.
Stephen E Arnold, August 10, 2020
Hippy Dippy New Age Insight: Ads Are Numerous
July 30, 2020
I want to keep this brief. The number of ads is increasing. Avoiding them is difficult. Why? Zero controls, zero social responsibility, and zero regulatory oversight.
“I Was Horrified at How Many Ads the New Brave Browser and VPN for iOS Blocked” is amusing because it reveals the lack of awareness of the zip zip mobile world in which some hippy dippy New Age “real” news publications thrive.
The article states:
I visited a few of my favorite sites and then was promptly horrified when it told me that in about 3 minutes of browsing, it had blocked 107 ads and trackers and given me 2 HTTPS upgrades. Supposedly, this saved me five seconds of my life.
Not for long. Online advertisers share some DNA with bad actors creating novel malware. One difference: Law enforcement pursues the malware wizards. Online advertising outfits get invited to testify to a Congressional committee.
Stephen E Arnold, July 30, 2020
Scam Ads: Easy to Do Apparently
July 8, 2020
A somewhat shocking assertion appears in “Easy for Fraudsters to Post Scam Ads on Facebook and Google.” The article reports that researchers posted fake ads on Google:
They found that Google did review the adverts submitted, but failed to verify whether the business was real and did not ask for ID. In under an hour, the adverts were approved by the search engine firm for both dummy businesses, gaining almost 100,000 impressions over the space of a month. The fake advert for Natural Hydration was displayed above the official NHS Scotland pages when users searched for “hydration advice”.
A Facebook ad was given similar treatment:
using a personal Facebook account, Which? created a business page on the social network for Natural Hydration and produced a range of posts with pseudo health advice to promote it. A paid promotion of the page gained some 500 likes in the space of a week. Facebook responded to the investigation saying the page set up by Which? does not violate its community standards and is not currently selling products.
Are these data accurate? Regulatory authorities seem to lack tools to influence the large online advertising monopolies.
Stephen E Arnold, July 8, 2020