Social Media Outputs: Aloft Like a Cooling Hot Air Balloon?

August 4, 2023

Note: This essay is the work of a real and still-alive dinobaby. No smart software involved, just a dumb humanoid.

I found the assertions in “They Need Us. We Don’t Need Them: The Fall of Twitter Is Making the Trolls and Grifters Desperate” in line with my experience. The write up asserts:

The grifters that make up the troll-industrial complex are not okay.

If you want the political spin on this statement, please, navigate to the source document. I want to focus on the observation “They need us. We don’t need them.” I think social media companies and those who have risen to fame on clicks and hyperbole are going to try to inflate ever more colorful balloons. Their hope is to be seen as rulers of the sky. F-35s, addled doctors flying Cessnas, and hobbyist drones are potential problems for the hot air crowd.

[Illustration: social media balloons]

The colorful balloons compete for attention. What happens when the hot air source cools? MidJourney would not depict a balloon crash into a pre-school playground. Bummer.

Let’s go back in time. In the 1980s, there were two financially successful and highly regarded business information commercial databases. One of the two companies had the idea that it could generate more revenue by pulling out of the online distribution agreements upon which the commercial database ecosystem depended. I don’t expect anyone reading this essay to remember DataStar, Dialcom, ESA Quest, or the original LexisNexis service. The key factoid is that if one wanted to deliver an electronic business information product, the timesharing outfits were the enablers. Think of them as a proto-Google.

How did that work out?

After quite a bit of talking and thinking, the business information company resigned itself to the servitude under which it operated. It was decades later that Web accessible content and paywalls began to make it possible for a handful of companies to generate revenue without the old timesharing intermediaries.

Few know the names of these commercial databases which once were the cat’s pajamas.

The moral of the story, from my point of view, is that people or services which view themselves as important enough to operate outside of an ecosystem have to understand the ecosystem. Alas, too many individuals perceive themselves as being powerful magnets. Sure, these individuals or companies have a tiny bit of magnetic power. However, without the ecosystem and today’s enablers, the reality is that their “power” is not easily or economically amplified.

From my point of view, social media provided free, no friction amplification. For that reason, I want social media regulated and managed by responsible individuals. Editorial or content guidelines must be promulgated and enforced. The Wild West has to be converted into a managed townhouse community. Keep in mind that I am a dinobaby, and I am not sure arguments about the “value” of social media will be processed by my aged mental equipment.

Just look around you in an objective manner. Nice environment, right? Now we have balloons of craziness drifting above in an effort to capture attention. What happens when the hot air source cools? Back down to earth and possibly without a gentle landing.

Stephen E Arnold, August 4, 2023

Need Research Assistance? Skip the Special Librarian. Go to Elicit

July 17, 2023

Note: This essay is the work of a real and still-alive dinobaby. No smart software involved, just a dumb humanoid.

Academic databases are the bedrock of research. Unfortunately, most of them are hidden behind paywalls. If researchers get past the paywalls, they encounter other problems with accurate results and access to texts. Databases have improved over the years, but AI algorithms make things better. Elicit is a new database marketed as a digital assistant. It has less intelligence than Alexa, Siri, and Google, but it can comprehend simple questions.

[Illustration: library hub]

“This is indeed the research library. The shelves are filled with books. You know what a book is, don’t you? Also, you will find that this research library is not used too much any more. Professors just make up data. Students pay others to do their work. If you wish, I will show you how to use the card catalog. Our online public access terminal and library automation system does not work. The university’s IT department is busy moonlighting for a professor who is a consultant to a social media company,” says the senior research librarian.

What exactly is Elicit?

“Elicit is a research assistant using language models like GPT-3 to automate parts of researchers’ workflows. Currently, the main workflow in Elicit is Literature Review. If you ask a question, Elicit will show relevant papers and summaries of key information about those papers in an easy-to-use table.”

Researchers use Elicit to guide their research and discover papers to cite. Researcher feedback stated they use Elicit to answer their questions, find paper leads, and get better exam scores.

Elicit proves its intuitiveness with its AI-powered research tools. Search results contain papers that do not match the keywords but semantically match the query meaning. Keyword matching also allows researchers to narrow or expand specific queries with filters. The summarization tool creates a custom summary based on the research query and simplifies complex abstracts. The citation graph semantically searches citations and returns more relevant papers. Results can be organized and more information added without creating new queries.

Elicit does have limitations, such as the inability to evaluate information quality. Also, Elicit is still a new tool, so mistakes will be made along the development process. Elicit does warn users about mistakes and advises them to use tried and true, old-fashioned research methods of evaluation.

Whitney Grace, July 16, 2023

In the Midst of Info Chaos, a Path Identified and Explained

July 10, 2023

Note: This essay is the work of a real and still-alive dinobaby. No smart software involved, just a dumb humanoid.

The Threads – Twitter spat in the midst of BlueSky and Mastodon marks a modest change in having one place to go for current information. How does one maintain awareness with high school taunts awing, Mastodon explaining how easy it is to use, and BlueSky doing its deep gaze thing?

One answer and a quite good one at that appears in “RSS for Post-Twitter News and Web Monitoring.” The author knows quite a bit about finding information, and she also has the wisdom to address me as “dinobaby.” I know a GenZ when I get an email that begins, “Hey, there.” Trust me. That salutation does not work as the author expects.

In the cited article, you will get useful information about newsfeeds, screenshots, and practical advice. Here’s an example of what’s in the excellent how-to:

If you want to check a site for RSS feeds and you think it might be a WordPress site, just add /feed/ to the end of the domain name. You might get a 404 error, but you also might get a page full of information!
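The tip above can be sketched in a few lines of Python. This is a rough illustration, not anything from the cited article: the function names `wordpress_feed_url` and `looks_like_feed` are made up for the example, and `example.com` is a placeholder domain. The check for an `<rss` or `<feed` tag near the top of the response is a crude assumption about what a feed looks like.

```python
from urllib.parse import urlsplit, urlunsplit
import urllib.error
import urllib.request


def wordpress_feed_url(site: str) -> str:
    """Build the candidate feed address: the domain with /feed/ appended.

    Hypothetical helper for illustration; any path or query on the
    input is dropped so that /feed/ follows the domain name directly.
    """
    if "://" not in site:
        site = "https://" + site  # assume HTTPS when no scheme is given
    parts = urlsplit(site)
    return urlunsplit((parts.scheme, parts.netloc, "/feed/", "", ""))


def looks_like_feed(url: str, timeout: float = 10.0) -> bool:
    """Fetch the URL and guess whether the response is an RSS or Atom feed.

    A 404 or any other network error simply returns False.
    """
    try:
        with urllib.request.urlopen(url, timeout=timeout) as response:
            head = response.read(2048).decode("utf-8", errors="replace").lower()
    except (urllib.error.URLError, TimeoutError, ValueError):
        return False
    return "<rss" in head or "<feed" in head


# Usage (needs network access, so it is left as a comment):
#   candidate = wordpress_feed_url("example.com")
#   print(candidate, looks_like_feed(candidate))
```

If the site is not a WordPress site, expect the 404 the tip mentions. The sketch treats that as "no feed here" and moves on.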

There are more tips. Just navigate to Research Buzz, and learn.

This dinobaby awards one swish of its tail to Tara Calishain. Swish.

Stephen E Arnold, July 10, 2023

Wanna Be an MBA? You Got It and for Only $45US

June 30, 2023

Note: This essay is the work of a real and still-alive dinobaby. No smart software involved, just a dumb humanoid.

I managed to eke out of college as an ABD or All But Dissertation. (How useful would it be for me to write 200 pages about Chaucer’s alleged knowledge of the thrilling Apocrypha?) So no MBA or Doctor of Business Administration or the more lofty PhD in Finance. I am a 78 year old humanoid proudly representing the dull-normal in my cohort.

[Illustration: college grad]

“So you got your MBA from which school?” asks the human people manager. The interviewee says, “I got it from an online course.” “Do you have student loans?” queries the interviewer. “Nah, the degree equivalent cost me about $50,” explains the graduate. “Where did you get the tassel and robe?” probes the keen eyed interviewer at a blue chip consulting firm. The motivated MBA offers, “At the Goodwill store.” The image is the MFA grade output from MidJourney.

But you (yes, you, gentle reader) can do better. You can become a Master of Business Administration. You will be wined (or is that whined) and dined by blue chip consulting firms. You can teach as a prestigious adjunct professor before you work at Wal-Mart or tutor high school kids in math. You will be an MBA, laboring at one of those ethics factories more commonly known as venture capital firms. Imagine.

How can this be yours? Just pony up $45US and study MBA topics on your own. “This MBA Training Course Bundle Is 87% Off Right Now.” The article breathlessly explains:

The courses are for beginners and require no previous experience with the business world. Pick and choose which courses you want to complete, or take the whole package to maximize your knowledge. Work through materials at your own pace (since you have lifetime access) right on your mobile or desktop device.

There is an unfortunate disclaimer; to wit:

This course bundle will not replace a formal MBA degree—but it can get you some prior knowledge before pursuing one or give you certificates to include on your resume. Or, if you’re an aspiring entrepreneur, you may just be searching for some tips from experts.

A quick visit to a Web search system for “cheap online PhD” can convert that MBA learning into even more exciting job prospects.

The Beyond Search goose says, “Act now and become an eagle. Unlike me, a silly goose.”

Stephen E Arnold, June 30, 2023

AI Tools That Make Cheating…Err… Research Easier

June 22, 2023

Note: This essay is the work of a real and still-alive dinobaby. No smart software involved, just a dumb humanoid.

Homework has been the bane of students since the inception of school. Students have dreamt about ways to make homework easier, either with the intervention of divine beings or a homework-finishing robot. While the gods of various religions have never concerned themselves with homework, ingenious minds have tackled the robot idea with artificial intelligence. While AI cannot succinctly write a decent essay, Euro News shares the next generation of tools that will make homework easier: “The Best AI Tools To Power Your Academic Research.”

[Illustration: girl cheating]

This young lady is not cheating. She is using her mobile phone to look up facts using Bard and ChatGPT. With the information in hand, she will interact with each system to obtain the required 500 words for her US history essay about ethics and Spiro Agnew. She is not cheating. She is researching. The image emerged from the highly original MidJourney system, which never cheats its users. But what does it do with those inputs?

OpenAI’s ChatGPT tool, a generative AI that creates and writes text, has thrown academics for a loop. ChatGPT is the first AI that can “write” a cohesive essay and can answer simple questions better than a search engine. Academics are worried it will ruin the integrity of education, but others believe ChatGPT and other AI tools will democratize information.

Postdoctoral researcher Mushtaq Bilal, based at the University of Southern Denmark, believes ChatGPT is a wonderful invention. He explains that ChatGPT cannot produce a full journal article that is truthful, peer-reviewed, and well-cited. With incremental prompting, Bilal says the AI tool can generate ideas that resemble a conversation with an Ivy League professor. Bilal proposes to use ChatGPT as a brainstorming tool. For example, he used it to create an article outline and he fact checked the information.

Bilal recommends scholars use other AI tools, such as Consensus. Consensus is an AI-driven search engine that answers questions and provides citations. Elicit.org is similar, except it is an AI research assistant and its database is based purely on research. Scite.ai provides fact based citations based on search queries. Research Rabbit fast tracks research similar to how Spotify recommends music. It learns researchers’ interests and recommends new information based on them. ChatPDF allows users to upload papers, then ask the AI questions or have it summarize the information.

Homework has not seen a revolution this huge since the implementation of the Internet.

“‘The development of AI will be as fundamental as the creation of the microprocessor, the personal computer, the Internet, and the mobile phone,’ wrote Bill Gates in the latest post on his personal blog, titled ‘The Age of AI Has Begun.’ ‘Computers haven’t had the effect on education that many of us in the industry have hoped,’ he wrote. ‘But I think in the next five to 10 years, AI-driven software will finally deliver on the promise of revolutionizing the way people teach and learn.’”

In other words, homework will be much easier to complete and these new tools will make learning better. Students will also cleverly discover new ways to manipulate the tools to cheat just as they have been for centuries.

Whitney Grace, June 22, 2023

Two Creatures from the Future Confront a Difficult Puzzle

June 15, 2023

Note: This essay is the work of a real and still-alive dinobaby. No smart software involved, just a dumb humanoid.

I was interested in a suggestion a colleague made to me at lunch. “Check out the new printed World Book encyclopedia.”

I replied, “A new one. Printed? Doesn’t information change quickly today?”

My lunch colleague said, “That’s what I have heard.”

I offered, “Who wants printed, hard-to-change content objects? Where’s the fun in sneaky or sockpuppet edits? Do you really want to go back to non-fluid information?”

My hungry debate opponent said, “What? Do you mean misinformation is good?”

I said, “It’s a digital world. Get with the program.”

Navigate to World Book.com and check out the 10 page sample about dinosaurs. When I scanned the entry, there was no information about dinobabies. I was disappointed because the dinosaur segment is bittersweet for these reasons:

  1. The printed encyclopedia is a dinosaur of sorts, an expensive one to produce and print at that
  2. As a dinobaby, I was expecting an IBM logo or maybe an illustration of a just-RIF’ed IBM worker talking with her attorney about age discrimination
  3. Those who want to fill a bookshelf can buy books at a second hand bookstore or connect with a zippy home designer to make the shelf tasteful. I think there is wallpaper of books on a shelf as an alternative.

[Illustration: aliens with book]

Two aliens are trying to figure out what a single volume of a World Book encyclopedia contains. I assume the creatures will be holding the volume 6 “I”, the one with information about the Internet. The image comes from the creative bits at MidJourney.

Let me dip into my past. Ah, you are not interested? Tough. Here we go down memory lane:

In 1953 or 1954, my father had an opportunity to work in Brazil. Off our family went. One of the must-haves was a set of World Book encyclopedias. The covers were brown; the pictures were mostly black and white; and the information was, according to my parents, accurate.

The schools in Campinas, Brazil, at that time used one language. Portuguese. No teacher spoke English. Therefore, after I failed every class except mathematics, my parents decided to get me a tutor. The course work was provided by something called Calvert in Baltimore, Maryland. My teacher would explain the lesson, watch me read, ask me a couple of questions, and bail out after an hour or two. That lasted about as long as my stint in the Campinas school near our house. My tutor found himself on the business end of a snake. The snake lived; the tutor died.

My father — a practical accountant — concluded that I should read the World Book encyclopedia. Every volume. I think there were about 20 plus a couple of annual supplements. My mother monitored my progress and made me write summaries of the “interesting” articles. I recall that interesting or not, I did one summary a day and kept my parents happy.

I hate World Books. I was in the fourth or fifth grade. Campinas had great weather. There were many things to do. Watch the tarantulas congregate in our garage. Monitor the vultures circling my mother when she sunbathed on our deck. Kick a soccer ball when the students got out of school. (I always played. I sucked, but I had a leather, size five ball. Prior to our moving to the neighborhood, the kids my age played soccer with a rock wrapped in rags. The ball was my passport to an abuse free stint in rural Brazil.)

But a big chunk of my time was gobbled by the yawning white maw of a World Book.

When we returned to the US, I entered the seventh grade. No one at the public school in Illinois asked about my classes in Brazil. I just showed up in Miss Soape’s classroom and did the assignments. I do know one thing for sure: I was the only student in my class who did not have to read the assigned work. Reading the World Book granted me a free ride through grade school, high school, and the first couple of years at college.

Do I recommend that grade school kids read the World Book cover to cover?

No, I don’t. I had no choice. I had no teacher. I had no radio because the electricity was on several hours a day. There was no TV because there were no broadcasts in Campinas. There was no English language anything. Thus, the World Book, which I hate, was the only game in town.

Will I buy the print edition of the 2023 World Book? Not a chance.

Will other people? My hunch is that sales will be a slog outside of library acquisitions and a few interior decorators trying to add color to a client’s book shelf.

I may be a dinobaby, but I have figured out how to look up information online.

The book thing: I think many young people will be as baffled about an encyclopedia as the two aliens in the illustration.

By the way, the full set is about $1,200. A cheap smartphone can be had for about $250. What will kids use to look up information? If you said, the printed encyclopedia, you are a rare bird. If you move to a remote spot on earth, you will definitely want to lug a set with you. Starlink can be expensive.

Stephen E Arnold, June 14, 2023

Moral Decline? Nah, Just Your Perception at Work

June 12, 2023

Here’s a graph from the academic paper “The Illusion of Moral Decline.”

[Graph from “The Illusion of Moral Decline”]

Is it even necessary to read the complete paper after studying the illustration? Of course not. Nevertheless, let’s look at a couple of statements in the write up to get ready for that in-class, blank bluebook semester examination, shall we?

Statement 1 from the write up:

… objective indicators of immorality have decreased significantly over the last few centuries.

Well, there you go. That’s clear. Imagine what life was like before modern day morality kicked in.

Statement 2 from the write up:

… we suggest that one of them has to do with the fact that when two well-established psychological phenomena work in tandem, they can produce an illusion of moral decline.

Okay. Illusion. This morning I drove past people sleeping under an overpass. A police vehicle with lights and siren blaring raced past me as I drove to the gym (a gym which is no longer open 24×7 due to safety concerns). I listened to a report about people struggling amidst the flood water in Ukraine. In short, a typical morning in rural Kentucky. Oh, I forgot to mention the gunfire I could hear as I walked my dog at a local park. I hope it was squirrel hunters, but in this area who knows?

[Illustration: paper published]

MidJourney created this illustration of the paper’s authors celebrating the publication of their study about the illusion of immorality. The behavior is a manifestation of morality itself, and it is a testament to the importance of crystal clear graphs.

Statement 3 from the write up:

Participants in the foregoing studies believed that morality has declined, and they believed this in every decade and in every nation we studied….About all these things, they were almost certainly mistaken.

My take on the study includes these perceptions (yours hopefully will be more informed than mine):

  1. The influence of social media gets slight attention
  2. Large-scale immoral actions get little attention. I am tempted to list examples, but I am afraid of legal eagles and aggrieved academics with time on their hands.
  3. The impact of intentionally weaponized information on behavior in the US and other nation states which provide an infrastructure suitable to permit wide use of digitally-enabled content.

In order to avoid problems, I will list some common and proper nouns or phrases and invite you to think about these in terms of the glory word “morality”. Have fun with your mental gymnastics:

  • Catholic priests and children
  • Covid information and pharmaceutical companies
  • Epstein, Andrew, and MIT
  • Special operation and elementary school children
  • Sudan and minerals
  • US politicians’ campaign promises.

Wasn’t that fun? I did not have to mention social media, self harm, people between the ages of 10 and 16, and statements like “Senator, thank you for that question…”

I would not do well with a written test watched by attentive journal authors. By the way, isn’t perception reality?

Stephen E Arnold, June 12, 2023

Thinking about AI: Is It That Hard?

May 17, 2023

I read “Why I’m Having Trouble Covering AI: If You Believe That the Most Serious Risks from AI Are Real, Should You Write about Anything Else?” The version I saw was a screenshot, presumably to cause me to go to Platformer in order to interact with it. I use smart software to convert screenshots into text, so the risk reduced by the screenshot was in the mind of the creator.

Here’s a statement I underlined:

The reason I’m having trouble covering AI lately is because there is such a high variance in the way that the people who have considered the question most deeply think about risk.

My recollection is that Daniel Kahneman allegedly cooked up the idea of “prospect theory.” As I understand the idea, humans are not very good when thinking about risk. In fact, some people take risks because they think that a problem can be averted. Others avoid risk because omission is okay; for example, reporting a financial problem. Why not just leave it out and cook up a footnote? Omissions are often okay with some government authorities.

I view the AI landscape from a different angle.

First, smart software has been chugging along for many years. May I suggest you fire up a copy of Microsoft Word, use it with its default settings, and watch how words are identified, phrases underlined, and letters automatically capitalized? How about using Amazon to buy lotion? Click on the buy now button and navigate to the order page. It’s magic. Amazon has used software to perform tasks which once required a room with clerks. There are other examples. My point is that the current baloney roll is swelling from its own gaseous emissions.

Second, the magic of ChatGPT outputting summaries was available 30 years ago from Island Software. Stick in the text of an article, and the desktop system spit out an abstract. Was it good? If one were a high school student, it was. If you were building a commercial database product fraught with jargon, technical terms, and abstruse content, it was not so good. Flash forward to now. Bing, You.com, and presumably the new and improved Bard are better. Is this surprising? Nope. Thirty years of effort have gone into this task of making a summary. Am I to believe that the world will end because smart software is causing a singularity? I am not reluctant to think quantum supremacy type thoughts. I just don’t get too overwrought.

Third, using smart software and methods which have been around for centuries (yep, centuries) is a result of easy-to-use tools being available at low cost or free. I find You.com helpful; I don’t pay for it. I tried Kagi and Teva; not so useful and I won’t pay for them. Swisscows.com works reasonably well for me. Cash conserving and time saving are important. Smart software can deliver this easily and economically. When the math does not work, then I am okay with manual methods. Will the smart software take over the world and destroy me as an Econ Talk guest suggested? Sure. Maybe? Soon. What’s soon mean?

Fourth, the interest in AI, in my opinion, is a result of several factors: [a] Interesting demonstrations and applications at a time when innovation becomes buying or trying to buy a game company, [b] avoiding legal interactions due to behavioral or monopoly allegations, [c] a deteriorating economy due to the Covid and free money events, [d] frustration among users with software and systems focused on annoying, not delighting, their users; [e] the inability of certain large companies to make management decisions which do not illustrate that high school science club thinking is not appropriate for today’s business world; [f] data are available; [g] computing power is comparatively cheap; [h] software libraries, code snippets, off-the-shelf models, and related lubricants are findable and either free to use or cheap; [i] obvious inefficiencies exist so a new tool is worth a try; and [j] the lure of a bright shiny thing which could make a few people lots of money adds a bit of zest to the stew.

Therefore, I am not confused, nor am I overly concerned with those who predict home runs or end-of-world outcomes.

What about big AI brains getting fired or quitting?

Three observations:

First, outfits like Facebook and Google type companies are pretty weird and crazy places. Individuals who want to take a measured approach or who are not interested in having 20-somethings play with their mobile when contributing to a discussion should get out or get thrown out. Scared or addled or arrogant big company managers want the folks to speak the same language, to be on the same page even if the messages are written in invisible ink, encrypted, and circulated to the high school science club officers.

Second, like most technologies chock full of jargon, excitement, and the odor of crisp greenbacks, expectations are high. Reality is often able to deliver friction the cheerleaders, believers, and venture capitalists don’t want to acknowledge. That friction exists and will make its presence felt. How quickly? Maybe Bud Light quickly? Maybe Google ad practice awareness speed? Who knows? Friction just is and, like gravity, is difficult to ignore.

Third, the confusion about AI depends upon the lenses through which one observes what’s going on. What are these lenses? My team has identified five smart software lenses. Depending on what lens is in your pair of glasses and how strong the curvatures are, you will be affected by the societal lens, the technical lens, the individual lens (that’s the certain blindness each of us has), the political lens, and the financial lens. With lots to look at, the choice of lens is important. The inability to discern what is important depends on the context existing when the AI glasses are perched on one’s nose. It is okay to be confused; unknowing adds the splash of Slap Ya Mama to my digital burrito.

Net net: Meta-reflections are a glimpse into the inner mind of a pundit, podcast star, and high-energy writer. The reality of AI is a replay of a video I saw when the Internet made online visible to many people, not just a few individuals. What’s happened to that revolution? Ads and criminal behavior. What about the mobile revolution? How has that worked out? From my point of view it creates an audience for technology which could, might, may, will, or whatever other forward-looking word one wants to use. AI is going to glue together the lowest common denominator of greed with the deconstructive power of digital information. No Terminator is needed. I am used to being confused, and I am perfectly okay with the surrealistic world in which I live.

PS. We lectured two weeks ago to a distinguished group and mentioned smart software four times in two and one half hours. Why? It’s software. It has utility. It is nothing new. My prospect theory pegs artificial intelligence in the same category as online (think NASA Recon), browsing (think remote content to a local device), and portable phones (talking and doing other stuff without wires). Also, my Zepp watch stress reading is in the low 30s. No enlarged or cancerous prospect theory for me at this time.

Stephen E Arnold, May 17, 2023

The APA Zips Along Like … Like a Turtle, a Really Snappy Turtle Too

May 10, 2023

I read “American Psychology Group Issues Recommendations for Kids’ Social Media Use”. The article reports that social media is possibly, just maybe, perhaps, sort of an issue for some, a few, a handful, a teenie tiny percentage of young people. I am not sure when “social media” began. Maybe it was something called Six Degrees or Live Journal. I definitely recall the wonky weirdness of flashing MySpace pages. I do know about Orkut which if one cares to check was a big hit among a certain segment of Brazilians. The exact year is irrelevant; social media has been kicking around for about a quarter century.

Now, I learn:

The report doesn’t denounce social media, instead asserting that online social networks are “not inherently beneficial or harmful to young people,” but should be used thoughtfully. The health advisory also does not address specific social platforms, instead tackling a broad set of concerns around kids’ online lives with commonsense advice and insights compiled from broader research.

What are the data about teen suicides? What about teen depression? What about falling test scores? What about trend oddities among impressionable young people? Those data are available and easy to spot. In June 2023, another Federal agency will provide information about yet another clever way to exploit young people on social media.

Now the APA is taking a stand? Well, not really a stand, more of a general statement about what I think is one of the most destructive online application spaces available to young and old today.

How about this statement?

The APA recommends a reasonable, age-appropriate degree of “adult monitoring” through parental controls at the device and app level and urges parents to model their own healthy relationships with social media.

How many young people grow up with one parent and minimal adult monitoring? Yeah, how many? Do parents or a parent know what to monitor? Does a parent know about social media apps? Does a parent know the names of social media apps?

Impressive, APA. Now I remember why I thought Psych 101 was a total, absolute, waste of my time when I was a 17 year old fresh person at a third rate college for losers like me. My classmates, also losers, struggled to suppress laughter during the professor’s lectures. Now I am giggling at this APA position.

Sorry. Your paper and recommendations are late. You get an F.

Stephen E Arnold, May 10, 2023

AI Shocker? Automatic Indexing Does Not Work

May 8, 2023

Note: This essay is the work of a real and still-alive dinobaby. No smart software involved, just a dumb humanoid.

I am tempted to dig into my more than 50 years of work in online and pull out a chestnut or two. I will not. Just navigate to “ChatGPT Is Powered by These Contractors Making $15 an Hour” and check out the allegedly accurate statements about the knowledge work a couple of people do.

The write up states:

… contractors have spent countless hours in the past few years teaching OpenAI’s systems to give better responses in ChatGPT.

The write up includes an interesting quote; to wit:

“We are grunt workers, but there would be no AI language systems without it,” said Savreux [an indexer tagging content for OpenAI].

I want to point out a few items germane to human indexers based on my experience with content about nuclear information, business information, health information, pharmaceutical information, and “information” information which thumbtypers call metadata:

  1. Human indexers, even when trained in the use of a carefully constructed controlled vocabulary, make errors, become fatigued and fall back on some favorite terms, and misunderstand the content and assign terms which will mislead when used in a query
  2. Source content — regardless of type — varies widely. New subjects or different spins on what seem to be known concepts mean that important nuances may be lost due to what is included in the available dataset
  3. New content often uses words and phrases which are difficult to understand. I try to note a few of the more colorful “new” words and bound phrases like softkill, resenteeism, charity porn, toilet track, and purity spirals, among others. In order to index a document in a way that allows one to locate it, knowing the term is helpful if there is a full text instance. If not, one needs a handle on the concept which is an index term a system or a searcher knows to use. Relaxing the meaning (a trick of some clever outfits with snappy names) is not helpful
  4. Creating a training set, keeping it updated, and assembling the content artifacts is slow, expensive, and difficult. (That’s why some folks have been seeking short cuts for decades. So far, humans still become necessary.)
  5. Reindexing, refreshing, or updating the digital construct used to “make sense” of content objects is slow, expensive, and difficult. (Ask an Autonomy user from 1998 about retraining in order to deal with “drift.” Let me know what you find out. Hint: The same issues arise from popular mathematical procedures no matter how many buzzwords are used to explain away what happens when words, concepts, and information change.)

Are there other interesting factoids about dealing with multi-type content? Sure there are. Wouldn’t it be helpful if those creating the content applied structure tags, abstracts, lists of entities and their definitions within the field or subject area of the content, and pointers to sources cited in the content object?

Let me know when blog creators, PR professionals, and TikTok artists embrace this extra work.

Pop quiz: When was the last time you used a controlled vocabulary classification code to disambiguate airplane terminal, computer terminal, and terminal disease? How does smart software do this, pray tell? If the write up and my experience are on the same wave length (not surfing wave but frequency wave), a subject matter expert, trained index professional, or software smarter than today’s smart software is needed.
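For those who have never met a classification code, here is a toy sketch of the pop quiz in Python. Everything in it is invented for illustration: the codes (`TRN-0410` and friends), the three records, and the function names are assumptions, not entries from any real controlled vocabulary or indexing system. The point is only that a keyword search on “terminal” returns everything, while a search on the code a human indexer assigned returns one sense.

```python
# Hypothetical controlled vocabulary: the codes and labels are made up.
CONTROLLED_VOCABULARY = {
    "TRN-0410": "airplane terminal",
    "CMP-0220": "computer terminal",
    "MED-0930": "terminal disease",
}

# Each record carries the classification code an indexer assigned to it.
RECORDS = [
    {"title": "New terminal opens at regional airport", "code": "TRN-0410"},
    {"title": "Configuring a terminal for remote login", "code": "CMP-0220"},
    {"title": "Hospice care and terminal illness", "code": "MED-0930"},
]


def keyword_search(records, word):
    """Return titles containing the word; no sense of meaning is applied."""
    word = word.lower()
    return [r["title"] for r in records if word in r["title"].lower()]


def code_search(records, code):
    """Return titles tagged with one classification code from the vocabulary."""
    if code not in CONTROLLED_VOCABULARY:
        raise KeyError(f"unknown classification code: {code}")
    return [r["title"] for r in records if r["code"] == code]


print(keyword_search(RECORDS, "terminal"))  # all three senses come back
print(code_search(RECORDS, "CMP-0220"))     # only the computer sense
```

The sketch skips the hard part: someone, or something, has to assign the right code to each record in the first place.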

Stephen E Arnold, May 8, 2023
