<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Social Media Conversation Analyst &#187; Influence</title>
	<atom:link href="http://www.nickarnett.net/category/influence/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.nickarnett.net</link>
	<description>Social media analytics for decision-making</description>
	<lastBuildDate>Sun, 17 Jan 2010 04:04:25 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0</generator>
		<item>
		<title>Analyzing conversations</title>
		<link>http://www.nickarnett.net/2009/12/14/analyzing-conversations/</link>
		<comments>http://www.nickarnett.net/2009/12/14/analyzing-conversations/#comments</comments>
		<pubDate>Mon, 14 Dec 2009 21:57:20 +0000</pubDate>
		<dc:creator>Nick Arnett</dc:creator>
				<category><![CDATA[Engagement]]></category>
		<category><![CDATA[Influence]]></category>
		<category><![CDATA[social network analysis]]></category>

		<guid isPermaLink="false">http://www.nickarnett.net/?p=375</guid>
		<description><![CDATA[I've been calling myself a social media analyst for a long time, but the specific area that most interests me, around which I've started a couple of companies and received several patents, is better described as conversation analysis.]]></description>
			<content:encoded><![CDATA[<p>I&#8217;ve crawled out of my hole, blog-wise, today to write about conversation analysis.  I&#8217;ve been calling myself a social media analyst for a long time, but the specific area that most interests me, around which I&#8217;ve started a couple of companies and received several patents, is better described as conversation analysis.</p>
<p>There are a bunch of companies that offer social media monitoring services that claim to &#8220;monitor the conversation&#8221; that social media has begun to enable among organizations and their clients, customers and other constituents.  A huge shortcoming of them, as I see it, is that they really are little more than digital clipping services.  All they find is what is indexed by search engines and when they find it, they are looking at web pages, not conversations.  They miss a lot.</p>
<p>They completely miss the conversations that take place on mailing lists and in web forums that aren&#8217;t friendly to search engine crawlers.  When they do find digital conversations, they don&#8217;t analyze them structurally to identify individual posts, replies, timestamps and so forth.  This means that they aren&#8217;t really getting a valid picture of how many people are talking (perhaps the most important metric), how many topics there are, how many people are engaged in each topic and other metrics that are only possible when analysis looks beyond the single web page.  Of course, they don&#8217;t do this because it is hard!</p>
<p>It is probably an impossible task to keep up with all of the digital conversation platforms so that this type of analysis could be as thorough as Google&#8217;s web page indexing.  There are dozens of open-source and commercial web community forum platforms.  Some are easier to analyze than others; some are probably totally impractical to automatically analyze.  Some are deliberately quite private and invisible.  Still, I have no doubt that conversation analysis is worthwhile, if only to solve the fundamental problem of listening to an enormous group of people whose concerns and viewpoints matter to you &#8211; your customers, if no one else.</p>
<p>Most of my work over the last nine years, aside from detours into necessary infrastructure building, has been focused on this challenge &#8211; how do you summarize millions of postings in digital communities?  How do you summarize the topics under discussion, how those topics are changing over time, how many people care about those topics and so forth.  Automatic summarization has never made an intuitive leap, so part of the summarization challenge is to automatically select a tractable number of typical phrases, sentences or postings for a human being to read, a selection that should enable the kind of leap of recognition that a computer cannot.</p>
<p>Anything &#8220;social&#8221; is about relationships.  My goal continues to be to use technology to discover what I can about the relationships among people, brands, companies, events and so forth.   Along those lines, I&#8217;ll be writing some more blog posts in the next few days.  One will be on the difference between customer support and technical support.  Another will take a look at how a conversation between a large organization and its constituents might be structured &#8211; how today&#8217;s (and yesterday&#8217;s) social media fit together.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.nickarnett.net/2009/12/14/analyzing-conversations/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Measuring influence &#8211; it is mostly indirect</title>
		<link>http://www.nickarnett.net/2009/06/30/measuring-influence-it-is-mostly-indirect/</link>
		<comments>http://www.nickarnett.net/2009/06/30/measuring-influence-it-is-mostly-indirect/#comments</comments>
		<pubDate>Tue, 30 Jun 2009 16:15:29 +0000</pubDate>
		<dc:creator>Nick Arnett</dc:creator>
				<category><![CDATA[Influence]]></category>

		<guid isPermaLink="false">http://www.nickarnett.net/?p=362</guid>
		<description><![CDATA[Possibly by coincidence, this week&#8217;s Social Media Club question of the week is about measuring influence in social networking&#8230; and I just wrote a bit about that topic in the Web Analytics group: On Mon, Jun 29, 2009 at 8:07 AM, Peter Kristof wrote: Can anyone point me to some good resources (articles, blogs, tools, [...]]]></description>
			<content:encoded><![CDATA[<p>Possibly by coincidence, this week&#8217;s Social Media Club question of the week is about measuring influence in social networking&#8230; and I just wrote a bit about that topic in the Web Analytics group:</p>
<p><em>On Mon, Jun 29, 2009 at 8:07 AM, Peter Kristof wrote:</em></p>
<p><em>Can anyone point me to some good resources (articles, blogs, tools, vendors,etc.) for research on measurement of social media / Web 2.0? </em></p>
<p>I invented some of the original buzz measurement stuff (now owned by Nielsen/Buzzmetrics) &#8230;</p>
<p>Analytics progress in this field is slow &#8211; it depends very much on understanding language, which is fundamentally lousy and not progressing very fast.  There is enormous ambiguity in the behavior and text it measures, which shouldn&#8217;t come as a surprise to anyone in web analytics.  Despite all the talk around sentiment and such, I&#8217;m still convinced that the most important metric is how many people are talking about a topic; any system that doesn&#8217;t focus on that is probably off the mark.  No. 2. is how influential those people could be.  I say &#8220;could be&#8221; because generally speaking, we only can identify influencers by their potential to influence (because they participate in a lot of discussions, across venues) than their actual influence.  Finally, I always pay attention to how such systems summarize what&#8217;s going on in social networks.  Two million postings and here are the 10 that are best representative &#8211; how did you pick those?</p>
<p>Beware of cool visualizations&#8230; any sort of self-organizing mapping of social media space usually will not to scale well.  There are some very hard graph problems behind them.  I think most vendors will admit that the eye candy is more useful for selling services than for delivering intelligence.</p>
<p>In terms of where things are going, I think we&#8217;re seeing more innovations in packaging and pricing, away from the big expensive solutions to smaller, lower-cost tools, rather than breakthroughs in the technology of measurement.  I suspect that will remain true for a while, if only because social media itself is evolving so fast that what works today is likely to be obsolete soon.</p>
<blockquote style="margin-top: 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0.8ex; border-left-width: 1px; border-left-color: #cccccc; border-left-style: solid; padding-left: 1ex;">
<div style="background-color: #ffffff;">
<div style="width: 655px;">
<div style="width: 470px; padding-top: 0px; padding-right: 25px; padding-bottom: 0px; padding-left: 0px; float: left; margin: 0px;">
<div>
<div id="_mcePaste" style="position: absolute; left: -10000px; top: 170px; width: 1px; height: 1px; overflow-x: hidden; overflow-y: hidden;">Analytics progress in this field is slow &#8211; it depends very much on understanding language, which is fundamentally lousy and not progressing very fast.  There is enormous ambiguity in the behavior and text it measures, which shouldn&#8217;t come as a surprise to anyone in web analytics.  Despite all the talk around sentiment and such, I&#8217;m still convinced that the most important metric is how many people are talking about a topic; any system that doesn&#8217;t focus on that is probably off the mark.  No. 2. is how influential those people could be.  I say &#8220;could be&#8221; because generally speaking, we only can identify influencers by their potential to influence (because they participate in a lot of discussions, across venues) than their actual influence.  Finally, I always pay attention to how such systems summarize what&#8217;s going on in social networks.  Two million postings and here are the 10 that are best representative &#8211; how did you pick those?</div>
<div id="_mcePaste" style="position: absolute; left: -10000px; top: 170px; width: 1px; height: 1px; overflow-x: hidden; overflow-y: hidden;">Beware of cool visualizations&#8230; any sort of self-organizing mapping of social media space usually will not to scale well.  There are some very hard graph problems behind them.  I think most vendors will admit that the eye candy is more useful for selling services than for delivering intelligence.</div>
<div id="_mcePaste" style="position: absolute; left: -10000px; top: 170px; width: 1px; height: 1px; overflow-x: hidden; overflow-y: hidden;">In terms of where things are going, I think we&#8217;re seeing more innovations in packaging and pricing, away from the big expensive solutions to smaller, lower-cost tools, rather than breakthroughs in the technology of measurement.  I suspect that will remain true for a while, if only because social media itself is evolving so fast that what works today is likely to be obsolete soon.</div>
<div id="_mcePaste" style="position: absolute; left: -10000px; top: 170px; width: 1px; height: 1px; overflow-x: hidden; overflow-y: hidden;">Nick</div>
</div>
</div>
</div>
</div>
</blockquote>
]]></content:encoded>
			<wfw:commentRss>http://www.nickarnett.net/2009/06/30/measuring-influence-it-is-mostly-indirect/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Reputation portability and social network APIs</title>
		<link>http://www.nickarnett.net/2009/06/15/reputation-portability-and-social-network-apis/</link>
		<comments>http://www.nickarnett.net/2009/06/15/reputation-portability-and-social-network-apis/#comments</comments>
		<pubDate>Tue, 16 Jun 2009 03:45:40 +0000</pubDate>
		<dc:creator>Nick Arnett</dc:creator>
				<category><![CDATA[Engagement]]></category>
		<category><![CDATA[Influence]]></category>
		<category><![CDATA[Reputation]]></category>

		<guid isPermaLink="false">http://www.nickarnett.net/?p=359</guid>
		<description><![CDATA[With the explosion of social media in the last few years, it seems like reputation portability is becoming more attractive and practical.]]></description>
			<content:encoded><![CDATA[<p>The idea of portable reputations in digital media has been around for many years, but not much has happened.  The idea is fairly simple &#8211; create a means of taking credibility (or the lack of it) from one digital community to another, so that you&#8217;re not starting from zero each time you join a new one.  When Usenet and CompuServe were the only games in town, this didn&#8217;t really matter, but with the explosion of social media in the last few years, it seems like reputation portability is becoming more attractive and practical.  There are big obstacles, starting with the definition of terms.</p>
<p>&#8220;Reputation&#8221; is in the family of vague terms like &#8220;engagement,&#8221; &#8220;influence&#8221; and &#8220;community.&#8221;  They sound very impressive. &#8220;People ith high positive reputation scores generate community engagement&#8221; sounds like one of those phrases that expensive consultants toss around at industry conferences, loaded with implications and empty of specific meaning.  Nevertheless, reputation means something, even if we can&#8217;t agree what it is.  It has at least two components &#8211; a value and a valence.  For example, honesty is part of reputation; when its valence is negative, we call it dishonesty.  Of course, personal reputation in digital communities has more to do with accuracy, interesting-ness and opinion leadership than black-and-white issues like honesty.</p>
<p>Even if everybody agrees on what reputation can mean, the way it happens varies from one community to another.  Take &#8220;following&#8221; relationships, for example.  Following somebody on Facebook requires their permission, but not on Twitter.  Similarly, the way Twitter is dominated by &#8220;open&#8221; accounts means that being quoted (retweeted) on Twitter is more likely to happen and therefore possibly less significant than elsewhere.  Some communities have explicit voting and scoring systems that rank people, posts, pictures and so forth.  Few mean the same things, especially when they are based on raw scores from communities of vastly different sizes.  If nothing else, this means that nobody is going to come up with a definitive reputation ranking system, which is probably just fine.  There are many possible dimensions to reputation and various purposes for it, so I would expect that it&#8217;s a good thing if many systems arise.  I&#8217;m wondering if they will start to arise by way of the rising number of social media APIs.</p>
<p>Open APIs not only allow third parties to experiment with reputation scoring systems for each social network &#8211; for example, all the Twitter influence scoring systems &#8211; they allow third parties to try out reputation mashups, which implies some sort of reputation portability.  FriendFeed, being a social network mashup itself, is the kind of service that enables this.  Anybody who has claimed more than one social network identity on FriendFeed potentially could bring their reputation from one to the other, since the links on FriendFeed tell third parties that the two identities (a/k/a accounts or profiles) belong to the same person.  For example, if I have a large following on Twitter, then sign up for Facebook, a third party could inform Facebook users that I&#8217;m worth following, even though I haven&#8217;t done a thing yet on Facebook.  That capability becomes particularly interesting in terms of competition between social networking sites.  When the next Twitter comes along, whatever that might be, people may be able to get deeply engaged in it faster because of Twitter&#8217;s open API&#8230; assuming the new guys also have an open API, of course.  Some of this is already happening between partnering social media sites, where you are invited to bring your friends along.  That&#8217;s only mildly interesting because it doesn&#8217;t happen between direct competitors.  When the APIs generally support reputation-related data and third parties are the ones who make the marriages, so to speak, the world becomes very interesting.  That&#8217;s not just because of what you can do, but also because of what it makes harder &#8211; spamming.</p>
<p>Spammers will have a much harder time in a system of shared reputation data precisely because the social media networks are somewhat different from each other.  Spamming is harder on some, easier on others, so if somebody shows up only on the &#8220;easy&#8221; ones, that&#8217;s a strong clue that they are not legitimate.  If my follower relationships on Twitter are people with whom I have some sort of genuine relationship, I would expect a high percentage of them to be present on other social networks.  Open APIs let third parties measure the differences and make some estimates of the likelihood of legitimacy.  This feels to me something like the kind of robustness that arises from genetic diversity &#8211; yes, you might be able to conquer an individual or two, but you probably can&#8217;t beat the whole ecosystem at its own game.</p>
<p>One thing that becomes harder in this environment is creation of multiple, distinct identities.  Some will argue that people want to keep their personal, work and perhaps entertainment identities independent of one another.  The idea is that if you get your jollies in some socially embarrassing manner, you don&#8217;t want your boss or potential employer to find out.  Or, more legitimately, perhaps you are part of a 12-step program and you want to participate anonymously, or more correctly, pseudonymously, since you&#8217;ll want to create and use a pseudonym for that purpose.  I&#8217;ll respond to the first idea with a motto that is used in recovery &#8211; &#8220;you&#8217;re only as sick as your secrets.&#8221;  The more I consider the idea of having distinct, private identities for work, personal life and whatever else you think you need them for, the more I think it is rubbish.  Unless you&#8217;re spying for the CIA, there really isn&#8217;t much need to compartmentalize your life that way.  And I mean &#8220;compartmentalize&#8221; in the bad way, really.</p>
<p>I rarely truly believe anything I write like this until I&#8217;ve seen some data, so my next step will be to explore some of the APIs to see what I can tell about myself and others through multiple APIs.  I&#8217;m hoping that will be the subject of a blog post in a few days.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.nickarnett.net/2009/06/15/reputation-portability-and-social-network-apis/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Implicit social networks: If Guy Kawasaki is right, so am I</title>
		<link>http://www.nickarnett.net/2009/01/01/implicit-social-networks-if-guy-kawasaki-is-right-so-am-i/</link>
		<comments>http://www.nickarnett.net/2009/01/01/implicit-social-networks-if-guy-kawasaki-is-right-so-am-i/#comments</comments>
		<pubDate>Fri, 02 Jan 2009 03:31:33 +0000</pubDate>
		<dc:creator>Nick Arnett</dc:creator>
				<category><![CDATA[Influence]]></category>
		<category><![CDATA[social network analysis]]></category>
		<category><![CDATA[twitter]]></category>

		<guid isPermaLink="false">http://www.nickarnett.net/?p=152</guid>
		<description><![CDATA[Just read Guy Kawasaki&#8217;s Looking for Mr. Goodtweet: How to Pick Up Followers on Twitter, in which he offered the following tip: Tip 4: Follow everyone who follows you. When I first started on Twitter, Robert Scoble told me to follow everyone who followed me. “But why, Robert, would I follow everyone like that?” The answer is [...]]]></description>
			<content:encoded><![CDATA[<p>Just read Guy Kawasaki&#8217;s <a href="http://blog.guykawasaki.com/2008/11/looking-for-m-1.html" target="_blank">Looking for Mr. Goodtweet: How to Pick Up Followers on Twitter</a>, in which he offered the following tip:</p>
<p style="padding-left: 30px;"><strong><em>Tip 4: Follow everyone who follows you.</em></strong><em> When I first started on Twitter, Robert Scoble told me to follow everyone who followed me. “But why, Robert, would I follow everyone like that?” The answer is that it’s courteous to do so and because when you do, some people will respond to you and eveyone who follows them will see this—which is more exposure for you.</em></p>
<p style="padding-left: 30px;"><em>Having said this, when you get to more than fifty or so followers, it’s impossible to read what all your followers tweet. At that point, you have to focus on direct private messages (“Ds”) and direct public messages (“@s””).</em></p>
<p>Yipes. </p>
<p>The first analysis I did on Twitter was to <a href="http://www.nickarnett.net/2008/12/22/influence-measurement-on-twitter/" target="_self">count the followers of followers</a>, as an illustration of how influence is a more-than-first-order phenomenon.  The number of followers your followers have almost surely correlates to your potential influence.  People like <a title="Guy Kawasaki" href="http://twitter.com/guykawasaki" target="_blank">Guy</a> are just about impossible to measure that way &#8211; they have so many followers that it is impractical to count their followers&#8217; followers.</p>
<p>When I first looked at <a href="http://twitterholic.com/" target="_blank">Twitterholic</a> and saw how many followers Guy has, I thought &#8220;How the heck does anybody follow that many people?&#8221;   That question is answered &#8211; he doesn&#8217;t.  It is possible that Guy decided to do me a favor (along with everybody else who looks at implicit, in addition to explicit, social networks).  Guy knows that this is where I&#8217;ve focused for years&#8230; but, okay, that&#8217;s probably not why he advises people (who want a lot of followers) to follow everybody who follows them.  Still, it is good news for me.</p>
<p>The pattern of followers is Twitter&#8217;s explicit social network.  As soon as I started analyzing it I was stymied a bit by robotic auto-followers.  They play havoc with metrics that depend on the follower-followee relationship.  My friend <a href="http://twitter.com/dland" target="_blank">Dave</a> suggested that Twitter might need user-agent meta data, which would reveal whether or not a given Twitter user is a real person or not.  This would allow software to omit users like hashtags, twemes, AmazonGoldDeals, mrtweet and so forth, all of which automatically follow you if you follow them.  But it wouldn&#8217;t solve the problem as long as there are people like Guy with robot-like behavior, automatically following everybody who follows them.  I&#8217;m fairly certain that  Guy really is not a robot &#8211; even though, in addition to tip No. 4, he advocates somewhat mindless direct replies:</p>
<p style="padding-left: 30px;"><strong><em>Tip 2: Send @ messages to the smores.</em></strong><em> They probably won’t answer you, but that’s okay. All you want to do is appear like you have a relationship with them to enhance your credibility. The theory is, “If she is tweeting with @scobleizeer, she must be worth following.” Bull shiitake logic, admittedly, but it helps. To bastardize what a famous PR person once told me, “It’s not who you know. It’s who appears to know you.”</em></p>
<p>That tip guarantees that I&#8217;ll be sending Guy an @ message about the tweet that points to this post.  But I actually do know him, unlike the 10 or 15 other people on Twitter who don&#8217;t, who will @ message him anyway, further confounding those who naively analyze the explicit social network by looking at @ message relationships.</p>
<p>There&#8217;s nothing wrong with analyzing the explicit social network, but it is a big mistake to trust the results by themselves.  So I focus on the real meat in Twitter, figuring out as much as possible which items are getting real human energy into them and what they imply about relationships.  When a hundred people post links to the same URL, odds are that the page they&#8217;re tweeting really is meaningful and those people are likely to influence each other in the Twitterverse, especially if they used the same &#8220;shrunken&#8221; URL.  Throw in screen names, hash tags and language patterns and perhaps something truly useful and meaningful will come out.  I hope so.</p>
<p>As long as Guy doesn&#8217;t start advocating retweeting the &#8220;smores&#8221; tweets, I&#8217;m probably okay.  So far, all he&#8217;s done in that direction is to tell people to repeat <em>their own</em> tweets.</p>
<p>If Guy is right that it is a good idea to follow everybody who follows you, then I must be right to insist that any analysis that depends only on the explicit social network is inherently flawed.  Now I just have to decide if I really want to follow, not just Guy&#8217;s advice, but everybody who follows me. A voice in my head is saying, &#8220;If <em>everybody </em>jumped off a cliff&#8230;&#8221;</p>
]]></content:encoded>
			<wfw:commentRss>http://www.nickarnett.net/2009/01/01/implicit-social-networks-if-guy-kawasaki-is-right-so-am-i/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Twitter:  Massively parallel self-organizing points of view</title>
		<link>http://www.nickarnett.net/2008/12/31/twitter-massively-parallel-self-organization-of-points-of-view/</link>
		<comments>http://www.nickarnett.net/2008/12/31/twitter-massively-parallel-self-organization-of-points-of-view/#comments</comments>
		<pubDate>Wed, 31 Dec 2008 21:10:38 +0000</pubDate>
		<dc:creator>Nick Arnett</dc:creator>
				<category><![CDATA[Influence]]></category>
		<category><![CDATA[Patterns]]></category>
		<category><![CDATA[social change]]></category>
		<category><![CDATA[headlines]]></category>
		<category><![CDATA[point of view]]></category>
		<category><![CDATA[self-organization]]></category>
		<category><![CDATA[twitter]]></category>

		<guid isPermaLink="false">http://www.nickarnett.net/?p=110</guid>
		<description><![CDATA[There is no shortage of speculation about Twitter&#8217;s future, particularly its business model.  Just search on monetize Twitter.   The company&#8217;s site describes it as &#8220;a real-time short messaging service that works over multiple networks and devices.&#8221;  Snore.  Also this: The idea arose when &#8220;Jack Dorsey had grown interested in the simple idea of being [...]]]></description>
			<content:encoded><![CDATA[<p>There is no shortage of speculation about Twitter&#8217;s future, particularly its business model.  Just search on <a href="http://www.google.com/search?q=monetize+Twitter" target="_blank">monetize Twitter</a>.   The company&#8217;s site describes it as &#8220;a real-time short messaging service that works over multiple networks and devices.&#8221;  Snore.  Also this: The idea arose when &#8220;Jack Dorsey had grown interested in the simple idea of being able to know what his friends were doing.&#8221;  Less of a snore, but not a business, but a focus.  The companyalso describes itself this way: &#8220;Twitter solves information overload by changing expectations traditionally associated with online communication.&#8221;  On track, but still very broad.</p>
<p>Please realize that none of that was criticism.  I&#8217;ve been a founder, manager and advisor to many startups over the last 20 years, which led me to think quite a bit about the natural tension between creative invention and focused follow-through.  Solving a broad problem often drives popular new technology at first.  The challenge Twitter faces, like any startup that gets this far, is to find a market focus.  This is hard and relatively rare for startups because the kind of people who are good at inventing stuff are usually unsatisfied, often bored, at using their inventions in just one or two markets when they can see dozens.  However, a market focus is almost always essential to success.  Or as <a href="http://www.keynoteventures.com/paul.htm" target="_blank">Paul Dali</a> once said memorably, the five most important things for a startup are focus, focus, focus, distribution and focus.  So, Twitter, where to focus?</p>
<p>The old saw in developing a business plan is to ask yourself what business you are in.  Twitter has some interested uses that are probably not revenue makers.  For example, knowing that my friend <a href="http://twitter.com/dalepd/status/1088475027" target="_blank">Dale is making goat cheese</a> is interesting, I&#8217;m not at all sure there&#8217;s revenue there.  Let me be clear &#8211; I have a deep appreciation for the non-commercial value of the Internet, the <a href="http://en.wikipedia.org/wiki/Gift_economy" target="_blank">gift economy</a> in which we are all collaborators.  But it doesn&#8217;t pay the electric bill.</p>
<p>What business is Twitter in?  As the headline above says, I think Twitter&#8217;s revenue business will arise from the part of it that is a people-driven, massively parallel headline organizer. It helps me learn things that interest people who I think are interesting.  The people I follow are people who I choose to allow to influence what I read; they are people who have interesting points of view.  Not interesting facts.  I can usually uncover facts without much trouble.  Developing tools that help people find valuable points of view is much harder and far more, well, exciting.  It has been the goal of much of my career.</p>
<p>Ten years ago, <a href="http://www.useit.com/alertbox/980906.html" target="_blank">Jakob Nielsen wrote a piece on &#8220;microcontent,&#8221;</a> with arguments for why web  headlines, page titles and subject lines should not be written like newspaper headlines.  His advice is mostly on target for Twitter, but with one big difference &#8211; Twitter headlines come in the context of somebody&#8217;s point of view.  It&#8217;s not just anybody&#8217;s point of view, but somebody who Twitters often enough, interestingly enough, but not too often or too dull, so that you&#8217;re willing to follow them.  </p>
<p>I am reminded of a conversation I had a couple of years ago (about the war in Iraq, where a member of my extended family was killed in action) with <a href="http://honda.house.gov/" target="_blank">Mike Honda</a>, my representative in Congress.  Mike said that he doesn&#8217;t meet with constituents to get information.  He has a staff for that.  They can dig up just about any information he wants.  Constituents give him something more valuable &#8211; stories that energize him to go back to Washington and keep working despite all the obstacles and the temptation to do something that pays better and yields faster results.</p>
<p>If Google is like my research staff, Twitter is like my constituency&#8230; but the metaphor breaks when you consider that the U.S. government is democratic, but the Internet is as different from democracy as democracy is different from a monarchy.  A democracy is self-regulating, the Internet goes one step further &#8211; it is self-organizing, so there is no equivalent of a member of Congress.  We are all each others&#8217; research staff and constituencies.</p>
<p>When I decide who to follow on Twitter, I&#8217;m looking for strong points of view.  The words &#8220;check out&#8221; are a big ho-hum, but they sure are heavily used.  Please ban &#8220;check out&#8221; from your Twitter vocabulary.  Show me, don&#8217;t tell me, as every writing instructor says.  I promise to do my best to write tweets that reveal more than just facts.</p>
<p>Looking through my recent followees, here&#8217;s a <a href="http://twitter.com/timoreilly/status/1088365555">tweet that really works for me</a>, from Tim O&#8217;Reilly (who keeps popping up in my viewfinder as somebody who uses Twitter well).  It works because it starts with the word &#8220;Love,&#8221; so I know there&#8217;s a strong opinion there.  It also mentions another Twitterer I&#8217;ve heard of, which helps raise my interest.</p>
<p>I&#8217;m not sure how hashtags fit into this model.  They represent a number of peoples&#8217; point of view of categorization by topic.  Although I am a fan of topical organization, I automatically think of trees and directed graphs, yet hashtags are one-dimensional.  I don&#8217;t know if that works.  Structure can be implicit in tags, but that&#8217;s hard even when each message has several tags and there isn&#8217;t room for that in Twitter.  Perhaps out-of-band tagging would work better.  The only hashtag that I have really appreciated was <a href="http://twemes.com/svtweetup" target="_blank">#svtweetup</a>, which got me to a pretty good face-to-face networking event.</p>
<p>Why organize points of view?  I see these benefits:</p>
<ul>
<li>Helps keep me informed about the hot topics and buzz so that I&#8217;m staying current.</li>
<li>Is a &#8220;serendipity engine&#8221; that leads me to find things I didn&#8217;t know to look for.</li>
<li>Stimulates creativity by helping me see the same old stuff in new ways.</li>
</ul>
<p>Of course, it doesn&#8217;t have to be Twitter that does this.  But of everything that the Internet has spawned, Twitter seems to have the most potential.  As I work with the APIs and data, I&#8217;m starting to get some ideas about where revenue might be, but I&#8217;ll save that until I&#8217;ve done a bit more work with it.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.nickarnett.net/2008/12/31/twitter-massively-parallel-self-organization-of-points-of-view/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Influencing and being influenced; tracking topics on Twitter</title>
		<link>http://www.nickarnett.net/2008/12/30/influencing-and-being-influenced-tracking-topics-on-twitter/</link>
		<comments>http://www.nickarnett.net/2008/12/30/influencing-and-being-influenced-tracking-topics-on-twitter/#comments</comments>
		<pubDate>Wed, 31 Dec 2008 03:47:16 +0000</pubDate>
		<dc:creator>Nick Arnett</dc:creator>
				<category><![CDATA[Influence]]></category>
		<category><![CDATA[Patterns]]></category>
		<category><![CDATA[statistics]]></category>
		<category><![CDATA[causation]]></category>
		<category><![CDATA[correlation]]></category>
		<category><![CDATA[twitter]]></category>

		<guid isPermaLink="false">http://www.nickarnett.net/?p=102</guid>
		<description><![CDATA[One of the most common naive errors in statistics is to confuse correlation with causality.  Common sense tries to tell us that when two events co-occur, the first one is causing the second to happen (which often is the case).  Red traffic lights  correlate to cars stopping and sure enough, we know that red lights [...]]]></description>
			<content:encoded><![CDATA[<p>One of the most common naive errors in statistics is to confuse correlation with causality.  Common sense tries to tell us that when two events co-occur, the first one is causing the second to happen (which often is the case).  Red traffic lights  correlate to cars stopping and sure enough, we know that red lights cause (most) people to stop.  But sometimes things correlate because a third, external mechanism is influencing them.  Drownings increase as ice cream sales rise, but ice cream isn&#8217;t causing drownings.  The external factor is summer, of course.</p>
<p>This is on my mind because over the last few days, when cold medicine hasn&#8217;t fogged my brain up so much that I couldn&#8217;t think, or at least couldn&#8217;t think logically, I&#8217;ve been working with the Twitter APIs to see what I could come up with in terms of tracking topics as they move around on Twitter.   I&#8217;m attracted to Twitter because its immediacy and brevity make it relatively easy to analyze.</p>
<p>Eventually, what I hope to do is find useful patterns in the interplay of words, Twitter screen names, URLs cited and hashtags (and any other entities that could be extracted).  I&#8217;m focusing first on URLs, since they are sort of the &#8220;stories behind the headlines&#8221; on Twitter.  My friend <a href="http://twitter.com/dland">Dave Land</a> this morning mentioned that writing a tweet is like writing a headline.  If so, then the cited URLs in those tweets are like the stories behind the headlines.  </p>
<p>I&#8217;ve put Python and SQL to work scraping statuses from Twitter, pulling out word pairs (I&#8217;m planning to analyze them with the other entities via LSA), screen names and URLs.  I&#8217;m resolving all the little URLs to the pages they actually point to, since Twitter users, limited to 140 characters, frequently use services like <a href="http://tinyurl.com/" target="_blank">TinyURL </a>to shorten them, but I want to see when people are citing the same URLs even if the shrunken URLs are different.  In fact, looking at the ratio of shrunken URLs to actual URLs is interesting &#8211; if it is high, that means that a lot of people are finding the cited page independently, rather than retweeting it or getting it from the same source external to Twitter.</p>
<p>As I find cited URLs, I&#8217;m using Twitter&#8217;s search API to get the most recent mentions of them, then storing the identities of the users who also cited them and when they did so.  That gives me a timeline of URL citations.  I&#8217;m not tracking explicit retweets, so I don&#8217;t know if the first people to cite a URL first are more influential or not.</p>
<p>I haven&#8217;t asked Twitter to white-list me yet, so I&#8217;m working within the limitations of their API &#8211; 100 requests per hour.  That forces me to be as smart as possible about how my code explores the data.  I started by choosing somebody who has a decent number of followers, but not too many, so that it wouldn&#8217;t take too long to scrape the person&#8217;s followers&#8217; tweets.  I chose <a href="http://twitter.com/timoreilly" target="_blank">Tim O&#8217;Reilly</a> because I suspect he is fairly influential on Twitter and we&#8217;ve had some conversations that go back to the mid-90s about how to figure out &#8220;what the Internet is thinking today.&#8221;  </p>
<p>O&#8217;Reilly&#8217;s company was one of the first, if not the very first, to measure social media for market research.  Many years ago, they were scraping Usenet to help decide which technologies would make good topics for books.  I recall that one of the first decisions they made from that data was to choose between two open-source database projects, MySQL and mSQL.  They chose MySQL&#8230; and therein lies another reminder of causality v. correlation.  Did MySQL succeed <em>because</em> O&#8217;Reilly chose to focus on it, or did O&#8217;Reilly succeed because it chose the right books to publish?  There is no way of knowing, but I have <a href="http://www.amazon.com/Internet-Pro-JavaScript-David-Land/dp/1565922050/ref=sr_1_1?ie=UTF8&amp;s=books&amp;qid=1230693162&amp;sr=8-1" target="_blank">personal evidence</a> that O&#8217;Reilly doesn&#8217;t always choose the right topics&#8230; or perhaps the right authors.  That&#8217;s a story for another day.</p>
<p>After a lot of wrangling with Python, MySQL and technical issues having to do with Unicode and my inability to write a correlated subquery under the influence of Sudafed, I have something working.  It started by scraping Tim&#8217;s recent tweets and then searched for people who also cited the same URLs.  Then it explores those who it has found cite the greatest number of URLs overall.  It adds 10 people at a time and then re-ranks to see who it should explore next.</p>
<p>I&#8217;ve been running this for a couple of days now in various forms.  So far I have found about 7,000 unique URLs.  Only about 300 of them are duplicates &#8211; different shrunken URLs for the same page.  The URLs have been mentioned about 20,000 times by 7,500 users.  I have found about 40,000 two-word phrases (stop words, URLs, screen names and hashtags are excluded) and 52,000 mentions of those phrases, which means that a number of phrases are being used by multiple people.</p>
<p>What the heck, here are the top phrases and the number of Twitters (Twitterers? Twits?) who used them over the last few days (remember, this is far from comprehensive):</p>
<ul>
<li>check out<span> </span>52</li>
<li>blog post<span> </span>47</li>
<li>New Year<span> </span>36</li>
<li>social media<span> </span>34</li>
<li>New blog<span> </span>33</li>
<li>new years<span> </span>29</li>
<li>New York<span> </span>25</li>
<li>ice storm<span> </span>24</li>
<li>sad true<span> </span>23</li>
<li><a href="http://search.twitter.com/search?q=&quot;emergency+generator&quot;" target="_blank">emergency generator</a><span> </span>23</li>
<li>about attitudes<span> </span>22</li>
<li>gas tax<span> </span>21</li>
<li>mornings paper<span> </span>19</li>
<li>Attention Influence<span> </span>19</li>
<li>one best<span> </span>19</li>
<li>Equal Authority<span> </span>19</li>
<li>prices people<span> </span>18</li>
<li>Jeff Jarvis<span> </span>18</li>
<li>Its morning<span> </span>18</li>
</ul>
<p>I suspect that the words &#8220;check out&#8221; on Twitter are much like the words &#8220;click here&#8221; were in the early days of the web.  &#8221;Emergency generator&#8221; intrigued me, so I linked it above to Twitter search.  Hint: it has to do with the Toyota Prius.  I suspect those same people included a link to a <a href="http://greeninc.blogs.nytimes.com/2008/12/23/prius-its-not-just-a-car-its-an-emergency-generator/?em" target="_blank">New York Times article</a> about it.  Interestingly, a number of the people who cited it were not retweeting (at least not explicitly)&#8230; but many of them were using a shrunken URL cited by &#8211; guess who &#8211; Tim O&#8217;Reilly.  The interesting thing about the popularity of the phrase is that it gives my code a way to discover the other shrunken URLs in a single search, instead of having to scrape everything and resolve every shrunken URL to the actual page.  Tim may or may not have influenced those people to look at the article, but it is clear that he is in tune with a topic that people are interested in, which makes him interesting, whether he is an influencer or just well-influenced, so to speak.</p>
<p>Time to publish this post, I guess, even though I&#8217;m tempted to wait until today&#8217;s cold medicine has worn off to proofread it one more time.</p>
<p>More results here as I come up with them.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.nickarnett.net/2008/12/30/influencing-and-being-influenced-tracking-topics-on-twitter/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Influence measurement on Twitter</title>
		<link>http://www.nickarnett.net/2008/12/22/influence-measurement-on-twitter/</link>
		<comments>http://www.nickarnett.net/2008/12/22/influence-measurement-on-twitter/#comments</comments>
		<pubDate>Tue, 23 Dec 2008 01:00:39 +0000</pubDate>
		<dc:creator>Nick Arnett</dc:creator>
				<category><![CDATA[Influence]]></category>
		<category><![CDATA[twitter]]></category>

		<guid isPermaLink="false">http://www.nickarnett.net/?p=92</guid>
		<description><![CDATA[Social media proponents (a/k/a people who want your money for social media products and services) urge us many ways, but it all boils down to &#8220;People are talking about you on the Internet, so you&#8217;d better pay attention.&#8221; Tapping into the swelling ground, grabbing a long tail or otherwise engaging in social media is supposed [...]]]></description>
			<content:encoded><![CDATA[<p>Social media proponents (a/k/a people who want your money for social media products and services) urge us many ways, but it all boils down to &#8220;People are talking about you on the Internet, so you&#8217;d better pay attention.&#8221;  Tapping into the swelling ground, grabbing a long tail or otherwise engaging in social media is supposed to help you make better products, faster, leading to happier customers and more money.  But how do you decide who to listen to?</p>
<p>Success generates noise.  Millions of customers means millions of comments.  The first and easiest answer to this dilemma, which may be good enough &#8211; for now &#8211; is to figure out which comments are most popular.  See which ones are getting the most page views, the most Diggs, Tweets or other indicators that somebody cares.</p>
<p>The problem with that approach is that by the time something is popular, it is often too late do so anything about it.  This is particularly true in two situations:</p>
<ul>
<li>You are in an industry where &#8220;hits&#8221; generate the profits and are short-lived.  Unless you&#8217;re growing corn, it seems like all products increasingly fit this description.  Maybe corn, too.</li>
<li>Your customers are angry and unhappy.</li>
</ul>
<p>If you have a hit on your hands and you find out too late to create more inventory before the crowd has moved on, you missed an opportunity.  There&#8217;s a corollary to this: the sooner you find out you have a dud, the faster you can stop wasting resources on it.</p>
<p>The really unfortunate thing about angry and unhappy people is that they consistently have more energy to invest in bad-mouthing than happy people have for paying compliments.  That&#8217;s a well-known fact of marketing.  And life.  By the time grumpiness about you and your stuff becomes popular, a lot of damage has been done, obviously.  Knowing who is influential can help you prevent grumpiness in the first place or do better at quelling it before it becomes popular opinion.</p>
<p>In mass media, the important metrics focus on popularity.  Although popularity still matters, digital social networking allows us to measure influence, at least indirectly.  The difference between being popular and being influential  is very simple to understand in principle.</p>
<p>If I have 5,000 followers on Twitter, I&#8217;m obviously fairly popular.  I probably am also influential.  There are ways to figure that out, such as by measuring how much interaction I engage in or how many times my tweets are &#8220;re-tweeted.&#8221;</p>
<p>If I have one follower, does that mean I am not influential?  Not if that one follower is a fellow named Barack Obama, who has more than 150,000 followers, according to <a href="http://twitterholic.com/" target="_blank">Twitterholic</a>.  That is, if that Obama fellow really is following me.  I mean really following me, the way we mean &#8220;follow&#8221; in the real world.</p>
<p>In other words, a few <em>influential</em> followers can be far more significant than thousands with limited influence.</p>
<p>I&#8217;m using Twitter as an example because I&#8217;ve been working with the Twitter APIs to see how hard it would be to come up with measures of influence.  Twitter is growing on me and I think that part of the reason is the language it uses.</p>
<ul>
<li>Blogs and RSS have <em>readers; </em>Twitter has <em>followers. </em></li>
<li>Forums, blogs, mailing lists and such have <em>posts</em> or <em>messages;</em> Twitter has <em>updates. </em></li>
<li>Blogs have <em>feeds; </em>Twitter has <em>APIs.</em></li>
</ul>
<p>From a social network analysis standpoint, Twitter is much easier to deal with.  Mostly.</p>
<p>Popularity is a first-order measurement.  My popularity on Twitter is the number of followers I have.  Influence is a second-order or greater measurement.  The simplest measure of potential influence is to see how many followers my followers have.   The API makes this very easy to measure (for people who aren&#8217;t so popular that the API limits become an obstacle).</p>
<p>My Twitter followers are followed by a bit more than 10,000 people.  Pretty good, I think, since I only have 28 followers (I haven&#8217;t been on Twitter long).</p>
<p>My friend Dave Land has 95 followers and those people are followed by almost 175,000 others.  Wow.  Dave&#8217;s followers are followed by a lot more people than mine are.</p>
<p>Some of my followers are people I believe are influential in the world of web analytics.  Let&#8217;s see how they do (a third-order measurement of my potential influence, if I did it for all of them).  In no particular order:</p>
<ul>
<li>Anil Batra: 232 followers, who are followed by 235,000 others.</li>
<li>Eric T. Peterson:  689 followers, who are followed by 373,000 others.</li>
<li>June Dershowitz: 266 followers, who are followed by 122,000 others.</li>
<li>Marshall Sponder: 839 followers, who are followed by 727,000 others.</li>
</ul>
<p>Avinash Kaushik, Google&#8217;s web analytics evangelist, isn&#8217;t following me (hey, bub!), but anybody whose title is &#8220;evangelist&#8221; is supposed to be influential.  At the risk of exceeding the Twitter API limits, I ran my gizmo to get his stats.  Avinash has about 2,000 followers, who are followed by almost 600,000 others.</p>
<p>If you rank these people by popularity (followers), Avinash is No. 1, hands-down.  But if you rank by potential influence, Marshall Sponder&#8217;s followers are followed by the most people, which is especially surprising given that Avinash appears to be more than twice as popular.</p>
<p>Dave Land comes in at No. 1 when this group is ranked by the ratio of second-order followers to followers.  That means he is doing the best job of attracting followers who attract followers, which is what you need to do if you want your influence to scale beyond immediate popularity.  But I should note that having a lot of followers will inevitably dilute your second-order influence, which should comfort Avinash, who came in last on that measurement (thereby saving me from last place in all three rankings).</p>
<p>I should note a messy bit of this measurement &#8211; sites like Woot, Twemes, hashtags.org and others that automatically follow you when you follow them.  Ugh.  I haven&#8217;t figured out a good way to exclude them, so I&#8217;m just doing it manually&#8230; and I haven&#8217;t thoroughly made sure I caught all of them.  So there&#8217;s hope, Avinash &#8211; maybe Marshall is just signed up for more of those.  In any case, don&#8217;t take these numbers too seriously.  I&#8217;m going to work on some additional data points &#8211; number of replies and such, to strengthen the results.</p>
<p>Or can somebody save me this work and point to a site that has already done this sort of analysis?  I searched but didn&#8217;t see anybody looking at second-order popularity.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.nickarnett.net/2008/12/22/influence-measurement-on-twitter/feed/</wfw:commentRss>
		<slash:comments>8</slash:comments>
		</item>
	</channel>
</rss>
