<?xml version="1.0" encoding="UTF-8"?><!-- generator="wordpress.com" -->
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	>

<channel>
	<title>data-mining &amp;laquo; WordPress.com Tag Feed</title>
	<link>http://en.wordpress.com/tag/data-mining/</link>
	<description>Feed of posts on WordPress.com tagged "data-mining"</description>
	<pubDate>Thu, 23 May 2013 12:37:49 +0000</pubDate>

	<generator>http://en.wordpress.com/tags/</generator>
	<language>en</language>

<item>
<title><![CDATA["NO BOUNDARIES, NO LIMITS, TAXING IDEOLOGY" an essay by Cheryl Pass]]></title>
<link>http://conservativesonfire.wordpress.com/2013/05/04/no-boundaries-no-limits-taxing-ideology-an-essay-by-cheryl-pass/</link>
<pubDate>Sat, 04 May 2013 11:45:26 +0000</pubDate>
<dc:creator>Conservatives on Fire</dc:creator>
<guid>http://conservativesonfire.wordpress.com/2013/05/04/no-boundaries-no-limits-taxing-ideology-an-essay-by-cheryl-pass/</guid>
<description><![CDATA[Today&#8217;s Guest Saturday post is by Cheryl Pass of the My Tea Party Chronicle blog. In this essa]]></description>
<content:encoded><![CDATA[<p>Today&#8217;s Guest Saturday post is by Cheryl Pass of the <a href="http://myteapartychronicle.blogspot.com/">My Tea Party Chronicle</a> blog. In this essay,  Cheryl reflects on the seemingly limitless power of our government.  She originally published this essay on Friday, April 26, 2013.</p>
<p style="text-align:center;">**********</p>
<h3>NO BOUNDARIES, NO LIMITS, TAXING IDEOLOGY</h3>
<div>I was thinking of writing something else today on the abuse of tax dollars, but that took me to another larger perspective about how government doesn&#8217;t recognize any limits on its power and controls over others.  The further I delve into public policy issues, the more I find that government entities at all levels are abusing the limits with taxing powers.  Limits?  What limits?  Did I say limits?  That may be a &#8220;duh&#8221; moment to most of you.  It&#8217;s just that I am finding it is worse than I thought.</div>
<div></div>
<div>Locally we are learning that different cities in our State are taking in taxes, fees, and funds to be used for specific purposes, but instead they are throwing those taxes into some other pot and frittering the money away.  No one stops them.  No holds are barred.  I&#8217;ve read that Charlotte, NC has been taking money out of the Airport pot of money and used it as a cash cow for other projects.  I learned today that Asheville, NC has done similar things with water and sewer monies.  Well, why wouldn&#8217;t they think this is just fine since the Federal government has been doing this type of money juggling for years?  Didn&#8217;t the Social Security money just magically disappear into the swamp? Who set the standard?  It&#8217;s just the way things are done now, eh?</div>
<div></div>
<div>The other cute game played by Federal and State governments is a language game. The leftists love language games.  Transportation taxes and fees collected from cars and trucks were established to be used on roads and bridges for cars and trucks.  But a funny thing happened on the way to the forum.  Transportation was redefined to mean walking and bicycling and hopping on a trolley car or light rail.  Those modes of transportation obviously do not pay anything for roads and bridges.  But the drivers over roads and bridges all of a sudden were co-opted into paying for walking, bicycling, mass transit, greenways, and other subsidized modes of getting around.  Transportation means mobility now. And the pot of money going into &#8220;educating&#8221; the public about how &#8220;sustainable&#8221; it is to do this is huge.  Millions paying for &#8220;educators&#8221; and training sessions.  Car owners / drivers can now pay for &#8220;educating people about how great it is to add bike lanes, crowding those cars right off the roads.  How clever is that?  Any kind of mobility?  Maybe they should add skate boards and scooters.   How about roller skates?  Segues?  Where does this end?</div>
<div></div>
<div>It doesn&#8217;t.  And that&#8217;s the reason that a sales tax was added to the sale of a house to pay for Obamacare.  There are no more restraints on taxes and the abuse of them.  The people in government have given themselves carte blanche to tax anything for the purpose of anything.  Ta da!!  Presto Change-o!  You think your property taxes are going to pay for your local services and schools?  Not so much.  You think gasoline taxes are paying for the road you drive on?  Not so much.  You think your utility fees are going to pay for utilities you use?  Well&#8230;.maybe not.  Yes and no.  It&#8217;s all up for grabs.</div>
<div></div>
<div>You might think education money would be used for school buildings, teachers, and text books.  Now education money is being used to grill children about their families&#8217; personal lives under the edicts of Common Core.   Education money is being used to buy equipment to track that data.  <a href="http://www.thenewamerican.com/culture/education/item/15213-data-mining-students-through-common-core">This is called &#8220;data mining.&#8221;</a>  Would you allow your children to be &#8220;data mined&#8221; by the Federal government?  This is data that will be used to feed the insatiable government appetite for information in order to control people.  Same same with Obamacare money.  A great chunk of it is going into data collection technology for the Federal government.  In what stretch of the imagination does the Federal government have a right to collect data from either students or from healthcare patients?</div>
<div></div>
<div>And now we are talking about taxing internet sales?   News flash.  We already have internet sales taxes if you sell within your own state.  But now it will be a Federally run program, policed by the Federal government.  This time, taxing sales to other States.  You already pay shipping fees to a shipper who pays transportation and income taxes.  You already pay State and Federal income taxes.  And now&#8230;let&#8217;s add another sales tax to the mix?  Another bureaucracy will be added.  Maybe it will just merge into the IRS?  Oh good!  Can&#8217;t wait til all of that happens.  This has to be just great for intrastate commerce.  Right?  The excuse this time is&#8230;..wait for it&#8230;&#8221;fairness?&#8221;</div>
<div></div>
<div>I wonder if there is a system of government that restricts government from this bait and switch game.  Oh&#8230;maybe that was the Constitution.  Long ago, in a galaxy far far away&#8230;?</div>
<div></div>
<div>No boundaries.  No limits.  Would you say some people are out of control?  Or as Ross Perot used to say about Mexico, can you hear that &#8220;giant sucking sound?&#8221;  Only this time it is from all levels of our own government and bureaucratic NGO&#8217;s.  The biggest sucking sound is coming from Washington DC, but I&#8217;m finding this limitless thievery of money and of personal information is going on in every little nook and cranny. America is sucking itself dry.</div>
<p>&#160;</p>
<p>&#160;</p>
<p>&#160;</p>
<p>&#160;</p>
<p>&#160;</p>
<p>&#160;</p>
<p>&#160;</p>
<p>&#160;</p>
<p>&#160;</p>
]]></content:encoded>
</item>
<item>
<title><![CDATA[FROM the Impossible to the Imaginable: 'Data Profiling' Big Business and Very Big Science Data]]></title>
<link>http://3d-metrics.me/2013/05/04/from-the-impossible-to-the-imaginable-data-profiling-big-business-and-very-big-science-data/</link>
<pubDate>Sat, 04 May 2013 11:04:38 +0000</pubDate>
<dc:creator>Sabine Kurjo McNeill</dc:creator>
<guid>http://3d-metrics.me/2013/05/04/from-the-impossible-to-the-imaginable-data-profiling-big-business-and-very-big-science-data/</guid>
<description><![CDATA[Profiling Big Data for Business[1] and Very Big Data for Science[2] Peaks, Troughs and Averages – in]]></description>
<content:encoded><![CDATA[<p align="center"><b>Profiling <a href="http://archive.ics.uci.edu/ml/datasets.html?format=&#38;task=&#38;att=&#38;area=&#38;numAtt=&#38;numIns=&#38;type=dt&#38;sort=nameUp&#38;view=table">Big Data for Business</a><a title="" href="/Users/Sabine/Documents/My%20ABC%20Documents%2011%2005%2030/3D%20Metrics/Articles/Coding/Essentials/13%2005%2003%20Data%20Profiling.doc#_ftn1"><b>[1]</b></a> and <a href="http://library2.epfl.ch/page-90988-en.html">Very Big Data for Science</a><a title="" href="/Users/Sabine/Documents/My%20ABC%20Documents%2011%2005%2030/3D%20Metrics/Articles/Coding/Essentials/13%2005%2003%20Data%20Profiling.doc#_ftn2"><b>[2]</b></a></b></p>
<p align="center"><b>Peaks, Troughs and Averages – independent of Application and Scale -<br />
in new Depth, Detail and Perspective</b></p>
<div align="center">
<table border="1" cellspacing="0" cellpadding="0">
<tbody>
<tr>
<td width="204"> <a href="http://3dmetricsdotme.files.wordpress.com/2013/05/verti-n30-m40-300x215.jpg"><img class="alignleft size-full wp-image-360" alt="verti-n30-m40-300x215" src="http://3dmetricsdotme.files.wordpress.com/2013/05/verti-n30-m40-300x215.jpg?w=300&#038;h=215" width="300" height="215" /></a></td>
<td width="208"> <a href="http://3dmetricsdotme.files.wordpress.com/2013/05/multi-dimensional-sine-8d.gif"><img class="alignleft size-medium wp-image-361" alt="Multi-dimensional sine 8d" src="http://3dmetricsdotme.files.wordpress.com/2013/05/multi-dimensional-sine-8d.gif?w=300&#038;h=214" width="300" height="214" /></a></td>
<td width="204"> <a href="http://3dmetricsdotme.files.wordpress.com/2013/05/jouzel-selects-001-1934-diffs-x-sc.jpg"><img class="alignleft size-medium wp-image-362" alt="Jouzel Selects 001 1934 diffs x sc" src="http://3dmetricsdotme.files.wordpress.com/2013/05/jouzel-selects-001-1934-diffs-x-sc.jpg?w=300&#038;h=232" width="300" height="232" /></a></td>
</tr>
<tr>
<td colspan="3" width="616">
<p align="center"><b>Vertical Data Layers along a ‘visual z-axis’</b></p>
</td>
</tr>
</tbody>
</table>
</div>
<p><span style="color:#333399;"><b>Profiling Data</b></span> is based on <a href="http://3d-metrics.com/wordpress/software-methods/layering-complex-data">layering complex data</a><a title="" href="/Users/Sabine/Documents/My%20ABC%20Documents%2011%2005%2030/3D%20Metrics/Articles/Coding/Essentials/13%2005%2003%20Data%20Profiling.doc#_ftn3">[3]</a> into vertical slices along a ‘visual z-axis’. This disruptive software method processes multivariate, univariate, sequential and time series data alike. It is now apparent that its algorithms can be usefully applied to bridging knowledge gaps between finance, economics, business and science.</p>
<p>In particular, market data, business intelligence, medical and climate data, measurements, sensor data, experimental results, clinical and experiential data can all be visualised with more depth, detail and other advantages:</p>
<ul>
<li><span style="color:#ff0000;">‘Extreme data profiles’</span> show minima, maxima, averages and other statistical measures and derived metadata:
<ul>
<li>This allows for sorting, ranking, weighting and prioritising visually.</li>
</ul>
</li>
<li>A new forecasting mechanism can be applied for short-, medium and long-term trends:
<ul>
<li>Overlaying these trends allows for new predictive time windows.</li>
<li>Parameters are derived from historic data to fine tune the generic approach.</li>
</ul>
</li>
</ul>
<p><span style="line-height:24px;">In business, <span style="color:#ff0000;">time is money</span>. In life and science, <span style="color:#ff0000;">knowledge is power</span>. It is therefore a promising perspective to examine business and science related data with the same tool!</span></p>
<p><b>In the spirit </b>of <a href="http://en.wikipedia.org/wiki/Alexander_von_Humboldt#Humboldtian_science">Humboldtian Science</a><a title="" href="/Users/Sabine/Documents/My%20ABC%20Documents%2011%2005%2030/3D%20Metrics/Articles/Coding/Essentials/13%2005%2003%20Data%20Profiling.doc#_ftn4">[4]</a>,<b> </b>‘<a href="http://3d-metrics.com/gallery/main.php">software lenses</a>’<a title="" href="/Users/Sabine/Documents/My%20ABC%20Documents%2011%2005%2030/3D%20Metrics/Articles/Coding/Essentials/13%2005%2003%20Data%20Profiling.doc#_ftn5">[5]</a> are state of the art measuring instruments to compare hitherto incomparable data and images from finance, business and science. Visual assessments by data experts, combined with numerical processing power, allow for creating ‘<a href="http://3d-metrics.me/key-features/smart-knowledge-systems/">Smart Knowledge Portals</a>’, e.g.:</p>
<ul>
<li><b>Smart Particle Physics</b>: meta views of individual as well as grouped experiments;</li>
<li><b>Smart Science: </b>the data source and expertise determine domain, scale and insights;</li>
<li><b>Smart Energy Consumption: </b>systems of control and communication<b> </b>are fine tuned and tweaked<b> </b>as peaks, troughs and other numerical metadata are aligned over different time spans.</li>
</ul>
<p>It is hoped that collaboration can be created between <a class="zem_slink" title="CERN" href="http://maps.google.com/maps?ll=46.2341666667,6.05277777778&#38;spn=0.01,0.01&#38;q=46.2341666667,6.05277777778 (CERN)&#38;t=h" target="_blank" rel="geolocation">CERN</a> and <a href="http://3d-metrics.me/about/">3D Metrics</a><a title="" href="/Users/Sabine/Documents/My%20ABC%20Documents%2011%2005%2030/3D%20Metrics/Articles/Coding/Essentials/13%2005%2003%20Data%20Profiling.doc#_ftn6">[6]</a> to accelerate science:</p>
<ul>
<li>roughly <a href="http://3d-metrics.me/2013/04/28/celebrating-50-years-of-computing-cern/">50 Years after Computer Operations were established</a><a title="" href="/Users/Sabine/Documents/My%20ABC%20Documents%2011%2005%2030/3D%20Metrics/Articles/Coding/Essentials/13%2005%2003%20Data%20Profiling.doc#_ftn7">[7]</a></li>
<li><a href="http://cerncourier.com/cws/article/cern/28358">25-20 years after the world-wide web was born</a>[8]</li>
<li>hopefully appreciating the potential of ‘software lenses’ as ‘inter-data processors’ and <a href="http://3d-metrics.me/2013/04/15/empowering-young-people-with-new-tools-for-investigation-and-evaluation/">new instruments of investigation</a><a title="" href="/Users/Sabine/Documents/My%20ABC%20Documents%2011%2005%2030/3D%20Metrics/Articles/Coding/Essentials/13%2005%2003%20Data%20Profiling.doc#_ftn8">[9]</a> &#8211; for meta views and in-depth analysis &#8211; of data- and measurement-based science.</li>
</ul>
<p>PS. I took the phrase &#8217;from the impossible to the imaginable&#8217; from <a href="http://cern.ch/ben">Ben Segal</a>&#8216;s review of the book on <a href="http://cerncourier.com/cws/article/cern/28358">How the Web was Born</a>.</p>
<div>
<hr align="left" size="1" width="33%" />
<div><a title="" href="/Users/Sabine/Documents/My%20ABC%20Documents%2011%2005%2030/3D%20Metrics/Articles/Coding/Essentials/13%2005%2003%20Data%20Profiling.doc#_ftnref1">[1]</a> <a href="http://bit.ly/18Dmecu">http://bit.ly/18Dmecu</a></div>
<div><a title="" href="/Users/Sabine/Documents/My%20ABC%20Documents%2011%2005%2030/3D%20Metrics/Articles/Coding/Essentials/13%2005%2003%20Data%20Profiling.doc#_ftnref2">[2]</a> <a href="http://library2.epfl.ch/page-90988-en.html">http://library2.epfl.ch/page-90988-en.html</a></div>
<div><a title="" href="/Users/Sabine/Documents/My%20ABC%20Documents%2011%2005%2030/3D%20Metrics/Articles/Coding/Essentials/13%2005%2003%20Data%20Profiling.doc#_ftnref3">[3]</a> <a href="http://3d-metrics.com/wordpress/software-methods/layering-complex-data">http://3d-metrics.com/wordpress/software-methods/layering-complex-data</a></div>
<div><a title="" href="/Users/Sabine/Documents/My%20ABC%20Documents%2011%2005%2030/3D%20Metrics/Articles/Coding/Essentials/13%2005%2003%20Data%20Profiling.doc#_ftnref4">[4]</a> <a href="http://en.wikipedia.org/wiki/Alexander_von_Humboldt#Humboldtian_science">http://en.wikipedia.org/wiki/Alexander_von_Humboldt#Humboldtian_science</a></div>
<div><a title="" href="/Users/Sabine/Documents/My%20ABC%20Documents%2011%2005%2030/3D%20Metrics/Articles/Coding/Essentials/13%2005%2003%20Data%20Profiling.doc#_ftnref5">[5]</a> <a href="http://3d-metrics.com/gallery/main.php">http://3d-metrics.com/gallery/main.php</a></div>
<div><a title="" href="/Users/Sabine/Documents/My%20ABC%20Documents%2011%2005%2030/3D%20Metrics/Articles/Coding/Essentials/13%2005%2003%20Data%20Profiling.doc#_ftnref6">[6]</a> <a href="http://3d-metrics.me/about/">http://3d-metrics.me/about/</a></div>
<div><a title="" href="/Users/Sabine/Documents/My%20ABC%20Documents%2011%2005%2030/3D%20Metrics/Articles/Coding/Essentials/13%2005%2003%20Data%20Profiling.doc#_ftnref7">[7]</a> <a href="http://3d-metrics.me/2013/04/28/celebrating-50-years-of-computing-cern/">http://3d-metrics.me/2013/04/28/celebrating-50-years-of-computing-cern/</a></div>
<div><a href="http://3d-metrics.me/2013/04/28/celebrating-50-years-of-computing-cern/">[8]</a> <a href="http://cerncourier.com/cws/article/cern/28358">http://cerncourier.com/cws/article/cern/28358</a></div>
<div><a title="" href="/Users/Sabine/Documents/My%20ABC%20Documents%2011%2005%2030/3D%20Metrics/Articles/Coding/Essentials/13%2005%2003%20Data%20Profiling.doc#_ftnref8">[9]</a> <a href="http://3d-metrics.me/2013/04/15/empowering-young-people-with-new-tools-for-investigation-and-evaluation/">http://3d-metrics.me/2013/04/15/empowering-young-people-with-new-tools-for-investigation-and-evaluation/</a></div>
</div>
<h6 class="zemanta-related-title" style="font-size:1em;">Related articles</h6>
<ul class="zemanta-article-ul">
<li class="zemanta-article-ul-li"><a href="http://3dmetrics.me.uk/2013/04/25/smart-knowledge-portals-as-new-instruments-of-investigation-for-lifelong-learning-and-education/" target="_blank">SMART Knowledge Portals as New Instruments of Investigation &#8211; for lifelong learning and education</a> (3dmetrics.me.uk)</li>
<li class="zemanta-article-ul-li"><a href="http://www.10gen.com/blog/post/why-open-source-essential-big-data" target="_blank">Why Open Source Is Essential To Big Data</a> (10gen.com)</li>
<li class="zemanta-article-ul-li"><a href="http://www.cloudave.com/28084/visualisation-the-key-that-unlocks-datas-value/" target="_blank">Visualisation &#8211; the key that unlocks data&#8217;s value?</a> (cloudave.com)</li>
<li class="zemanta-article-ul-li"><a href="http://www.domo.com/blog/2012/04/sensory-overload/" target="_blank">Big Data Infographic &#124; How Big is Big Data? &#124; Domo &#124; Blog</a> (domo.com)</li>
<li class="zemanta-article-ul-li"><a href="http://www.domo.com/blog/2011/08/data-data-everywhere/" target="_blank">How Are You Managing Big Data? Data, Data Everywhere &#124; Domo &#124; Blog</a> (domo.com)</li>
<li class="zemanta-article-ul-li"><a href="http://nationalresearchcenter.wordpress.com/2013/05/03/the-rise-of-big-data/" target="_blank">The Rise of Big Data</a> (nationalresearchcenter.wordpress.com)</li>
<li class="zemanta-article-ul-li"><a href="http://www.irishtimes.com/news/consumer/keeping-an-eye-on-big-data-1.1380610" target="_blank">Keeping an eye on big data</a> (irishtimes.com)</li>
<li class="zemanta-article-ul-li"><a href="http://www.fastcolabs.com/3008620/lessons-crash-course-data-science" target="_blank">Lessons From A Crash Course In Data Science</a> (fastcolabs.com)</li>
</ul>
]]></content:encoded>
</item>
<item>
<title><![CDATA[Resource: Introduction to Data Science by Prof Bill Howe, UW]]></title>
<link>http://parasdoshi.com/2013/05/03/resource-introduction-to-data-science-by-prof-bill-howe-uw/</link>
<pubDate>Fri, 03 May 2013 19:42:56 +0000</pubDate>
<dc:creator>Paras Doshi</dc:creator>
<guid>http://parasdoshi.com/2013/05/03/resource-introduction-to-data-science-by-prof-bill-howe-uw/</guid>
<description><![CDATA[Introduction to Data Science course taught by Bill Howe just started on coursera platform. Having st]]></description>
<content:encoded><![CDATA[Introduction to Data Science course taught by Bill Howe just started on coursera platform. Having st]]></content:encoded>
</item>
<item>
<title><![CDATA[It's Finally Friday!!!]]></title>
<link>http://myadventuresinadulthood.wordpress.com/2013/05/03/its-finally-friday-36/</link>
<pubDate>Fri, 03 May 2013 17:46:05 +0000</pubDate>
<dc:creator>myadventuresinadulthood</dc:creator>
<guid>http://myadventuresinadulthood.wordpress.com/2013/05/03/its-finally-friday-36/</guid>
<description><![CDATA[Quote of the Day: Wine Me, Dine Me: Does wine speak make you feel super inadequate?? Find out what a]]></description>
<content:encoded><![CDATA[<p><strong>Quote of the Day:</strong></p>
<p><a href="http://myadventuresinadulthood.files.wordpress.com/2013/05/act-your-age.jpg"><img class="aligncenter size-full wp-image-3044" alt="act your age" src="http://myadventuresinadulthood.files.wordpress.com/2013/05/act-your-age.jpg?w=480&#038;h=474" width="480" height="474" /></a></p>
<p><strong>Wine Me, Dine Me:</strong></p>
<p>Does wine speak make you feel super inadequate?? Find out what a <a title="tannin" href="http://www.thedailymuse.com/health/whats-a-tannin-and-other-wine-words-you-should-know/" target="_blank">tannin</a> is other useful stuff to make you feel smarter around those wine snobs.</p>
<p><strong>Luscious Locks:</strong></p>
<p>Never have a <a title="bad hair" href="http://www.thedailymuse.com/style/20361/" target="_blank">bad hair</a> day again.</p>
<p><strong>Nesting:</strong></p>
<p>Save money and avoid <a title="toxins" href="http://www.mindbodygreen.com/0-8403/72-uses-for-simple-household-products-to-save-money-avoid-toxins.html" target="_blank">toxins</a>.</p>
<p><strong>DIY:</strong></p>
<p>Got some old furniture kicking around?? <a title="Polish" href="http://www.apartmenttherapy.com/5-recipes-for-cleaning-polishing-amp-restoring-wood-188526" target="_blank">Polish</a> it up and give it a new look.</p>
<p><strong>Techie Love:</strong></p>
<p>Block websites from collecting your <a title="data" href="http://mashable.com/2013/04/17/disconnect-me/?utm_medium=email&#38;utm_source=feedburner&#38;utm_campaign=Feed%3A+Mashable+%28Mashable%29&#38;utm_content=Yahoo!+Mail" target="_blank">data</a>.</p>
<p><strong>Climbing the Ladder:</strong></p>
<p>How to win over any <a title="interviewer" href="http://www.thedailymuse.com/job-search/how-to-read-your-interviewers-and-win-them-over/" target="_blank">interviewer</a>.</p>
<p><strong>Body Lovin&#8217;:</strong></p>
<p>Trouble falling asleep? Try listening to the most.relaxing.<a title="song" href="http://www.apartmenttherapy.com/listen-to-the-most-relaxing-song-ever-170097" target="_blank">song</a>.ever.</p>
<p><strong>Inspirational:</strong></p>
<p>Being <a title="alone" href="http://www.mindbodygreen.com/0-5834/25-Ways-to-Be-Alone-But-Not-Lonely.html" target="_blank">alone</a> doesn&#8217;t mean your lonely.</p>
<p><strong>Giggles and Such:</strong></p>
<p>Punch yourself in the <a title="face" href="http://www.funnyordie.com/videos/4ad20b4edf/michael-shannon-reads-the-insane-sorority-letter?playlist=featured_videos" target="_blank">face</a>. Right now.</p>
<p><a href="http://myadventuresinadulthood.files.wordpress.com/2013/05/douche-bagastan.jpg"><img class="aligncenter size-full wp-image-3046" alt="douche bagastan" src="http://myadventuresinadulthood.files.wordpress.com/2013/05/douche-bagastan.jpg?w=420&#038;h=294" width="420" height="294" /></a><a href="http://myadventuresinadulthood.files.wordpress.com/2013/05/eat-crayons-poop-rainbows.jpg"><img class="aligncenter size-full wp-image-3047" alt="eat crayons poop rainbows" src="http://myadventuresinadulthood.files.wordpress.com/2013/05/eat-crayons-poop-rainbows.jpg?w=426&#038;h=519" width="426" height="519" /></a></p>
<p><a href="http://myadventuresinadulthood.files.wordpress.com/2013/05/dog-mermaids.jpg"><img class="aligncenter size-full wp-image-3045" alt="dog mermaids" src="http://myadventuresinadulthood.files.wordpress.com/2013/05/dog-mermaids.jpg?w=477&#038;h=548" width="477" height="548" /></a>Have a great weekend!</p>
]]></content:encoded>
</item>
<item>
<title><![CDATA[Tapping into the Information Grid]]></title>
<link>http://edfutureblog.wordpress.com/2013/05/03/tapping-into-the-information-grid/</link>
<pubDate>Fri, 03 May 2013 14:39:28 +0000</pubDate>
<dc:creator>Raghu's Blog</dc:creator>
<guid>http://edfutureblog.wordpress.com/2013/05/03/tapping-into-the-information-grid/</guid>
<description><![CDATA[The Myths and Realities of Big Data In the age of information, big data is getting bigger. The rate]]></description>
<content:encoded><![CDATA[<p><strong>The Myths and Realities of <a class="zem_slink" title="Big data" href="http://en.wikipedia.org/wiki/Big_data" target="_blank" rel="wikipedia">Big Data</a></strong></p>
<p>In the <a class="zem_slink" title="Information Age" href="http://en.wikipedia.org/wiki/Information_Age" target="_blank" rel="wikipedia">age of information</a>, big <a class="zem_slink" title="Data" href="http://en.wikipedia.org/wiki/Data" target="_blank" rel="wikipedia">data</a> is getting bigger. The rate of data growth is so fast in fact that 90% of the data existing in the world today has been created only over the past two years. This wealth of information comes streaming in from a multitude of channels across many industries. The question for <a class="zem_slink" title="Education" href="http://en.wikipedia.org/wiki/Education" target="_blank" rel="wikipedia">educators</a> and learners is how to use big data to improve the educational experience.</p>
<p>Before we can start leveraging such information, we should understand the realities of big data and common myths surrounding it.</p>
<p><em>Myth 1: The more data, the better.</em></p>
<p>Two factors are involved in terms of quantity. First, there’s the sheer volume. Second, there’s the breadth of data; the range of types of information available. When it comes to both, more isn’t necessarily better.</p>
<p>At best, an overabundance or unbalanced variety of data can lack any entirely clear results and at worst, can actually produce conflicting outcomes. Having a heavy volume of one certain kind of information might lead analysts in the wrong direction. For example, a large quantity of transaction data might leave out a great deal of <a class="zem_slink" title="Insight" href="http://en.wikipedia.org/wiki/Insight" target="_blank" rel="wikipedia">insight</a> provided by social interaction data.</p>
<p>Given the rate at which our society produces new stats and figures, big data can easily become out of date. Gathering mass amounts of information for the sake of volume may not always prove helpful. At some point it runs the risk of losing relevance. Therefore determining which pieces of data to use is of high importance.</p>
<p><em>Myth 2: Data Scientists are the only ones who can effectively extract insight from big data.</em></p>
<p>Described by some as “part analyst, part artist,” the big data scientist is often thought of as the only key to unlocking the <a class="zem_slink" title="Solution" href="http://en.wikipedia.org/wiki/Solution" target="_blank" rel="wikipedia">solutions</a> found in mass quantities of information. While that might have held true at one point, new data tools are making it easier for all education leaders to engage and utilize big data as well. For example, by presenting information in a visual format, such as a <a class="zem_slink" title="Heat map" href="http://en.wikipedia.org/wiki/Heat_map" target="_blank" rel="wikipedia">heat map</a>, these tools allow more people to grab hold of high level directional indicators and recognize what to focus on within their field.</p>
<p><em>Myth 3: Big data alone will provide solutions to all our problems.  </em></p>
<p>There’s no arguing that big data has the potential to deliver incredible insight into <a class="zem_slink" title="Human behavior" href="http://en.wikipedia.org/wiki/Human_behavior" target="_blank" rel="wikipedia">human behavior</a> and lead to successful solutions based on that insight. But in order to arrive at truly meaningful results, big data most often needs to be placed in context.</p>
<p>Take information derived from <a class="zem_slink" title="Facebook" href="http://facebook.com" target="_blank" rel="homepage">Facebook</a> as an example. A post might receive a myriad of “likes,” but this action could have a variety of reasons behind it. A user could “like” a comment because his or her thoughts align with that of the specific post <i>or </i>it could simply be because the user has an <a class="zem_slink" title="Emotional bias" href="http://en.wikipedia.org/wiki/Emotional_bias" target="_blank" rel="wikipedia">emotional bias</a> towards the person who posted. Without an understanding of the context behind data points, results and the solutions that are derived from them run the risk of overlooking important variables.</p>
<p>In the education realm, successfully leveraging big data can help teachers identify the kinds of learning materials that will be most helpful to students on an individual level as well as provide valuable insight into educational trends across an entire school system. With more options at our fingertips, we can start embracing the big data movement from all positions throughout the industry; tapping into its potential in order to facilitate improvement within the education system.</p>
]]></content:encoded>
</item>
<item>
<title><![CDATA[Facebook's Data Mining]]></title>
<link>http://workplacewise.wordpress.com/2013/05/03/facebooks-data-mining/</link>
<pubDate>Fri, 03 May 2013 13:56:37 +0000</pubDate>
<dc:creator>Dr. Susannah</dc:creator>
<guid>http://workplacewise.wordpress.com/2013/05/03/facebooks-data-mining/</guid>
<description><![CDATA[If a church or government were doing these things, it would feel authoritarian, but when technologis]]></description>
<content:encoded><![CDATA[<div class="mediumText" style="font-size:14px;color:#181818;font-family:Georgia, Times, 'Times New Roman', serif;line-height:18px;background-color:#ffffff;">
<blockquote><p>If a church or government were doing these things, it would feel authoritarian, but when technologists are the culprits, we seem hip, fresh, and inventive. People will accept ideas presented in technological form that would be abhorrent in any other form. It is utterly strange to hear my many old friends in the world of <a class="zem_slink" title="Information Age" href="http://en.wikipedia.org/wiki/Information_Age" target="_blank" rel="wikipedia">digital culture</a> claim to be the true sons of the Renaissance without realizing that using computers to reduce individual expression is a primitive, retrograde activity, no matter how sophisticated your tools are.” ~ <a style="color:#666600;text-decoration:none;" href="http://www.goodreads.com/author/show/3010868.Jaron_Lanier">Jaron Lanier</a>, <i><a style="color:#666600;text-decoration:none;" href="http://www.goodreads.com/work/quotes/6878840">You are Not a Gadget</a></i></p></blockquote>
</div>
<div class="wp-caption alignright" style="width: 170px"><a href="http://www.flickr.com/photos/11762101@N00/2251266697" target="_blank"><img class="zemanta-img-inserted zemanta-img-configured" title="facebook" alt="facebook" src="http://farm3.static.flickr.com/2171/2251266697_5304abac74_m.jpg" width="160" height="84" /></a><p class="wp-caption-text">facebook (Photo credit: sitmonkeysupreme)</p></div>
<p>I think <a class="zem_slink" title="Facebook" href="http://en.wikipedia.org/wiki/Facebook" target="_blank" rel="wikipedia">Facebook</a> might, on some deeply hidden level, actually be a bit dodgy. Thinking about human nature, there is at least the possibility of some truly slimy values driving the development and changes to this <a class="zem_slink" title="Social media" href="http://en.wikipedia.org/wiki/Social_media" target="_blank" rel="wikipedia">social media</a> behemoth.</p>
<p>Nevertheless&#8230;</p>
<p>Facebook for me, fulfils exactly what it&#8217;s purported purpose is ~ to keep me connected with my family half a world away. That&#8217;s all I expect it to do. So on the whole, FB does an exceptional job in this role.</p>
<p>I don&#8217;t expect Fb or it&#8217;s platform, ideals, or agenda to become my social life. I don&#8217;t expect Fb to replace my face-to-face friendships, my Friday Social Circle, or the opportunities I have to be in the actual presence of my daughters, grandsons, family, or friends. I don&#8217;t &#8220;friend&#8221; people I don&#8217;t know because my definition of friendship is much deeper than simply having common likes or dislikes. I don&#8217;t post things I wouldn&#8217;t want my grandsons to see in 30 years. I don&#8217;t talk about things I shouldn&#8217;t, I keep a boundary between my personal and professional lives, and I don&#8217;t forget that what people post/write/picture of themselves is generally their idea of the &#8220;good parts version.&#8221;</p>
<p>Lastly, I stay on top of privacy settings, simply because to me, keeping my personal life <em>personal</em> is a no-brainer.  &#8217;Personal&#8217; happens along a continuum ~ my idea of personal is much, much broader than Bill&#8217;s, and our respective pages reflect this difference. This aspect (privacy) is no different than turning my back to the rude eavesdropper at the next table in the local coffee shop. I&#8217;m not going to leave, but I&#8217;m not going to share willingly, either.</p>
<p><img class="alignleft size-full wp-image-257" alt="" src="http://workplacewise.files.wordpress.com/2013/05/dataminingtechniques-homepage.jpg?w=261&#038;h=259" width="261" height="259" />If Fb is data mining (and why wouldn&#8217;t they? It&#8217;s not a charity endeavour) then whatever information they get from my likes and dislikes, web surfing, etc., is going into the pool with the millions of others who have similar tastes as me. If I&#8217;m then susceptible to the annoying ads that have now become part of my news feed, that would be my problem &#8211; not <a class="zem_slink" title="Mark Zuckerberg" href="http://https://www.facebook.com/zuck" target="_blank" rel="homepage">Mark Zuckerberg</a>&#8216;s.</p>
<p>In all likelihood, Fb will become more and more commercial, and if/when it gets to the point where the annoyance far outweighs the value gained in maintaining contact with distant loved ones, I&#8217;ll look for a smaller, less hyped app. There will definitely be some, and when I find what I&#8217;m looking for, me and my friends will migrate en masse and wave goodbye to Fb.</p>
<p>In the meantime, I take the suggestions about security seriously. Take a moment to read through the article below and make sure you&#8217;ve done what you need to do to protect yourself.</p>
<p><a href="http://www.linkedin.com/today/post/article/20130502052254-64875646-how-facebook-exploits-your-private-information">How Facebook Exploits Your Private Information &#124; LinkedIn</a>.</p>
<h6 class="zemanta-related-title" style="font-size:1em;">Related articles</h6>
<ul class="zemanta-article-ul">
<li class="zemanta-article-ul-li"><a href="http://www.fool.com/investing/general/2013/04/29/linkedin-just-became-my-go-to-social-network.aspx" target="_blank">LinkedIn Just Became My Go-To Social Network</a> (fool.com)</li>
<li class="zemanta-article-ul-li"><a href="http://www.prweb.com/releases/nate-long-marketing/3-things-never-facebook/prweb10688706.htm" target="_blank">3 Things You Should Never Do On Facebook</a> (prweb.com)</li>
<li class="zemanta-article-ul-li"><a href="http://dbpxhaust.wordpress.com/2013/04/29/fav-jaron-lanier-why-facebook-isnt-free-dbpvids/" target="_blank">Fav: &#8220;Jaron Lanier: Why Facebook Isn&#8217;t Free&#8221; #dbpvids</a> (dbpxhaust.wordpress.com)</li>
<li class="zemanta-article-ul-li"><a href="http://socialtimes.com/linked-in-block-feature-change-org-petition_b125375" target="_blank">Stalked On LinkedIn: Victim Is Not Alone in Her Fight to Block Abusive Members</a> (socialtimes.com)</li>
</ul>
]]></content:encoded>
</item>
<item>
<title><![CDATA[Mencoba kehidupan baru dengan GIS]]></title>
<link>http://purnamaalamsyah.wordpress.com/2013/05/03/mencoba-kehidupan-baru-dengan-gis/</link>
<pubDate>Fri, 03 May 2013 13:39:23 +0000</pubDate>
<dc:creator>Purnama Alamsyah</dc:creator>
<guid>http://purnamaalamsyah.wordpress.com/2013/05/03/mencoba-kehidupan-baru-dengan-gis/</guid>
<description><![CDATA[Ternyata benar masa lalu itu akan selalu menghantui kita jika kita tidak pernah menyelesaikannya. Ad]]></description>
<content:encoded><![CDATA[Ternyata benar masa lalu itu akan selalu menghantui kita jika kita tidak pernah menyelesaikannya. Ad]]></content:encoded>
</item>
<item>
<title><![CDATA[Homeland Insecurity: After Boston, The Struggle Between Liberty and Security]]></title>
<link>http://bostonbombing2013.info/2013/05/03/homeland-insecurity-after-boston-the-struggle-between-liberty-and-security/</link>
<pubDate>Fri, 03 May 2013 13:11:17 +0000</pubDate>
<dc:creator>danielkronlid</dc:creator>
<guid>http://bostonbombing2013.info/2013/05/03/homeland-insecurity-after-boston-the-struggle-between-liberty-and-security/</guid>
<description><![CDATA[Massimo Calabresi and Michael Crowley | TIME | 1 May 2013 But the new guidelines also featured added]]></description>
<content:encoded><![CDATA[<p>Massimo Calabresi and Michael Crowley &#124; TIME &#124; 1 May 2013</p>
<blockquote><p>But the new guidelines also featured added restrictions on an especially sensitive area of FBI counterterrorism work: mosques. Under the new rules, agents could no longer enter a religious organization without special new approval—in some cases directly from FBI headquarters.</p></blockquote>
<p><a title="SOURCE @ time.com" href="http://swampland.time.com/2013/05/01/homeland-insecurity-after-boston-the-struggle-between-liberty-and-security" target="_blank">SOURCE @ swampland.time.com</a></p>
]]></content:encoded>
</item>
<item>
<title><![CDATA[Thomson Reuters: Riset Indonesia Sedikit, Tetapi Berkualitas Tinggi &amp; Dikutip Ilmuwan Dunia]]></title>
<link>http://sainsfilteknologi.wordpress.com/2013/05/03/thomson-reuters-riset-indonesia-sedikit-tetapi-berkualitas-tinggi-dikutip-ilmuwan-dunia/</link>
<pubDate>Fri, 03 May 2013 08:40:43 +0000</pubDate>
<dc:creator>sainsfilteknologi</dc:creator>
<guid>http://sainsfilteknologi.wordpress.com/2013/05/03/thomson-reuters-riset-indonesia-sedikit-tetapi-berkualitas-tinggi-dikutip-ilmuwan-dunia/</guid>
<description><![CDATA[Reblogged from Indonesia Proud: Tidak dapat dipungkiri bahwa jumlah publikasi penelitian Indonesia m]]></description>
<content:encoded><![CDATA[<div class="reblog-post"><p class="reblog-from"><img alt='' src='http://2.gravatar.com/avatar/51b5784385713439fad3188e69f87477?s=25&amp;d=identicon&amp;r=G' class='avatar avatar-25' height='25' width='25' /> <a href="http://indonesiaproud.wordpress.com/2012/04/30/thomson-reuters-riset-indonesia-sedikit-tetapi-berkualitas-tinggi-dikutip-ilmuwan-dunia/">Reblogged from Indonesia Proud:</a></p><div class="wpcom-enhanced-excerpt"><div class="wpcom-enhanced-excerpt-content"><a href="http://indonesiaproud.wordpress.com/2012/04/30/thomson-reuters-riset-indonesia-sedikit-tetapi-berkualitas-tinggi-dikutip-ilmuwan-dunia/" target="_self"><img src="http://s0.wp.com/imgpress?url=http%3A%2F%2Fwww.mediabistro.com%2Fprnewser%2Ffiles%2Foriginal%2Fthompson_reuters_logo3.jpg" alt="Click to visit the original post" class="size-full" /></a>

<p>Tidak dapat dipungkiri bahwa jumlah <em></em>publikasi penelitian Indonesia memang rendah. Namun, ternyata kualitas riset Indonesia tergolong unggul.</p>
<p>Hal itu terlihat dari hasil analisis <a title="Thomson Reuters: Riset Indonesia Sedikit, Tetapi Berkualitas Tinggi &amp; Dikutip Ilmuwan Dunia" href="http://indonesiaproud.wordpress.com/2012/04/30/thomson-reuters-riset-indonesia-sedikit-tetapi-berkualitas-tinggi-dikutip-ilmuwan-dunia/">Thomson Reuters</a>, sumber informasi intelijen terkemuka di dunia untuk perusahaan dan para profesional, yang disampaikan dalam konferensi pers di Jakarta, pada 26 April 2012.</p>
<p>Diketahui, secara ranking, jumlah publikasi riset Indonesia tergolong kedua terendah se-Asia Tenggara. Berdasarkan jumlah publikasi, negara yang paling produktif dalam riset berturut-turut adalah Singapura, Malaysia, dan Thailand.</p>
</div> <p class="read-more"><a href="http://indonesiaproud.wordpress.com/2012/04/30/thomson-reuters-riset-indonesia-sedikit-tetapi-berkualitas-tinggi-dikutip-ilmuwan-dunia/" target="_self"><span>Read more&hellip;</span> 205 more words</a></p></div></div><div class="reblogger-note"><div class='reblogger-note-content'>
Riset Indonesia
</div></div>]]></content:encoded>
</item>
<item>
<title><![CDATA[How Murdoch, Bill Gates and Big Corporations Are Data Mining Our Schools]]></title>
<link>http://blnnews.com/2013/05/02/how-murdoch-bill-gates-and-big-corporations-are-data-mining-our-schools/</link>
<pubDate>Fri, 03 May 2013 00:11:33 +0000</pubDate>
<dc:creator>web staff</dc:creator>
<guid>http://blnnews.com/2013/05/02/how-murdoch-bill-gates-and-big-corporations-are-data-mining-our-schools/</guid>
<description><![CDATA[In case you didn&#8217;t know, Unit 5 and Dist 87  were test schools for this plan.  Only 4 others i]]></description>
<content:encoded><![CDATA[<p><strong>In case you didn&#8217;t know, Unit 5 and Dist 87  were test schools for this plan.  Only 4 others in the country!</strong></p>
<p>Posted on <a title="9:41 am" href="http://stopcommoncoreillinois.org/2013/05/01/how-murdoch-bill-gates-and-big-corporations-are-data-mining-our-schools/" rel="bookmark">May 1, 2013</a>by <a title="View all posts by ilpatriot" href="http://stopcommoncoreillinois.org/author/ilpatriot/" rel="author">ilpatriot</a></p>
<div>
<p>The Indypendent -</p>
<p>BY</p>
<p><a href="http://www.indypendent.org/authors/peter-rugh">PETER RUGH</a></p>
<p>Last week, students across New York finished a set of tests taken over a two week period designed to measure their proficiency at reading and math against new federal college readiness standards known as <a href="http://www.corestandards.org/">Common Core</a>. Some parents opted their children <a href="http://fsrn.org/audio/new-york-parents-urge-boycott-high-stakes-testing-public-schools/11865">out of the exams</a> in protest against what they described as the school system’s over-emphasis on testing and its use of data as the principle indicator of their children’s achievement.</p>
<p>Starting next year, those scores, along with students’ personal information – race, economic background, report cards, discipline records and personal addresses – will be stored in a database designed by Wireless Generation, a subsidiary of media mogul Rupert Murdoch’s News Corporation.</p>
<p>That’s right, Rupert Murdoch can read your child’s report card anytime he likes and he knows where your kid is sleeping. The database will be managed by inBloom inc, a non-profit outfit that, like Wireless Generation, is under the domain of billionaire Bill Gates – who, together with the Carnegie Corporation and other philanthropic organizations, set up the company via his Bill and Melinda Gates Foundation.</p>
<p>inBloom is receiving $50 million for their services from the New York Education Department through a contract awarded last fall. Data analyzing firms, educational software designers and other third-party venders, both for and not-for-profit, will be granted access to student information.</p>
<p>New York is not alone in turning to student data tracking system to measure performance. Some 200,000 U.S. teachers use Wireless Generation software as part of a national trend in which education administrators are increasingly turning to data analysis to grasp why America’s pupils are flunking when <a href="http://www.nytimes.com/2012/12/11/education/us-students-still-lag-globally-in-math-and-science-tests-show.html">compared to the rest of the world</a>.</p>
<p>“I am a deep believer in the power of data to drive our decisions,” <a href="http://www.ed.gov/news/speeches/robust-data-gives-us-roadmap-reform">said</a> U.S. Education Secretary Arne Duncan shortly after his appointment to the post in 2008. “Data gives us the roadmap to reform. It tells us where we are, where we need to go, and who is most at risk.”</p>
<p>But the consolidation of individual student information has been raising eyebrows — and sparking a backlash. <strong>The Electronic Privacy Information Center <a href="http://www.washingtonpost.com/blogs/answer-sheet/wp/2013/03/13/lawsuit-charges-ed-department-with-violating-student-privacy-rights/">is suing</a> Duncan’s Education Department for amending privacy regulations in 2011 that allow student data to be accessed for non-educational objectives without informing parents — a violation, EPIC contends, of the Family Educational Rights Privacy and Privacy Act.</strong></p>
<p><strong>According to inBloom’s <a href="https://www.inbloom.org/privacy-security-policy">privacy policy</a>, the company is not responsible for security breaches; though it will “use reasonable administrative, technical, and physical safeguards to ensure student records are kept private,” inBloom “cannot guarantee the security of the information stored in inBloom or that the information will not be intercepted when it is being transmitted.”</strong></p>
<p><a href="http://www.indypendent.org/2013/04/30/how-murdoch-bill-gates-and-big-corporations-are-data-mining-our-schools"> Read more</a></p>
<div id="jp-post-flair">
<div>
<div>
<h3>Share this:</h3>
<div></div>
</div>
</div>
</div>
</div>
]]></content:encoded>
</item>
<item>
<title><![CDATA[Algorithms Every Data Scientist Should Know: Reservoir Sampling]]></title>
<link>http://gdfm.me/2013/05/02/algorithms-every-data-scientist-should-know-reservoir-sampling/</link>
<pubDate>Thu, 02 May 2013 19:29:37 +0000</pubDate>
<dc:creator>Gianmarco De Francisci Morales</dc:creator>
<guid>http://gdfm.me/2013/05/02/algorithms-every-data-scientist-should-know-reservoir-sampling/</guid>
<description><![CDATA[Say you have a stream of items of large and unknown length that we can only iterate over once. Creat]]></description>
<content:encoded><![CDATA[<p><em style="box-sizing:border-box;color:#505050;font-family:'Helvetica Neue', Helvetica, Arial, sans-serif;background-color:#ffffff;">Say you have a stream of items of large and unknown length that we can only iterate over once. Create an algorithm that randomly chooses an item from</em><span style="color:#505050;font-family:'Helvetica Neue', Helvetica, Arial, sans-serif;background-color:#ffffff;"> </span><em style="box-sizing:border-box;color:#505050;font-family:'Helvetica Neue', Helvetica, Arial, sans-serif;background-color:#ffffff;">this stream such that each item is equally likely to be selected.</em></p>
<p><a href="http://blog.cloudera.com/blog/2013/04/hadoop-stratified-randosampling-algorithm/">Algorithms Every Data Scientist Should Know: Reservoir Sampling</a></p>
]]></content:encoded>
</item>
<item>
<title><![CDATA[IRS to Spy on Our Shopping Records, Travel, Social Interactions, Health Records and Files from Other Government Investigators:More and More People Are Staring Into Our Fishbowl ]]></title>
<link>http://parserjets.wordpress.com/2013/05/02/irs-to-spy-on-our-shopping-records-travel-social-interactions-health-records-and-files-from-other-government-investigatorsmore-and-more-people-are-staring-into-our-fishbowl/</link>
<pubDate>Thu, 02 May 2013 19:11:41 +0000</pubDate>
<dc:creator>parserjet</dc:creator>
<guid>http://parserjets.wordpress.com/2013/05/02/irs-to-spy-on-our-shopping-records-travel-social-interactions-health-records-and-files-from-other-government-investigatorsmore-and-more-people-are-staring-into-our-fishbowl/</guid>
<description><![CDATA[&nbsp; The IRS is joining the fun. U.S. News and World Report notes today: [The IRS] will use in rob]]></description>
<content:encoded><![CDATA[&nbsp; The IRS is joining the fun. U.S. News and World Report notes today: [The IRS] will use in rob]]></content:encoded>
</item>
<item>
<title><![CDATA[Information Overload - The True Cost of Data]]></title>
<link>http://piecubed.co.uk/2013/05/02/information-overload/</link>
<pubDate>Thu, 02 May 2013 16:29:12 +0000</pubDate>
<dc:creator>salman128</dc:creator>
<guid>http://piecubed.co.uk/2013/05/02/information-overload/</guid>
<description><![CDATA[In the last few years we have produced more data than in all of human history. We live our lives con]]></description>
<content:encoded><![CDATA[<p><img class="aligncenter" alt="" src="http://lincolnshirecvs.org.uk/wp-content/uploads/2013/04/data.jpg" width="776" height="315" /></p>
<p>In the last few years we have produced more data than in all of human history. We live our lives constantly producing a stream of data, it controls our lives, not in a Matrix or Skynet kind of way, but every time we interact (text, call, tweet), conduct a transaction, perform an internet search, complete a national census or even simply give birth or die, you are creating data and contributing that in the right hands is valuable and powerful tool. A recent <a href="http://www.emc.com/collateral/analyst-reports/idc-the-digital-universe-in-2020.pdf" target="_blank">EMC study</a> claims that less than 1% of global data is actually analysed.</p>
<p><!--more--></p>
<p>With smarter data mining algorithms and more powerful computers, data mining on a mega scale is becoming a reality, an era of Big Data. This is a dataset so large that it holds many potential secrets about us and the universe beyond. We like to think that as individuals we are in control of our own destinies, and on an individual scale I believe we are. But on a larger scale, human behaviour has shown to become regular and predictable. Discovering patterns in a chaotic dataset of human behavior is becoming a reality and its potential is beginning to be exploited. It is being used everywhere from the world of finance, in determining what commodities and stocks will be effected by world events, to the routes driven by police on the beat to increase their chances of <a href="http://www.bbc.co.uk/news/technology-22008497" target="_blank">encountering crime</a>. Google is already able to predict who you are, targeting specific adverts to you based on your search results, interests, email keywords, check ins etc.</p>
<div class="wp-caption alignleft" style="width: 224px"><img alt="" src="http://upload.wikimedia.org/wikipedia/commons/1/15/Graunt_Observations.jpg" width="214" height="300" /><p class="wp-caption-text">John Graunt&#8217;s publication &#8211; Bills of Mortality</p></div>
<p>Big Data is hardly a new idea; it is essentially searching for patterns in sets of data and turning it into useful information. This has been done since the days of the Bubonic Plague when in 1662 John Graunt analysed masses of public data on death records to track and thus predict the spread of the plague. Tracking the spread of a modern epidemic in real time is now well within the realms of possibility, add to this other data such as flight records, seasonal tourism, bird migratory patterns and you have a huge potential untapped database that is ready to be exploited.</p>
<p>Big data is an integral part astronomical research too. Astronomy has traditionally been a science founded on data hunting, ever since the Babylonians recorded star positions and predicted their patterns, distinguishing stars from planets. Vast amounts of data are now gathered from telescopes and satellites worldwide. For example, the <a href="http://piecubed.co.uk/2013/03/18/listening-to-jupiter/" target="_blank">Radio Jove</a> project is adding to all the data recorded on Jupiter and waiting to be utilised. But this is just a single planet in our solar system, a planet we have been observing for centuries. Looking deeper into space (and there is a lot of it!), requires a lot more searching, the <a href="http://www.almaobservatory.org/" target="_blank">Alma telescope array</a> in Chile, for example, will record 30 Tb of data per second! The challenge will be to organise and make use of this data and hopefully gain a lot more detail about each one of those stars and beyond what the Babylonians tracked across the skies millennia ago.</p>
<div class="wp-caption alignright" style="width: 240px"><img alt="" src="http://farm5.staticflickr.com/4032/4667376531_92b6cb2e37_z.jpg" width="230" height="173" /><p class="wp-caption-text">Scrambled data anyone? Humpty-dumpty before the great fall was low entropy, If only all the kings horses and all the kings men could piece humpty-dumpty together again</p></div>
<p style="text-align:justify;"><span style="text-align:justify;">But what is information? Information is real. It is a subtle concept; inherently it is order from disordered data and subject to the laws of physics as anything else. What does this mean? It means that it is an inseparable part of the physical world. Here’s where the science comes in. Science says that over time the disorder of the universe increases, its entropy. Entropy is a measure of chaos, the degree of disorder. The entropy of the universe is always increasing since the moment of the big bang. It&#8217;s the reason why a hot cup of coffee doesn&#8217;t get hotter. It would be absorbing heat from its surrounding, making its surroundings colder and </span><span style="text-align:justify;">decreasing the entropy. This </span><span style="text-align:justify;">would make the molecules more ordered, intuitively we know this is ludicrous. Now picture a carton of eggs falling and smashing, if I were to say that it is possible to put the eggs back together given nothing but information on the eggs, their spatial coordinates in relation to a fixed point, it would be difficult, but given enough data, certainly possible. With large enough dataset, perhaps.</span></p>
<p>This is similar to a scenario first dreamt up by James Maxwell in a famous thought experiment that hypothetically violated the second law of thermodynamics. He thought about a container full of gas molecules at equilibrium, divided into two sections by an insulated partition. There is a small window that can be open and closed (by Maxwell’s Demon) to allow the faster than average molecules to the right side of the partition and let the slower ones move to the left. This will over time mean one side of the chamber heats up while the other cools down, since average molecular speed is proportional to temperature. Ordering the system – decreasing its entropy, done with nothing more than the information about the velocity of the molecules and seemingly violating the second law of thermodynamics.</p>
<div class="wp-caption aligncenter" style="width: 618px"><img alt="" src="http://upload.wikimedia.org/wikipedia/commons/8/8b/Maxwell%27s_demon.svg" width="608" height="224" /><p class="wp-caption-text">Maxwell&#8217;s thought experiment, sadly he didn&#8217;t live long enough to see his demon exorcised</p></div>
<p>Now back to the carton of eggs. Think of the enormous amount of information, how many bits, you need to describe the smashing of a carton of eggs. To describe every interaction between the trillions of atoms requires an unimaginable amount of data. This has to be stored somewhere, be it a piece of paper, hard drive or sequence of knots in a rope. Crucially, we only have a finite storage capacity and to continue recording our process, memory has to be deleted and replaced. Similarly with Maxwell’s Demon, it would need to ‘forget’ information it had on the moving molecules after each operation. The erasure of information carries a thermodynamic penalty, it actually expends energy. Thus Maxwell&#8217;s Demon was proven to be impossible, the expending of energy creates more entropy than the ordering process of the molecules. NOTHING can violate the 2nd law of Thermodynamics.</p>
<div id="attachment_1091" class="wp-caption alignright" style="width: 220px"><img class=" wp-image-1091 " alt="1 bit of data, the smallest fundamental unit of data" src="http://piecubed.files.wordpress.com/2013/05/data.png?w=210&#038;h=160" width="210" height="160" /><p class="wp-caption-text">1 bit of data, the smallest fundamental unit of data</p></div>
<p style="text-align:justify;">Even forgetting something takes a little energy &#8211; the Landauer limit. It is extremely small, 2.85&#215;10-25 Joules! This is the amount of energy that is required to delete or reset 1 bit of data from the universe, disorder from the order of information. This Erasure of information is an irreversible process that increases the entropy of the universe. It explains why we cannot reverse the cooling of a cup of coffee on a table, which would amount to obtaining free energy as simply unlimited infinite memory storage does not exist.</p>
<p>-Salman</p>
<p>Further reading:</p>
<p>Nature: <a href="http://www.nature.com/news/the-unavoidable-cost-of-computation-revealed-1.10186" target="_blank">The unavoidable cost of computing revealed </a></p>
<p>When a Good Theory meets a Bad Idealization: <a href="http://www.pitt.edu/~jdnorton/Goodies/Idealization/index.html" target="_blank">The Failure of the Thermodynamics of Computation</a></p>
]]></content:encoded>
</item>
<item>
<title><![CDATA[American sheeple clamor for police state]]></title>
<link>http://dirtybabylon.com/2013/05/01/american-sheeple-clamor-for-police-state/</link>
<pubDate>Wed, 01 May 2013 14:27:21 +0000</pubDate>
<dc:creator>truthman2012</dc:creator>
<guid>http://dirtybabylon.com/2013/05/01/american-sheeple-clamor-for-police-state/</guid>
<description><![CDATA[Americans overwhelmingly favor installing video surveillance cameras in public places, judging the i]]></description>
<content:encoded><![CDATA[<p>Americans overwhelmingly favor installing video surveillance cameras in public places, judging the infringement on their privacy as an acceptable trade-off for security. There are lingering questions about the role of the nation’s intelligence agencies before the recent attacks, with people divided about whether they had collected information that could have prevented them, 41 percent said they had. <a href="http://www.nytimes.com/2013/05/01/us/poll-finds-strong-acceptance-for-public-surveillance.html?pagewanted=2&#38;ref=us" target="_blank">http://www.nytimes.com/2013/05/01/us/poll-finds-strong-acceptance-for-public-surveillance.html?pagewanted=2&#38;ref=us</a> Receptiveness to cameras on street corners reflects a public that regards terrorism as a fact of life in the United States, 9 out of 10 people polled said Americans would always have to live with the risk. A majority, 53 percent, said the suspects had links to a larger terrorist group. Sixty-six percent said information about how to make explosives should not be allowed on the Internet, where it would be available to aspiring terrorists, even if some would view that as a form of censorship. Only 20 percent of people said they believed the government had gone too far in restricting civil liberties in the fight against terrorism. “I know some people are paranoid about the government intruding on their privacy,” Polls taken in the aftermath of terrorist attacks often show spikes in the public’s fears of another attack. Forty-nine percent said the risk of terrorism had risen in the United States because of legal immigration. A large majority of those polled, 72 percent, said they did not plan to avoid large public events to reduce their exposure to potential terrorist attacks.</p>
<p>MUST SEE VIDEO:  <a href="http://video.pbs.org/video/2365004424" target="_blank">http://video.pbs.org/video/2365004424</a></p>
<p>Kathy Buffington, a retired teacher from Rochester, N.Y., said she was rattled by the images of a locked-down Boston, even if it was warranted in this case. But she said that in a country dealing with the threat of terrorism since the September 2001 attacks, the fight against it should not be a pretext for more pervasive forms of surveillance. “wiretapping without a warrant goes too far, now that the immediate 9/11 crisis is over.”</p>
]]></content:encoded>
</item>
<item>
<title><![CDATA[#TCDisrupt Startup Battlefield Day 2:  My Top 3]]></title>
<link>http://neilcarty.wordpress.com/2013/04/30/tcdisrupt-startup-battlefield-my-top-3/</link>
<pubDate>Wed, 01 May 2013 03:08:58 +0000</pubDate>
<dc:creator>Neil S. Carty</dc:creator>
<guid>http://neilcarty.wordpress.com/2013/04/30/tcdisrupt-startup-battlefield-my-top-3/</guid>
<description><![CDATA[Day 2 at TechCrunch Disrupt&#8217;s Battlefield featured 3 new categories; Layers of Experience, New]]></description>
<content:encoded><![CDATA[<p>Day 2 at TechCrunch Disrupt&#8217;s Battlefield featured 3 new categories; <em>Layers of Experience, New Marketplaces </em>and <em>Mobile First.  </em>Here are the three startups that caught my attention.</p>
<p><strong>Enigma: <a href="http://www.enigma.io" rel="nofollow">http://www.enigma.io</a> @enigma_io </strong>is a search and discovery platform for public data.  It makes it easy to analyze data from more than 100,000 publicly available data sources, which while out there is often tough for those who need it to locate it.  It&#8217;s already being used by journalists from the New York Times to break stories, financial firms, educational institutions and large companies in their business intelligence efforts.  Enigma puts this data into an easily sliceable system to glean insights from assuming you have the knowledge workers in place that know how to manipulate and extrapolate from it.  The company also has a browser plug that when clicked highlights these sources when viewing a web page.  For instance, efforts in the CSR space from a brand page.</p>
<p>This tool has tremendous potential for companies to identify what their competitors are doing in various areas assuming the data is public.  It&#8217;s significantly disrupting issues of infrastructure and content.  From a brand perspective this could be a huge help in gleaning insights in that up front situation analysis process.</p>
<p><strong>HealthyOut</strong>: <strong><a href="http://www.healthyout.com" rel="nofollow">http://www.healthyout.com</a> @healthyout</strong> provides users with personalized menus of food from local restaurants that&#8217;s set up as a subscription delivery service to help them lose weight or live healthier lifestyles. For a $28 monthly fee, the company taps into local restaurant kitchens to your home or office as if you had a personal nutritionist.  The cost of the food is an additional cost dependent on the local restaurant.  Their differentiator is that they go beyond the boxed or prepared meal service instead tapping in pre-existing menus but providing personalized instructions to the restaurant based on the user&#8217;s pre-set criteria.  They company has already raised a $1MM A round and is operating at 200K users in their NYC only beta.</p>
<p>I really like this company for brands with small business efforts, local chains who might be able to integrate healthy menu options into the product or those activating in the health vertical.</p>
<p><strong>Talkz</strong>: <strong><a href="http://www.talkz.com" rel="nofollow">http://www.talkz.com</a></strong> With Kik Voxxer, What Now and others, the messaging space is an admittedly crowded category but Talkz has a few tricks up their sleeves to differentiate themselves.  What the messaging service combines text, voice, image capture, doodle and other features to enhance the messaging experience.  Probably the most novel is what they call &#8220;voice cloning.&#8221; Over time the technology uses iSpeech technology to learn your voice so the next time you text, the recipient can hear the message in you own voice.  They&#8217;re even cloning celebrities (with a number on-board TBA) to make the experience more fun.  During their pitch they used Barack Obama&#8217;s voice to demonstrate a call.  The great thing is that recipients don&#8217;t need to have the app downloaded to hear the message but the company is hoping some of the viral factors innate to the product will create service switchers.  The app is currently only available iOS.</p>
<p>From a brand perspective this is a pretty interesting way to integrate celebrity endorsements into messaging as well as branded backgrounds for photos but all of these ideas are just conceptual at the moment.</p>
]]></content:encoded>
</item>
<item>
<title><![CDATA[Electronic Fingerprints]]></title>
<link>http://theinnerauditor.com/2013/04/30/electronic-fingerprints/</link>
<pubDate>Tue, 30 Apr 2013 18:39:46 +0000</pubDate>
<dc:creator>cwl890</dc:creator>
<guid>http://theinnerauditor.com/2013/04/30/electronic-fingerprints/</guid>
<description><![CDATA[Media coverage of the details of the SEC’s current investigation of corporate boards for insider tra]]></description>
<content:encoded><![CDATA[<p><a href="http://theinnerauditor.com/2013/04/30/electronic-fingerprints/diceandchips/" rel="attachment wp-att-791"><img class="alignleft size-full wp-image-791" alt="DiceAndChips" src="http://theinnerauditor.files.wordpress.com/2013/04/diceandchips.gif?w=190&#038;h=150" width="190" height="150" /></a>Media coverage of the details of the SEC’s current investigation of corporate boards for insider trading  brings to mind that in all such cases today, the most telling evidence investigators will have available to them will be a wide variety of “electronic fingerprints,” in the form of e-mails, instant messages, voice mails and transaction records to name but a few.  What legal and financial implications exist for corporations as a result of the behavior of their executives, employees and contractors that is documented in this way?  What prudent steps can fraud examiners and other compliance professionals help their client’s take to minimize this ever growing exposure?</p>
<p>In this electronic goldfish bowl environment, we need to advise our client managements that proactive steps have to be taken to minimize the possibility that an employee will do something to put the company at risk, whether or not the risk has a direct business consequence, such as a major regulatory investigation, a major financial loss or the possibility of a lawsuit.  Second, and related, organizations must make sure that the effect of any lawsuit is minimized in the event that one occurs.</p>
<p>A key factor in managing the risk from employee misbehavior, and underlined by the SEC’s board member investigation, is to have in place an effective financial and general compliance system to monitor the behavior of employees to ensure that their behavior complies with company policy.  Eying this opportunity, technology vendors have flooded the market with software in recent years to help firms store, manage and retrieve the necessary files to identify potential and actual misbehavior.  While many of these solutions allow the buyer to check the compliance box on the compliance score sheet, the stored data is, in reality, a treasure trove for regulators and adversary attorneys building a case while it does little to actually identify and control employee misbehavior.</p>
<p>To truly mitigate this type of risk, it’s necessary to be able to analyze patterns in communication and organizational behavior in ways that go beyond basic keyword filtering and associated counting and statistics.  Advanced analysis and understanding of behavior patterns form the basis for identifying the type of anomalous behavior characteristic of insider trading and other fraud scenarios that merits further investigation.</p>
<p>These same technologies can be used for retrospective investigation (internal or external).  They can also be applied to records retention, where a smarter approach can better identify materials that can be legally and safely purged.  As a result of storing fewer records, storage costs are decreased and, in the event of a lawsuit or regulatory investigation, there are fewer items for the opposing attorneys or regulators to review, resulting in lower costs and decreased risk of untoward information coming to light.</p>
<p>Remember that your client’s existing data constitutes a baseline for evaluating “normal” corporate behavior.  As such it addresses one of the most challenging issues for enterprise monitoring: how to keep an eye on a vast volume of data.  Knowing what is normal makes it possible to identify and focus on behaviors, like that of the board members under investigation, that are outside the norm.  Analysis of such data in defending against an investigation or lawsuit is also useful since the client can prove that the alleged offenses perpetrated by a limited group of employees are not typical of its usual practice or ethical culture.</p>
<p>Despite the current slow economic recovery, now is not the time to place investment in employee compliance monitoring on the back burner.  Such a response can put an organization in an even more vulnerable position.</p>
]]></content:encoded>
</item>
<item>
<title><![CDATA[How lawyers are mining the information mother lode for pricing, practice tips and predictions - ABA Journal]]></title>
<link>http://lennyesq.wordpress.com/2013/04/30/how-lawyers-are-mining-the-information-mother-lode-for-pricing-practice-tips-and-predictions-aba-journal/</link>
<pubDate>Tue, 30 Apr 2013 18:33:08 +0000</pubDate>
<dc:creator>lennyesq</dc:creator>
<guid>http://lennyesq.wordpress.com/2013/04/30/how-lawyers-are-mining-the-information-mother-lode-for-pricing-practice-tips-and-predictions-aba-journal/</guid>
<description><![CDATA[The day is here: Big data, loosely defined as the computer analysis of torrents of information to fi]]></description>
<content:encoded><![CDATA[<h3></h3>
<div>
<div>
<div></div>
<div>
<p>The day is here: Big data, loosely defined as the computer analysis of torrents of information to find hidden gems of insight, is slowly transforming the way law is practiced in the U.S.</p>
<p>Law firms are using <a title="Big data" href="http://en.wikipedia.org/wiki/Big_data" rel="wikipedia">big data</a> to identify which cases will be easy slam dunks and those that are air balls. They&#8217;re relying on the technology to get a read on what other law firms are charging, so they can adjust their rates accordingly. And big data is also popping up in law firm human resources departments, where tech-savvy department heads are crunching data on potential new hires in the hopes of coming up with recruits who are truly a good fit.</p>
<p>&#160;</p>
<p><a href="http://www.abajournal.com/magazine/article/the_dawn_of_big_data?utm_source=maestro&#38;utm_medium=email&#38;utm_campaign=tech_monthly">Read the entire article here</a>.</p>
<p>&#160;</p>
<p>***</p>
<p>I suppose its only price-fixing and a violation of anti-trust laws when a couple of solos and small firm practitioners talk about charges&#8211;not when the big folks do some data mining?</p>
<p>&#160;</p>
<p>&#160;</p>
</div>
<p>Related articles</p>
<ul>
<li><a href="http://stevesammons.me/2013/04/28/what-are-you-doing-about-big-data-that-is-a-question-making-the-rounds-with-retailers-today/">What are you doing about &#8220;big data?&#8221; That is a question making the rounds with retailers today.</a> (stevesammons.me)</li>
<li><a href="http://statsburgh.com/2013/03/13/powered-analytics-harnessing-the-power-of-big-data/">Powered Analytics harnessing the power of &#8220;big data&#8221;</a> (statsburgh.com)</li>
<li><a href="http://www.irishtimes.com/news/how-making-sense-of-big-data-could-create-jobs-in-ireland-1.1376372">How making sense of big data could create jobs in Ireland</a> (irishtimes.com)</li>
</ul>
</div>
</div>
]]></content:encoded>
</item>
<item>
<title><![CDATA[Researchers have created a 21st century global mood ring with data mining]]></title>
<link>http://gigaom.com/2013/04/30/researchers-have-created-a-21st-century-global-mood-ring-with-data-mining-2/</link>
<pubDate>Tue, 30 Apr 2013 16:00:26 +0000</pubDate>
<dc:creator>Amanda Alvarez</dc:creator>
<guid>http://gigaom.com/2013/04/30/researchers-have-created-a-21st-century-global-mood-ring-with-data-mining-2/</guid>
<description><![CDATA[After your morning stock market and weather updates, maybe add a check of the hedonometer to your li]]></description>
<content:encoded><![CDATA[<p>After your morning stock market and weather updates, maybe add a check of the hedonometer to your list. The new site draws on tweets &#8212; and soon, the New York Times, Google Trends, and other sources of textual sentiment &#8212; to gauge population-level happiness. This big data approach is taking the collective mood temperature across space and time, but it’s unlikely to reveal the secret to achieving happiness.</p>
<p>Launching today and updated every 24 hours with faster refresh rates to come, <a href="http://hedonometer.org/">Hedonometer.org</a> uses English-language tweets to create a happiness index. The system is based on a 10,000-word strong “emotional temperature” database, where words are ranked on a scale of 1-9 by volunteers using Amazon’s Mechanical Turk. Words like “laughter,” “happiness,” and “love” top the list, while “loneliness,” “bad,” “inflation,” and “surgery,” along with assorted expletives, round out the bottom, with rankings close to 1. The emoticon “:(“ has a rating of 2.36.</p>
<p>Users can zoom in on any day all the way back to September 10, 2008, check out the balance of positive and negative words, and see how these compare to the week before and after. Saturdays, for example, tend to be happier than Tuesdays. Christmas Day stands out as being the happiest day of the year, every year. The hedonometer developers, mathematicians from the University of Vermont along with scientists from the MITRE Corporation, found that April 15, the day of the Boston bombings, was the unhappiest day on record, with an average happiness index of 5.88. Other recent sad days include December 14 last year (Newtown school shooting) and June 25, 2009 (death of Michael Jackson).</p>
<p>Indeed, eyeballing the global happiness index suggests a slight downward slope since 2008. Whether or not this effect is real depends on establishing a normal background happiness level, and comparison with geographic, socioeconomic, and political metrics. What’s interesting is that the hedonometer is turning more than 50 million daily micro-statements into a “quantitative macro-story,” as UV’s Chris Danforth put it. Individually insignificant words and tweets swell into a collective emotional response, the blips and dips of which stand out and correlate with major events.</p>
<p><img class="aligncenter size-medium wp-image-640967" alt="hedonometer-happiness-data-mining" src="http://gigaom2.files.wordpress.com/2013/04/hedonometer-happiness-data-mining.jpg?w=300&#038;h=251" width="300" height="251" /></p>
<p>The research from Danforth and his colleagues got some <a href="http://phys.org/news/2013-04-math-pattern-analysis-twitter-users.html">press earlier this month</a>, when they reported that happiness went up the further Twitter users were from home. Other insights from the same team included the fact that obesity and happiness were inversely correlated, and that cities’ happiness scores were related to swear words, suggesting that “geoprofanity” could be a good marker for regional happiness differences.</p>
<p>The hedonometer is set to draw on more data streams soon, including blogs, news transcripts, and Bit.ly shortened links, and will be data mining in a dozen languages. Nonetheless, the happiness index will remain an aggregate measure, like a nation’s GDP, and may not have much impact in and of itself. The underlying methodology, however, is the real driver, with broad applicability to big data, whether social media-generated or not.</p>
<p><em>Image via Chris Danforth, University of Vermont</em></p>
]]></content:encoded>
</item>
<item>
<title><![CDATA[Data Mining – It's Not What Customers Think]]></title>
<link>http://potsandpansbyccg.com/2013/04/30/data-mining-its-not-what-customers-think/</link>
<pubDate>Tue, 30 Apr 2013 13:47:02 +0000</pubDate>
<dc:creator>CCGConsultingLLC</dc:creator>
<guid>http://potsandpansbyccg.com/2013/04/30/data-mining-its-not-what-customers-think/</guid>
<description><![CDATA[I know that when the public hears that their ISP is engaging in data mining that they assume this me]]></description>
<content:encoded><![CDATA[<p>I know that when the public hears that their ISP is engaging in data mining that they assume this means that the ISP is reading their emails and monitoring their website viewing. And <a class="zem_slink" title="Internet service provider" href="http://en.wikipedia.org/wiki/Internet_service_provider" target="_blank" rel="wikipedia">ISPs</a> do have the ability to do those things although I don’t know any who spy on their customers in that way.</p>
<p>I can certainly understand why data mining scares the average consumer. Supermarkets get you to sign up for their loyalty programs so that they know everything you buy from them. And I know I get a spooky feeling when I express an interest about some product in one place on the Internet and then see ads for that product pop up on Facebook or my <a class="zem_slink" title="Google Search" href="http://Google.com" target="_blank" rel="homepage">Google search</a>.</p>
<p>But data mining is a valuable tool and every ISP should be using it – just not in the same way that the supermarkets and Facebook do it. In fact, we probably need to come up with a better terminology for doing the things I am suggesting below.</p>
<p>There are a number of tools around that let you look at data about customer usage and these tools allow an ISP to do the following:</p>
<ul>
<li><span style="text-decoration:underline;">Spambots</span>. There is a wide array of spambots and other malware on the web that can infect customers’ computers. The worst of these, from a network perspective are spambots, which take over your customer’s computers and use it to send out spam. Most ISPs monitor email usage from their own domain and can spot when one of their users has been taken over by a spambot. But most customers these days do not use the email names and domains assigned by their ISP. Instead they web email addresses such as gmail or even the older <a class="zem_slink" title="AOL" href="http://www.aol.com" target="_blank" rel="homepage">AOL</a>. And some spambots create new email addresses that the customer doesn’t even know about. And so data mining can be used to look for customers with unusual upload traffic. No customer is going to be offended if you ask them if they are uploading traffic 24 hours per day if in the process you help eliminate Trojan horses and spambots from their computer.</li>
</ul>
<div class="wp-caption alignright" style="width: 250px"><a href="http://www.flickr.com/photos/33393684@N02/3487202382" target="_blank"><img class="zemanta-img-inserted" title="118 - Another File Sharing Session" alt="118 - Another File Sharing Session" src="http://farm4.static.flickr.com/3561/3487202382_7ea8c8d6d1_m.jpg" width="240" height="177" /></a><p class="wp-caption-text">118 &#8211; Another File Sharing Session (Photo credit: erickespinosa)</p></div>
<ul>
<li><span style="text-decoration:underline;">Web servers</span>. Most ISPs do not want a customer to be using a residential ISP account to run a commercial web server. A web server is a device that is being used to run a website or service that drives a large amount of download traffic. Such a website might be used for e-commerce for example. But far too often web servers are used to run porn sites. ISPs are not against web servers, but they do expect people who operate them to buy the proper business level service. A web server can be full 24-hours per day, and that is generally not the level of service that is intended for a shared residential product. Data mining can be used to identify web servers and the customer can be directed to a more appropriate (and appropriately priced) service.</li>
</ul>
<ul>
<li><span style="text-decoration:underline;">Data Caps</span>. Most ISPs have set some cap on the amount of usage that a customer can download in a month. And these caps do not have to be small. I have one client that has a 2 terabyte cap each month for residential downloads. But there is no sense in having a data cap if you can’t actually measure how much bandwidth each customer is using. Data mining tools are the way to measure customers’ usage.</li>
</ul>
<ul>
<li><span style="text-decoration:underline;">File sharing</span>. Most ISPs have terms of service that prohibit customers from sharing copyrighted materials with others. But realistically an ISP is not going to know what customers are sharing with each other unless you get a complaint from a copyright holder. But many ISPs still like to get a handle on file-sharing because such traffic can eat up a lot of system bandwidth. Data mining can help you identify customers who are probably involved in one of the common <a class="zem_slink" title="File sharing" href="http://en.wikipedia.org/wiki/File_sharing" target="_blank" rel="wikipedia">file sharing programs</a>.  An awful lot of file sharing is done by teenagers. I have clients who send out friendly reminders to customers who they think are file sharing that say something like: “We notice by your internet usage that you are probably running a file sharing program. We would just like to remind you that it is illegal to share copyrighted material and that there have been cases where copyright owners have gotten significant settlements by suing people who were sharing their property.” Such notices cut down on a lot of file sharing traffic as parent pressure kids into doing the right thing.</li>
</ul>
<p>So you should be data mining. But perhaps the things I have described could all better be classified as network management, a term that would not dismay your customers.</p>
]]></content:encoded>
</item>
<item>
<title><![CDATA[Citizen-only FOIA law did not violate Constitution]]></title>
<link>http://amesiowalaw.wordpress.com/2013/04/29/citizen-only-foia-law-did-not-violate-constitution/</link>
<pubDate>Mon, 29 Apr 2013 20:01:32 +0000</pubDate>
<dc:creator>Frank Feilmeyer</dc:creator>
<guid>http://amesiowalaw.wordpress.com/2013/04/29/citizen-only-foia-law-did-not-violate-constitution/</guid>
<description><![CDATA[In a relatively terse opinion issued today, the U.S. Supreme Court held that Virginia&#8217;s freedo]]></description>
<content:encoded><![CDATA[<p>In a relatively terse opinion issued today, the U.S. Supreme Court held that Virginia&#8217;s freedom of information law that limited requests to citizens did not violate the United States Constitution. <a href="http://www.supremecourt.gov/opinions/12pdf/12-17_d1o2.pdf">McBurney v. Young, No. 12-17 (filed April 29, 2013)</a>. The out-of-state petitioners brought an action under 42 U.S.C. § 1983 claiming that Virginia&#8217;s statute violated the Privileges and Immunities and Dormant Commerce clauses of the U.S. Constitution. </p>
<p>Justice Alito, writing for a unanimous court, essentially found that a state could distinguish between residents and nonresidents for purposes of determining who is entitled to receive governmental information from that state. These non-Virginia-resident petitioners sought Virginia records, and their requests were denied. One petitioner was an ex-resident and sought information about a child support matter. Another petitioner sought records for the purpose of reselling the data. </p>
<p>The Privileges and Immunities Clause provides, “The Citizens of each State [are] entitled to all Privileges and Immunities of Citizens in the several States.” U. S. Const., Art. IV, §2, cl. 1. The Court noted that the state law was designed to give residents access to their own government and that those citizens were supporting that effort with their taxes.</p>
<p>The &#8220;dormant Commerce Clause jurisprudence &#8216;significantly limits the ability of States and localities to regulate or otherwise burden the flow of interstate commerce.” (Citation omitted.) The Court rejected out of hand the idea that a state law allowing citizens access to records had any regulatory or burdensome effect on interstate commerce.</p>
<p>This will be seen as a victory for state and local governments that provide records for their citizens and want to protect against the commercialization and data-mining of public records.</p>
]]></content:encoded>
</item>
<item>
<title><![CDATA[Detection of Spyware by Mining Executable Files]]></title>
<link>http://miningexecutablefiles.wordpress.com/2013/04/29/detection-of-spyware-by-mining-executable-files/</link>
<pubDate>Mon, 29 Apr 2013 19:52:11 +0000</pubDate>
<dc:creator>Jaraad</dc:creator>
<guid>http://miningexecutablefiles.wordpress.com/2013/04/29/detection-of-spyware-by-mining-executable-files/</guid>
<description><![CDATA[A more recent type of malicious threat is represented by spyware and this threat has not been extens]]></description>
<content:encoded><![CDATA[<p>A more recent type of malicious threat is represented by spyware and this threat has not been extensively studied. The installed spyware may be capable of capturing keystrokes, taking screenshots, saving authentication credentials, storing personal email addresses and web form data, and thus may obtain behavioral and personal information about users. It may also communicate system configuration including hardware and software, system accounts, location information, and information about other aspects of the system to a third party. This can lead to financial loss, as in identity theft and credit card fraud.</p>
<p>[ShahzadHL10]<br />
Shahzad, R.; Haider, S. &#38; Lavesson, N.<br />
Detection of Spyware by Mining Executable Files<br />
Availability, Reliability, and Security, 2010. ARES &#8217;10 International Conference on, 2010, 295-302</p>
<p>Source: <a href="http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=5438080&#38;tag=1">http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=5438080&#38;tag=1</a></p>
]]></content:encoded>
</item>
<item>
<title><![CDATA[How We Learn What We Know, to Chart What We Know]]></title>
<link>http://policeledintelligence.com/2013/04/29/how-we-learn-what-we-know-to-chart-what-we-know/</link>
<pubDate>Mon, 29 Apr 2013 19:08:15 +0000</pubDate>
<dc:creator>Nick Selby</dc:creator>
<guid>http://policeledintelligence.com/2013/04/29/how-we-learn-what-we-know-to-chart-what-we-know/</guid>
<description><![CDATA[I spent an enjoyable 20 minutes today speaking with Hilary Sargent, an OSINT rock-star who had to as]]></description>
<content:encoded><![CDATA[<p><a href="http://chartgirl.com/wordpress/wp-content/uploads/2013/04/TAMERLAN_large.jpg" target="_blank"><img class="alignleft size-medium wp-image-1687" alt="hilary_thumbnail" src="http://policeledintelligence.files.wordpress.com/2013/04/hilary_thumbnail.png?w=300&#038;h=184" width="300" height="184" /></a>I spent an enjoyable 20 minutes today speaking with <a href="http://hilarysargent.com/" target="_blank">Hilary Sargent</a>, an OSINT rock-star who had to ask me what OSINT was.</p>
<p>That&#8217;s not to say that she didn&#8217;t know &#8211; I&#8217;m sure she did. But when I told her that it was Open Source INTelligence, and further explained the difference, say, between that and traditional HUMINT, she cut me off to explain that she&#8217;s a veteran investigator.</p>
<p>And elicitation specialist.</p>
<p>Now she runs the site <a href="http://chartgirl.com/" target="_blank">chartgirl.com</a>. Click the image on the left to see the full sized edition of the one that kicked off this post, or <a href="http://chartgirl.com/wordpress/wp-content/uploads/2013/04/TAMERLAN_large.jpg" target="_blank">click here</a>.</p>
<p>We&#8217;d started talking because I asked her permission to run here on PLI the graphic above, that she released today. It tracks quite clearly the who-knew-what-and-when-they-knew-it of the Tsarnaev investigation. I&#8217;d seen it online, tweeted by <a href="https://twitter.com/th3j35t3r/" target="_blank">Th3J35t3r</a> , and it was to me a fantastic vindication of what I was talking about the other day when I posted my <a href="http://policeledintelligence.com/2013/04/24/boston-bombing-investigation-intel-failure-no-bad-expectation-setting-oh-yeah/" target="_blank">looooooooooooong post </a>about why the Boston Marathon Bombings was not evidence of an intelligence failure.</p>
<p>I asked Hilary what brought her to do this kind of work &#8211; I&#8217;m not the first to do so; see a <a href="http://blogs.reuters.com/jackshafer/2013/04/24/chartgirl-boxes-the-news/" target="_blank">great piece in Reuters about her here</a>.</p>
<p>That&#8217;s when she asked me the question about OSINT. &#8220;I&#8217;ve heard that word a few times today,&#8221; she said.</p>
<p>Sargent says that she has not yet turned her charting abilities into a business, but she keeps getting stuck reading newspaper articles that really need a chart and don&#8217;t have one.</p>
<p>So at this point, her startup costs seem to be her weekends.</p>
<p>Sargent said that one of the reasons she wanted to make this chart was listening to Lindsay Graham saying that, <a href="http://www.politico.com/story/2013/04/lindsey-graham-tamerlan-tsarnaev-russia-trip-90416.html" target="_blank">but for a spellcheck</a>, the FBI woulda had him.</p>
<p>Sargent&#8217;s understanding of the intricacies of data sharing have led her to the conclusion that when it comes to interagency sharing, <em>offering</em> access is not the same thing as data sharing.</p>
<p><a href="http://policeledintelligence.files.wordpress.com/2013/04/hilary_thumbnail2.png"><img class="alignright size-medium wp-image-1690" alt="hilary_thumbnail2" src="http://policeledintelligence.files.wordpress.com/2013/04/hilary_thumbnail2.png?w=300&#038;h=170" width="300" height="170" /></a>&#8220;I was trying to show the idea that, &#8216;they should have caught this guy,&#8217; was wrong,&#8221; she told me.  If you look at her chart, which she compiled from open news sources and scholarly descriptions of the organizations and systems it describes, you can see that no agency was even remotely close to knowing enough to catch him. &#8220;Each agency knew a different thing.&#8221;</p>
<p>Sargent says she made the chart to clarify some of the inconsistencies and flat out wrong statements in the press about even basic things, like a &#8220;Watch List&#8221;.</p>
<p>Mainly, though, she said, she did it because it needed to be done and no one else had done it.</p>
<p>&#8220;I start with a similar pencil and paper version,&#8221; she told me. &#8220;I know kind of the rough basics, like that TSA and Customs are part of DHS; I didn&#8217;t know about TIDE or a lot of these other more arcane databases.</p>
<p>&#8220;I also didn&#8217;t know the way to structure to describe it visually. It seemed clear there wer four things that, had everyone who could have possibly known and focused on, they would have caught this guy.</p>
<p>So, if, for example, there was one senior guy at a JTTF who had spoken with the guys in Russia and with the family, and known about the son&#8217;s activity and his overseas travel, then they would have known.&#8221;</p>
<p>Sargent started mainly by reading media articles &#8211; there haven&#8217;t been a lot of documents released on this yet, anyway. She spent a ton of hours reading about the differences between several of the federal lists.</p>
<p>&#8220;Most articles did a bad job delineating between, for example, TIDE and the No Fly List and TECS. That was the most important distinction. For the media to say he was &#8220;On a watch list&#8221; is really misleading.</p>
<p>&#8220;Then I just read a couple of hundred articles, and a bunch of stuff written over the ast ten years about how all this stuff works.&#8221;</p>
<p>&#8220;I didn&#8217;t include description of 90-day window,&#8221; she said. &#8220;I try to walk a line between only charting what I completely understand; I don&#8217;t put things I don&#8217;t understand in the chart. I don&#8217;t want people from the intelligence community to look at my chart and roll there eyes; that is my goal anyway, and it is a hard thing to do.</p>
<p>&#8220;I am sure a lot of people in the IC have been reading the New York Times and rolling their eyes,&#8221; she said.</p>
<p>The real reason I am writing about her here is because of how she synthesized all these data and then produced the analysis, and basically it was straightforward OSINT work. She Googles a lot, and she has the kind of personality that lends itself to wanting to find the answer and being willing to spend the time to get it.</p>
<p>One good tip she gives, that we agree with and do, is that for things like TIDE and TECS and No Fly that you start at Wikipedia to get a basic idea, but never quote it &#8211; rather, look at the link collection at the bottom of the listing and go directly to the scholarly works and articles quoted therein. She warns that this is for researching things more than people  - unless you&#8217;re a B-list celebrity or higher, the Wikipedia pages and the links tend to be self-serving paff for people.</p>
<p>But for things? Great stuff.</p>
<p>Her one statement about cops and research is very important for analysts to remember when they speak with detectives or cops who say that they just Don&#8217;t Understand The Google.</p>
<p>&#8220;Oh yeah? Well tell him to get me a score from some game on ESPN and I guarantee you he can find it in seconds on his phone. These are the same skills.</p>
<p>&#8220;They can do it.&#8221;</p>
]]></content:encoded>
</item>
<item>
<title><![CDATA[Win-Vector Blog » Data Science, Machine Learning, and Statistics: what is in a name?]]></title>
<link>http://datascience101.wordpress.com/2013/04/29/win-vector-blog-data-science-machine-learning-and-statistics-what-is-in-a-name/</link>
<pubDate>Mon, 29 Apr 2013 15:45:48 +0000</pubDate>
<dc:creator>Ryan Swanstrom</dc:creator>
<guid>http://datascience101.wordpress.com/2013/04/29/win-vector-blog-data-science-machine-learning-and-statistics-what-is-in-a-name/</guid>
<description><![CDATA[Win-Vector Blog » Data Science, Machine Learning, and Statistics: what is in a name?. This is an exc]]></description>
<content:encoded><![CDATA[<p><a href='http://www.win-vector.com/blog/2013/04/data-science-machine-learning-and-statistics-what-is-in-a-name/'>Win-Vector Blog » Data Science, Machine Learning, and Statistics: what is in a name?</a>.</p>
<p>This is an excellent write-up for the differences between:</p>
<ul>
<li>Statistics</li>
<li>Machine Learning</li>
<li>Data Mining</li>
<li>Informatics</li>
<li>Big Data</li>
<li>Predictive Analytics</li>
<li>Data Science</li>
</ul>
]]></content:encoded>
</item>
<item>
<title><![CDATA[Questions are data too]]></title>
<link>http://skillicorn.wordpress.com/2013/04/29/questions-are-data-too/</link>
<pubDate>Mon, 29 Apr 2013 15:12:23 +0000</pubDate>
<dc:creator>skillicorn</dc:creator>
<guid>http://skillicorn.wordpress.com/2013/04/29/questions-are-data-too/</guid>
<description><![CDATA[In the followup investigation of the Boston Marathon bombings, we see again the problem that data an]]></description>
<content:encoded><![CDATA[<p>In the followup investigation of the Boston Marathon bombings, we see again the problem that data analytics has with questions.</p>
<p>Databases are built to store data. But, as Jeff Jones has most vocally pointed out, simply keeping the data is not enough in adversarial settings. You also need to keep the questions, and treat them as part of the ongoing data. The reason is obvious once you think about it &#8212; intelligence analysts need not only to know the known facts; they also need to know that someone else has asked the same question they just asked. Questions are part of the mental model of analysts, part of their situational awareness, but current systems don&#8217;t capture this part and preserve it so that others can build on it. In other words, we don&#8217;t just need to connect the dots; we need to connect the edges!</p>
<p>Another part of this is that, once questions are kept, they can be re-asked automatically. This is immensely powerful. At present, an analyst can pose a question (&#8220;has X ever communicated with Y?&#8221;), get a negative answer, only for information about such a communication to arrive a microsecond later and not be noticed. In fast changing environments, this can happen frequently, but it&#8217;s implausible to expect analysts to remember and re-pose their questions at intervals, just in case.</p>
<p>We still have some way to go with the tools and techniques available for intelligence analysis.</p>
]]></content:encoded>
</item>
<item>
<title><![CDATA[Big data as "belt and suspenders."]]></title>
<link>http://afishwithlegs.wordpress.com/2013/04/29/big-data-as-belt-and-suspenders/</link>
<pubDate>Mon, 29 Apr 2013 14:37:00 +0000</pubDate>
<dc:creator>Ken Krimstein</dc:creator>
<guid>http://afishwithlegs.wordpress.com/2013/04/29/big-data-as-belt-and-suspenders/</guid>
<description><![CDATA[I hate to keep harping on this big data thing. I know it can add clarity and understanding and in ou]]></description>
<content:encoded><![CDATA[<p>I hate to keep harping on this big data thing. I know it can add clarity and understanding and in our completely interconnected interwebs world it is often the only way we can have any sense of what&#8217;s going on. </p>
<p>But I&#8217;m here to tell you that big data can also be used as an excuse NOT to act. There are always more numbers to get. There are always more scenarios to evaluate. We will know more in a week. A month. A year. An hour. Five minutes.</p>
<p>Stop already!</p>
<p>Sometimes you have to just do something. I remember picking up a very useful expression from someone, &#8220;that client is wearing a belt and suspenders.&#8221; What&#8217;s it mean? It means they are beyond cautious. They want too much validation. And they look nerdy and ugly and dumb too.</p>
<p>(Now, let me very very clear on this &#8212; that was for a long lost client from a million years ago who is out of business, no relation to anyone living!!!)</p>
<p>At any rate, as someone else said, paralysis by analysis.</p>
<p>Hey, sometimes you just gotta throw the pitch. See if the other guy swings. See what happens.</p>
<p>But, alas, the blanket of numbers is too often a security blanket we won&#8217;t let go of. And in the meantime, a world of opportunity could be passing us by. Oh, and we look real nerdy too.</p>
<p>I&#8217;m just sayin&#8230;</p>
]]></content:encoded>
</item>

</channel>
</rss>
