{"id":6299,"date":"2017-01-23T15:50:14","date_gmt":"2017-01-23T23:50:14","guid":{"rendered":"http:\/\/www.palada.net\/index.php\/2017\/01\/23\/news-137\/"},"modified":"2017-01-23T15:50:14","modified_gmt":"2017-01-23T23:50:14","slug":"news-137","status":"publish","type":"post","link":"https:\/\/www.palada.net\/index.php\/2017\/01\/23\/news-137\/","title":{"rendered":"\u200b\u200bDoes prevalence matter? A different approach to traditional antimalware test scoring"},"content":{"rendered":"<p>Most well-known antimalware tests today focus on broad-spectrum malware.\u00a0 In other words, tests include malware that is somewhat indiscriminate (isn&#8217;t necessarily targeted), at least somewhat prevalent and sometimes very prevalent. Typically, tests are not focused on specialized threats that are highly targeted, and most avoid including programs that walk the line between good and evil, such as adware and other programs that we call unwanted software as opposed to malware.\u00a0 Files that are in most test sets are files that antimalware vendors agree a customer would never want and are generally pervasive in the ecosystem.<\/p>\n<p>The traditional test score counts each file equally. That is, if there are 100 files, each file is worth 1% of the test.\u00a0 In the real world, however, people don&#8217;t encounter malware at exactly the same rate. Some malware is incredibly prevalent while other malware families are not as pervasive.\u00a0 Likewise, some malware might focus on certain regional demographics or languages and not affect other parts of the world.\u00a0 When it comes to real customer impact, not all malware has the same distribution or prevalence.\u00a0 Yet, they are treated as such in traditionally scored tests.<\/p>\n<h3>Collaborating to create a more applicable scoring model<\/h3>\n<p>Microsoft has been partnering with <a href=\"http:\/\/www.av-comparatives.org\/\"> <span lang=\"EN-US\" style=\"text-decoration: underline\">AV-Comparatives<\/span><\/a> to create a scoring model that incorporates prevalence to represent true customer impact.\u00a0 At Virus Bulletin (VB) this year, Peter Stelzhammer (AV-Comparatives co-founder) and I <a href=\"https:\/\/www.virusbtn.com\/conference\/vb2015\/abstracts\/StewartStelzhammer.xml\"> <span lang=\"EN-US\" style=\"text-decoration: underline\">presented<\/span><\/a> this model.\u00a0 Today, AV-Comparatives is releasing the prevalence-weighted results from the most recent file detection test in a <a href=\"http:\/\/www.av-comparatives.org\/wp-content\/uploads\/2015\/11\/avc_prevalence_201509_en.pdf\">PDF report<\/a> and also on <a href=\"http:\/\/impact.av-comparatives.org\/\">the impact section of their website<\/a>.\u00a0 This test compares detection rates of vendors against a very comprehensive malware set \u2013 166k Portable Executable (PE) files.<\/p>\n<p>After working with AV-Comparatives for many years, I have personally developed a great respect for the way they curate files for their tests. They work diligently to select files that are relevant, are not in that &#8220;unwanted&#8221; category (which vendors would lobby to dispute out of their test), and they are able to source hundreds of thousands of recent files for the test.\u00a0 That said, one thing we found is that it is incredibly difficult, if you&#8217;re using a traditional scoring model, to attempt to source a perfect number of files that represent ecosystem prevalence.<\/p>\n<p>For one, many malware families rely on non-PE components to spread. Jenxcus is a good example \u2013 its VBS (Visual Basic script) component is one of the most frequently blocked files on our customers&#8217; computers. However, its PE component is seen comparatively rarely, so it&#8217;s quite difficult to source enough Jenxcus PE files for a test to equate to that family&#8217;s ecosystem prevalence.\u00a0 Samples from some families might be easier to source than others (more willing to be found or submitted to public sources).\u00a0 These constraints make it practically impossible to select a test set that perfectly equates to the ecosystem.<\/p>\n<h3>Looking at the prevalence model<\/h3>\n<p>Enter the prevalence model.\u00a0 <a href=\"http:\/\/www.amtso.org\/\"> <span lang=\"EN-US\" style=\"text-decoration: underline\">AMTSO<\/span><\/a> through the Realtime Threat List (<a href=\"http:\/\/www.amtso.org\/rttl\/\"><span lang=\"EN-US\" style=\"text-decoration: underline\">RTTL<\/span><\/a>) has been making strides lately to encourage vendors to share malware prevalence information with testers to help testers build better test sets.\u00a0 While we have been moving toward critical mass and getting closer to the needed features to make that project work, Microsoft offered to sponsor AV-Comparatives and provide telemetry details from over 200 million computers in over 100 different countries to them to create a prevalence-weighted model.<\/p>\n<p>The following chart shows how the test set stacks up to the ecosystem (# of files in comparison to ecosystem prevalence).<\/p>\n<p><a href=\"http:\/\/www.microsoft.com\/security\/portal\/blog-images\/a\/MMPCAVC.png\"><img decoding=\"async\" src=\"http:\/\/www.microsoft.com\/security\/portal\/blog-images\/a\/MMPCAVC.png\" alt=\" In general, files selected for the most prevalent malware families (those in the high category) were underrepresented using the traditional method of scoring and those in the low category were overrepresented\" border=\"0\" \/><\/a><\/p>\n<p><em>Figure 1: \u00a0In general, files selected for the most prevalent malware families (those in the high category) were underrepresented using the traditional method of scoring and those in the low category were overrepresented.<\/em><\/p>\n<p>When you drill down a layer to the highly prevalent malware families, you can start to see why the numbers don&#8217;t line up.\u00a0 Some of the file infector families, like <a href=\"http:\/\/www.microsoft.com\/security\/portal\/threat\/encyclopedia\/Entry.aspx?Name=Win32\/Sality\"> <span lang=\"EN-US\" style=\"text-decoration: underline\">Sality<\/span><\/a> and <a href=\"http:\/\/www.microsoft.com\/security\/portal\/threat\/encyclopedia\/Entry.aspx?Name=Win32\/Virut\"> <span lang=\"EN-US\" style=\"text-decoration: underline\">Virut<\/span><\/a>, had a very large sample set (a great representation of the family in fact). However, other prevalent families, like <a href=\"http:\/\/www.microsoft.com\/security\/portal\/threat\/encyclopedia\/Entry.aspx?Name=Worm:Win32\/Gamarue\"> <span lang=\"EN-US\" style=\"text-decoration: underline\">Gamarue<\/span><\/a>, <a href=\"http:\/\/www.microsoft.com\/security\/portal\/threat\/encyclopedia\/Entry.aspx?Name=Trojan:Win32\/Dorv.A\"> <span lang=\"EN-US\" style=\"text-decoration: underline\">Dorv<\/span><\/a>, <a href=\"http:\/\/www.microsoft.com\/security\/portal\/threat\/encyclopedia\/Entry.aspx?Name=Win32\/Jenxcus\"> <span lang=\"EN-US\" style=\"text-decoration: underline\">Jenxcus<\/span><\/a>, and <a href=\"http:\/\/www.microsoft.com\/security\/portal\/threat\/encyclopedia\/Entry.aspx?Name=Win32\/Sventore\"> <span lang=\"EN-US\" style=\"text-decoration: underline\">Sventore<\/span><\/a> were underrepresented.\u00a0 Sventore was new \u2013 there was only one file to represent that family.\u00a0 Gamarue, Dorv, and especially Jenxcus didn&#8217;t have nearly enough recent PE files available for the test to allow them to equate to their ecosystem prevalence.<\/p>\n<p><a href=\"http:\/\/www.microsoft.com\/security\/portal\/blog-images\/a\/MMPCAVC2.png\"><img decoding=\"async\" src=\"http:\/\/www.microsoft.com\/security\/portal\/blog-images\/a\/MMPCAVC2.png\" alt=\"A tabulated sample of the test score impact\" border=\"0\" \/><\/a><\/p>\n<p><em>\u00a0Figure 2:\u00a0\u00a0Another\u00a0example of the\u00a0test scores\u00a0not lining up.<\/em><\/p>\n<p>The prevalence-weighted model takes into account the prevalence of the tested file, the malware family associated with the file, and the malware family&#8217;s partition (high, moderate, low, very low) to calculate each file&#8217;s impact to the test which balances the score with the actual customer impact in the ecosystem.<\/p>\n<p>For more details about the exact calculation method, you can see the <a href=\"http:\/\/www.av-comparatives.org\/wp-content\/uploads\/2015\/11\/avc_prevalence_201509_en.pdf\"> <span lang=\"EN-US\" style=\"text-decoration: underline\">AV-Comparatives report released today<\/span><\/a>.<\/p>\n<p>The charts above show how the prevalence model balanced test scores to make them more accurately represent a vendor&#8217;s detection capabilities.\u00a0 In essence, missed files were scored to represent the malware people were more likely to encounter, which is good information for consumers.\u00a0 However, prevalence-weighting the score can mean that vendors (at least those who monitor malware prevalence) might have very similar test scores.\u00a0 Therefore, additional context is probably needed to help consumers make decisions.<\/p>\n<p>Geolocation is one context we analyzed. In the <a href=\"http:\/\/www.av-comparatives.org\/wp-content\/uploads\/2015\/11\/avc_prevalence_201509_en.pdf\"> <span lang=\"EN-US\" style=\"text-decoration: underline\">report<\/span><\/a>, we broke down vendor test scores by country using each country&#8217;s malware prevalence profile. There are some examples of vendors that did great in some countries and not so great in others. Scores didn&#8217;t always line up with vendors that were co-located in the target region.\u00a0 If you&#8217;re interested in a specific country, be sure to check out AV-Comparative&#8217;s regional maps in the report.<\/p>\n<p>Organizations, especially those that might have special security concerns, might need other differentiation.\u00a0 After Peter and I presented at VB this year, we got lots of great feedback from people at the conference.\u00a0 One of the ideas we discussed was differentiating malware prevalence specifically affecting enterprises and even showing the differences between verticals.\u00a0 Other discussions centered around showing detection differences by type of threat \u2013 ransomware in comparison to information stealers, etc.<\/p>\n<p>Prevalence is but one model that provides additional insight to help people make better-informed decisions when choosing their protection provider. Through partnerships like this one with AV-Comparatives and others in the industry, productive discussions and innovative models result in even better antimalware testing and provide greater benefits to consumers and enterprises alike.<\/p>\n<p><em>Holly Stewart<\/em><\/p>\n<p><em>MMPC<\/em><\/p>\n<p><a href=\"https:\/\/blogs.technet.microsoft.com\/mmpc\/2015\/11\/23\/does-prevalence-matter-a-different-approach-to-traditional-antimalware-test-scoring\/\" target=\"bwo\" >https:\/\/blogs.technet.microsoft.com\/mmpc\/feed\/<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Most well-known antimalware tests today focus on broad-spectrum malware.\u00a0 In other words, tests include malware that is somewhat indiscriminate (isn&#8217;t necessarily targeted), at least somewhat prevalent and sometimes very prevalent. Typically, tests are not focused on specialized threats that are highly targeted, and most avoid including programs that walk the line between good and evil,&#8230;<\/p>\n","protected":false},"author":4,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"colormag_page_container_layout":"default_layout","colormag_page_sidebar_layout":"default_layout","footnotes":""},"categories":[10759,10378],"tags":[10760,10763,10764,10516,10765,10766,10761,10762],"class_list":["post-6299","post","type-post","status-publish","format-standard","hentry","category-microsoft","category-security","tag-antimalware-research-for-it-pros-and-enthusiasts","tag-antimalware-test-result","tag-does-prevalence-matter","tag-microsoft","tag-microsoft-antimalware-test-scoring","tag-prevalence","tag-windows-10","tag-windows-defender"],"_links":{"self":[{"href":"https:\/\/www.palada.net\/index.php\/wp-json\/wp\/v2\/posts\/6299","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.palada.net\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.palada.net\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.palada.net\/index.php\/wp-json\/wp\/v2\/users\/4"}],"replies":[{"embeddable":true,"href":"https:\/\/www.palada.net\/index.php\/wp-json\/wp\/v2\/comments?post=6299"}],"version-history":[{"count":0,"href":"https:\/\/www.palada.net\/index.php\/wp-json\/wp\/v2\/posts\/6299\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.palada.net\/index.php\/wp-json\/wp\/v2\/media?parent=6299"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.palada.net\/index.php\/wp-json\/wp\/v2\/categories?post=6299"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.palada.net\/index.php\/wp-json\/wp\/v2\/tags?post=6299"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}