PixelPlayer AI system picks out instruments from the mix

Check out an AI project under way at MIT’s Computer Science and Artificial Intelligence Laboratory (CSAIL).

PixelPlayer

PixelPlayer is described as a “deep-learning system” that can analyse a video of a musical performance and isolate the sounds of the individual instruments involved, making them louder or softer.
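
As a rough illustration of what “louder or softer” means once the instruments have been isolated, here is a minimal Python sketch. It assumes the hard part (the separation) has already been done and that each instrument is available as its own list of audio samples. The names `remix` and `tone` and the sample values are hypothetical and are not taken from the MIT project.

```python
import math


def remix(stems, gains):
    """Sum already-separated instrument tracks after scaling each by a gain.

    stems: dict of instrument name -> list of samples (all the same length).
    gains: dict of instrument name -> linear gain; missing names default to 1.0.
    """
    if not stems:
        return []
    length = len(next(iter(stems.values())))
    mix = [0.0] * length
    for name, samples in stems.items():
        if len(samples) != length:
            raise ValueError("all stems must have the same length")
        gain = gains.get(name, 1.0)
        for i, sample in enumerate(samples):
            mix[i] += gain * sample
    return mix


def tone(freq_hz, n_samples, sample_rate=8000):
    """Generate a sine wave as a stand-in for one separated instrument."""
    return [math.sin(2 * math.pi * freq_hz * i / sample_rate)
            for i in range(n_samples)]


# Hypothetical stems: boost the violin, turn the xylophone down.
stems = {"violin": tone(440, 8), "xylophone": tone(880, 8)}
louder_violin = remix(stems, {"violin": 2.0, "xylophone": 0.5})
```

A gain of 0.0 mutes an instrument and a gain of 1.0 leaves it unchanged, so the same function covers both soloing and rebalancing.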

It can identify the sounds of more than 20 commonly seen instruments.

“Seen” is the key word, because previous efforts to separate sounds have apparently focused exclusively on audio, which MIT says often requires extensive human labeling.

“We expected a best-case scenario where we could recognize which instruments make which kinds of sounds,” says Hang Zhao, a PhD student at CSAIL (via MIT News: http://news.mit.edu/2018/ai-editing-music-videos-pixelplayer-csail-0705).

“We were surprised that we could actually spatially locate the instruments at the pixel level. Being able to do that opens up a lot of possibilities, like being able to edit the audio of individual instruments by a single click on the video.”

PixelPlayer finds patterns in data using neural networks that have been trained on existing videos.

Neural networks

MIT says one neural network visually analyses the video, another concentrates on the audio, and a third “synthesizer” associates specific pixels with specific sound waves to separate the different sounds.

The “deep-learning” is so deep (it uses so-called “self-supervised” learning) that the MIT team doesn’t necessarily understand everything the system does when it identifies separate instruments.

It seems, however, that certain harmonic frequencies correlate with specific instruments, such as a violin, while quick pulse-like patterns correspond to instruments like the xylophone.

Environmental sounds

What are the possible non-musical applications for the technology? Zhao suggests a system like PixelPlayer could be used to better understand the environmental sounds that external objects, such as vehicles, make.

Hang Zhao is lead author of a paper co-written with MIT professors Antonio Torralba, in the Department of Electrical Engineering and Computer Science, and Josh McDermott, in the Department of Brain and Cognitive Sciences. Also involved are research associate Chuang Gan, undergraduate student Andrew Rouditchenko, and PhD graduate Carl Vondrick.

The paper will be presented at the European Conference on Computer Vision (ECCV) in Munich in September.

Thanks to Sue P. for highlighting this one.

Images: MIT CSAIL

[Via New Atlas: https://newatlas.com/pixelplayer/55348/]

Source: https://www.electronicsweekly.com/blogs/gadget-master/general/pixelplayer-ai-system-picks-instruments-mix-2018-07/