Truemag

  • Subscribe
    • New Subscription
    • Account Updates
    • Customer Service
  • News & Events
    • News
    • Events
  • Advertise
    • Media Kit
    • Reprints
    • Contacts
  • Editorial
    • Podcasts
    • Current Articles
    • Digital Editions
    • eNewsletter
    • Editor’s Desk
    • Edit Calendar
    • Contacts
  • Buyers Guide
    • Search
    • Sponsor Index
    • Vendor Update
  • Annual Software Ranking
    • Ranking Form
    • Annual Software Ranking
    • 2018 Software Ranking File Package

Indico Launches Enso Open Source Project for Machine Learning

6.27.18

Indico, a provider of Enterprise AI solutions for unstructured content, today announced the launch of a new open source project focused on simplifying the use of transfer learning with natural language. Enso is an open-source library designed to streamline the benchmarking of embedding and transfer learning methods for a wide variety of natural language processing tasks. It provides machine learning engineers and software developers with a standard interface and useful tools for the fair comparison of varied feature representations and target task models.

“The Open Source community is the driving force for innovation in machine learning, and Indico has benefitted from it and embraces the open source effort fully,” said Slater Victoroff, co-founder and CTO at Indico. “Enso is a way for us to give back to the community and continue to promote the benefits of transfer learning to accelerate its adoption and reduce the barriers to machine learning.”

Transfer learning is the practice of applying knowledge gained on one machine learning task to aid the resolution of subsequent tasks. It has seen historic success in the field of computer vision and image classification. Tasks that would typically require hundreds of thousands of images can be tackled with just dozens of training examples per class thanks to the use of these pre-trained models. The field of natural language processing, however, has seen fewer gains from transfer learning. The Enso project is focused on addressing a core set of interrelated problems that underlie these limitations:

A lack of academic reproducibility. Due to the use of custom datasets and variations in coding practices, it is difficult to determine whether a new methodology is truly effective.
Weak baseline benchmarks that limit general applicability. It is important to evaluate new methods on a broad range of datasets to determine whether or not a new approach represents a substantial improvement over alternatives.
“Overfitting” to specific datasets. Many of the models used for benchmarking are tied to specific datasets making it too difficult to take a model trained for one domain and train it on another.
The Enso project promotes the availability of more general datasets and stronger baselines to compare research against. This will help users ascertain where application of a given method is effective and where it is not, accelerating the application of machine learning for more practical purposes.

“Measuring how well methods perform as the amount of training data increases is critical,” said Madison May, Indico machine learning architect and co-founder. “In real life examples, we often need to select for methods that perform well with only a few hundred labeled training examples. By providing a standard interface for benchmarking, we believe Enso can facilitate the development of more generalized models that have greater value to a broader base of users.”

Enso is compatible with Python 3.4+.

www.indico.io

Jun 27, 2008Olivia Cahoon
MapR Unveils Major Data Platform Update for AI and AnalyticsCray Announces New AI Workflow Software Suite
Product Centrics
TrueNAS Open Source Storage Platform brings Full Windows ACL Support to Linux

Fully featured Windows file system ACLs are well supported in TrueNAS 12.0 (CORE and Enterprise), but not generally supported by Linux. Thanks to some innovation, and sweat from the iXsystems engineering team, TrueNAS SCALE 21.08...

Driving Successful Digital Transformation Initiatives in 2022

Well, the end of the year is the perfect time to reflect on all the past year's activities and plan for the coming year. As we plan for 2022, one thing...

Recovery Platforms

Established in 2013, Imanis Data, previously Talena...

Data Driven Efficiency

Founded in 2003, Tableau is a public software company...

Updated Hitachi CRM

Building Product Manufacturers (BPM) require...

Quick Links
Untitled Document
SW500 SW500 SW500 SW500 SW500
2022 © Rockport Custom Publishing, LLC