Mike C | 20 Aug 21:30 2015

RecommenderJob and GenericUserBasedRecommender

Hi!

I've made a custom User Recommender using the Mahout API, using
GenericUserBasedRecommender.

I'm not quite sure how to take the next step to get it working across
Hadoop.  RecommenderJob appears to be for Item Recommenders and it's not
really clear how to adapt it for a custom Recommender.

Anyone have any pointers to how this can be done in my case?

Thanks.

Mike
Zhou Jiang | 19 Aug 04:42 2015
Picon

Is it possible to use Mahout Random Forests work with image(pixel) data in libsvm format?

Hi All,

The Default Random Forests MapReduce works with UCI glass data. 

ID f1 f2 f3 … fn L

Is there a way to make it work with image data in libsvm format(Sparse Representation) listed as blow:

L index1:feature1 index2:feature2 … indexk:featurek

Thanks

Nick Kolegraff | 15 Aug 01:42 2015
Picon

Time Series Stuff

Hey Mahouts,
Looking for some time series analysis stuff I can use in mahout.  I don't
see much, other than this legacy HMM stuff.

https://mahout.apache.org/users/classification/hidden-markov-models.html

Are plans in the works on developing out more time series analysis and
functionality and/or already exists?  '11 is the last commit that mentions
HMMs.  "git log -Shmm"

Thanks,
Nick
David Kaplan | 12 Aug 14:49 2015
Picon

Mahout Clustering Help Please

Hi all,
Hope someone can please point me in the right direction,
Very new to mahout..
Here's my scenario:

I have written a system that collects Classifieds items from multiple
websites - phones,cars,antiques and many more using scrapy, all the items
are then ingested into Solr - +- 3 million entries.
 This is then the backend for my search engine

 I want to be able to extract meaningful information to accurately
calculate realistic price average etc. I need guidance/perhaps examples in
accurate outlier detection, categorization etc extreme beginner in machine
learning so need to know if that's what I should be using

 Part of my challenge is the broad range of items/categories, different
levels of skewed data etc. e.g. finding outliers with "iphone" results when
many of those are cheap iphone accessories.

Basically it seems i need to cluster/classify but not sure exactly how to
go about it, because i do already have the categories for 500K of the
entries, example category "Cell Phones & Accessories - Accessories"

And then actually connecting Mahout to Solr...

Many thanks!
David
Andrew Palumbo | 7 Aug 14:53 2015

[ANNOUNCE] Apache Mahout 0.11.0 Release

The Apache Mahout PMC is pleased to announce the release of Mahout 0.11.0.

Mahout's goal is to create an environment for quickly creating machine learning applications that scale
and run on the highest performance parallel computation engines available. Mahout comprises an
interactive environment and library that supports generalized scalable linear algebra and includes
many modern machine learning algorithms.


The Mahout Math environment we call “Samsara” for its symbol of universal renewal. It reflects a
fundamental rethinking of how scalable machine learning algorithms are built and customized.
Mahout-Samsara is here to help people create their own math while providing some off-the-shelf
algorithm implementations. At its base are general linear algebra and statistical operations along
with the data structures to support them. It’s written in Scala with Mahout-specific extensions, and
runs most fully on Spark.


To get started with Apache Mahout 0.11.0, download the release artifacts and signatures from http://www.apache.org/dist/mahout/0.11.0/.



Many thanks to the contributors and committers who were part of this release. Please see below for the
Release Highlights.



RELEASE HIGHLIGHTS


This is a minor release over Mahout 0.10.0 meant to introduce several new features and to fix some bugs. 
Mahout 0.11.0 includes all new features and bugfixes released in Mahout versions 0.10.1, and 0.10.2.

(Continue reading)

Suneel Marthi | 7 Aug 09:24 2015
Picon

[RESULT] [VOTE] Apache Mahout 0.11.0 Release Candidate

The Vote has passed with 5 +1s from PMC and no -1s, look forward to the
announce once the release has been finalized. The Voting for the 0.11.0
release is officially now closed.
Suneel Marthi | 7 Aug 02:43 2015
Picon

[ANNOUNCE] Apache Mahout 0.10.2 Release

The Apache Mahout PMC is pleased to announce the release of Mahout 0.10.2.
Mahout's goal is to create an environment for quickly creating machine
learning applications that scale and run on the highest performance
parallel computation engines available. Mahout comprises an interactive
environment and library that supports generalized scalable linear algebra
and includes many modern machine learning algorithms.

The Mahout Math environment we call “Samsara” for its symbol of universal
renewal. It reflects a fundamental rethinking of how scalable machine
learning algorithms are built and customized. Mahout-Samsara is here to
help people create their own math while providing some off-the-shelf
algorithm implementations. At its base are general linear algebra and
statistical operations along with the data structures to support them. It’s
written in Scala with Mahout-specific extensions, and runs most fully on
Spark.

To get started with Apache Mahout 0.10.2, download the release artifacts
and signatures from http://www.apache.org/dist/mahout/0.10.2/.

Many thanks to the contributors and committers who were part of this
release. Please see below for the Release Highlights.

RELEASE HIGHLIGHTS

This is an incremental minor release over Mahout 0.10.1 meant to introduce
several new features (all of which are also available in the 0.11 lineage)
and fix a few bugs.

Mahout 0.10.2

(Continue reading)

Suneel Marthi | 7 Aug 00:08 2015
Picon

[RESULT] [VOTE] Apache Mahout 0.10.2 Release

We had 3 +1 PMC votes and no -1s, the release has passed and the voting is
now closed.
Suneel Marthi | 6 Aug 06:44 2015
Picon

[VOTE] Apache Mahout 0.11.0 Release Candidate

This is the vote for release 0.11.0 of Apache Mahout.

The vote will be going for at least 72 hours and will be closed on Thursday,
August 6th, 2015.  Please download, test and vote with

 [ ] +1, accept RC as the official 0.11.0 release of Apache Mahout
[ ] +0, I don't care either way,
[ ] -1, do not accept RC as the official 0.11.0 release of Apache Mahout,
because...

Maven staging repo:

https://repository.apache.org/content/repositories/orgapachemahout-1016
<https://repository.apache.org/content/repositories/orgapachemahout-1015>

These artifacts are the same as the previous 0.11.0 artifacts and there's
been no code changes. If you have already tested the previous artifacts,
please cast ur votes again and we'll finalize the release once we have at
least 3 PMC +1 votes.
go canal | 6 Aug 06:11 2015

Matrix inverse

Hello,I am new to Mahout. Would appreciate if someone could tell me if matrix inverse is still supported in
the latest release (0.10) ? I thought it was supported in the earlier release, for example, 0.3, in the
class  org.apache.mahout.math.matrix.linalq.Algebra ? thanks,
canal
Suneel Marthi | 6 Aug 02:02 2015
Picon

[VOTE] Apache Mahout 0.10.2 Release

This is the vote for release 0.10.2 of Apache Mahout.

The vote will be going for at least 72 hours and will be closed on Thursday,
August 6th, 2015.  Please download, test and vote with

 [ ] +1, accept RC as the official 0.10.2 release of Apache Mahout
[ ] +0, I don't care either way,
[ ] -1, do not accept RC as the official 0.10.2 release of Apache Mahout,
because...

Maven staging repo:

https://repository.apache.org/content/repositories/orgapachemahout-1015

This vote differs from the previous one to package a missing artifact. If
you have already tested the previous artifacts, please cast ur votes again
and we'll finalize the release sooner once we have atleast 3 PMC +1 votes.

Gmane