srowen | 1 May 2009 09:32
Picon
Favicon

svn commit: r770553 - in /lucene/mahout/trunk/core/src/main/java/org/apache/mahout/cf/taste/impl: common/ neighborhood/

Author: srowen
Date: Fri May  1 07:32:27 2009
New Revision: 770553

URL: http://svn.apache.org/viewvc?rev=770553&view=rev
Log:
Added SamplingIterator and SamplingIterable and fixed up issue with sampling in NearestNUserNeighborhood

Added:
    lucene/mahout/trunk/core/src/main/java/org/apache/mahout/cf/taste/impl/common/SamplingIterable.java
    lucene/mahout/trunk/core/src/main/java/org/apache/mahout/cf/taste/impl/common/SamplingIterator.java
Modified:
    lucene/mahout/trunk/core/src/main/java/org/apache/mahout/cf/taste/impl/neighborhood/AbstractUserNeighborhood.java
    lucene/mahout/trunk/core/src/main/java/org/apache/mahout/cf/taste/impl/neighborhood/NearestNUserNeighborhood.java
    lucene/mahout/trunk/core/src/main/java/org/apache/mahout/cf/taste/impl/neighborhood/ThresholdUserNeighborhood.java

Added: lucene/mahout/trunk/core/src/main/java/org/apache/mahout/cf/taste/impl/common/SamplingIterable.java
URL: http://svn.apache.org/viewvc/lucene/mahout/trunk/core/src/main/java/org/apache/mahout/cf/taste/impl/common/SamplingIterable.java?rev=770553&view=auto
==============================================================================
---
lucene/mahout/trunk/core/src/main/java/org/apache/mahout/cf/taste/impl/common/SamplingIterable.java (added)
+++
lucene/mahout/trunk/core/src/main/java/org/apache/mahout/cf/taste/impl/common/SamplingIterable.java
Fri May  1 07:32:27 2009
 <at>  <at>  -0,0 +1,45  <at>  <at> 
+/**
+ * Licensed to the Apache Software Foundation (ASF) under one or more
+ * contributor license agreements.  See the NOTICE file distributed with
+ * this work for additional information regarding copyright ownership.
+ * The ASF licenses this file to You under the Apache License, Version 2.0
(Continue reading)

confluence | 1 May 2009 18:04
Picon
Favicon

[CONF] Apache Lucene Mahout: Mahout on Elastic MapReduce (page created)

Mahout on Elastic MapReduce (MAHOUT) created by Stephen Green
   http://cwiki.apache.org/confluence/display/MAHOUT/Mahout+on+Elastic+MapReduce

Content:
---------------------------------------------------------------------

This page details the set of steps that was necessary to get an example of k-Means clustering running on
Amazon's Elastic MapReduce.  The aim here was simply to get something running, but it should provide a good
head start if you want to run something else.

h1. Getting Started

   * Get yourself an Elastic MapReduce account.  If you're already using EC2, then you can do this from
[Amazon's AWS Managment Console|https://console.aws.amazon.com/], which has a tab for running EMR.
   * Make sure that you're up to date with the Mahout source.  

h1. Uploading data

h1. Running Mahout

---------------------------------------------------------------------
CONFLUENCE INFORMATION
This message is automatically generated by Confluence

Unsubscribe or edit your notifications preferences
   http://cwiki.apache.org/confluence/users/viewnotifications.action

If you think it was sent incorrectly contact one of the administrators
   http://cwiki.apache.org/confluence/administrators.action

(Continue reading)

confluence | 1 May 2009 19:18
Picon
Favicon

[CONF] Apache Lucene Mahout: Mahout on Elastic MapReduce (attachment added)

New files attached to Mahout on Elastic MapReduce (Apache Lucene Mahout) by Stephen Green
   http://cwiki.apache.org/confluence/display/MAHOUT/Mahout+on+Elastic+MapReduce

Attached file(s):
---------------------------------------------------------------------
    - mahoutjob.tiff (image/tiff, 255 kb)
        Using the EMR GUI to start a k-means clustering job
                http://cwiki.apache.org/confluence/download/attachments/116559/mahoutjob.tiff?version=1

---------------------------------------------------------------------
CONFLUENCE INFORMATION
This message is automatically generated by Confluence

Unsubscribe or edit your notifications preferences
   http://cwiki.apache.org/confluence/users/viewnotifications.action

If you think it was sent incorrectly contact one of the administrators
   http://cwiki.apache.org/confluence/administrators.action

If you want more information on Confluence, or have a bug to report see
   http://www.atlassian.com/software/confluence

confluence | 1 May 2009 19:20
Picon
Favicon

[CONF] Apache Lucene Mahout: Mahout on Elastic MapReduce (attachment added)

New files attached to Mahout on Elastic MapReduce (Apache Lucene Mahout) by Stephen Green
   http://cwiki.apache.org/confluence/display/MAHOUT/Mahout+on+Elastic+MapReduce

Attached file(s):
---------------------------------------------------------------------
    - emrmh1.png (image/png, 147 kb)
        Using the EMR GUI to start a k-means clustering job
                http://cwiki.apache.org/confluence/download/attachments/116559/emrmh1.png?version=1

---------------------------------------------------------------------
CONFLUENCE INFORMATION
This message is automatically generated by Confluence

Unsubscribe or edit your notifications preferences
   http://cwiki.apache.org/confluence/users/viewnotifications.action

If you think it was sent incorrectly contact one of the administrators
   http://cwiki.apache.org/confluence/administrators.action

If you want more information on Confluence, or have a bug to report see
   http://www.atlassian.com/software/confluence

srowen | 1 May 2009 19:38
Picon
Favicon

svn commit: r770768 - in /lucene/mahout/trunk/core/src/main/java/org/apache/mahout/cf/taste/impl/recommender: FarthestNeighborClusterSimilarity.java NearestNeighborClusterSimilarity.java TreeClusteringRecommender.java

Author: srowen
Date: Fri May  1 17:38:13 2009
New Revision: 770768

URL: http://svn.apache.org/viewvc?rev=770768&view=rev
Log: (empty)

Modified:
    lucene/mahout/trunk/core/src/main/java/org/apache/mahout/cf/taste/impl/recommender/FarthestNeighborClusterSimilarity.java
    lucene/mahout/trunk/core/src/main/java/org/apache/mahout/cf/taste/impl/recommender/NearestNeighborClusterSimilarity.java
    lucene/mahout/trunk/core/src/main/java/org/apache/mahout/cf/taste/impl/recommender/TreeClusteringRecommender.java

Modified: lucene/mahout/trunk/core/src/main/java/org/apache/mahout/cf/taste/impl/recommender/FarthestNeighborClusterSimilarity.java
URL: http://svn.apache.org/viewvc/lucene/mahout/trunk/core/src/main/java/org/apache/mahout/cf/taste/impl/recommender/FarthestNeighborClusterSimilarity.java?rev=770768&r1=770767&r2=770768&view=diff
==============================================================================
---
lucene/mahout/trunk/core/src/main/java/org/apache/mahout/cf/taste/impl/recommender/FarthestNeighborClusterSimilarity.java (original)
+++
lucene/mahout/trunk/core/src/main/java/org/apache/mahout/cf/taste/impl/recommender/FarthestNeighborClusterSimilarity.java
Fri May  1 17:38:13 2009
 <at>  <at>  -21,23 +21,20  <at>  <at> 
 import org.apache.mahout.cf.taste.common.TasteException;
 import org.apache.mahout.cf.taste.similarity.UserSimilarity;
 import org.apache.mahout.cf.taste.impl.common.RefreshHelper;
-import org.apache.mahout.cf.taste.impl.common.RandomUtils;
+import org.apache.mahout.cf.taste.impl.common.SamplingIterable;
 import org.apache.mahout.cf.taste.model.User;

 import java.util.Collection;
-import java.util.Random;
(Continue reading)

confluence | 1 May 2009 19:48
Picon
Favicon

[CONF] Apache Lucene Mahout: Mahout on Elastic MapReduce (attachment added)

New files attached to Mahout on Elastic MapReduce (Apache Lucene Mahout) by Stephen Green
   http://cwiki.apache.org/confluence/display/MAHOUT/Mahout+on+Elastic+MapReduce

Attached file(s):
---------------------------------------------------------------------
    - mahout.png (image/png, 89 kb)
        Using the EMR GUI to start a k-means clustering job
                http://cwiki.apache.org/confluence/download/attachments/116559/mahout.png?version=1

---------------------------------------------------------------------
CONFLUENCE INFORMATION
This message is automatically generated by Confluence

Unsubscribe or edit your notifications preferences
   http://cwiki.apache.org/confluence/users/viewnotifications.action

If you think it was sent incorrectly contact one of the administrators
   http://cwiki.apache.org/confluence/administrators.action

If you want more information on Confluence, or have a bug to report see
   http://www.atlassian.com/software/confluence

confluence | 1 May 2009 20:39
Picon
Favicon

[CONF] Apache Lucene Mahout: Mahout on Elastic MapReduce (page edited)

Mahout on Elastic MapReduce (MAHOUT) edited by Stephen Green
      Page: http://cwiki.apache.org/confluence/display/MAHOUT/Mahout+on+Elastic+MapReduce
   Changes: http://cwiki.apache.org/confluence/pages/diffpagesbyversion.action?pageId=116559&originalVersion=5&revisedVersion=6

Content:
---------------------------------------------------------------------

This page details the set of steps that was necessary to get an example of k-Means clustering running on
Amazon's Elastic MapReduce (EMR).  The aim here was simply to get something running, but it should provide
a good head start if you want to run something else.  I started out on the [QuickStart] page and went from
there.  Along the way, I encountered some problems and posted to the Amazon EMR forums to get some help.  The
[resulting
thread|http://developer.amazonwebservices.com/connect/thread.jspa?threadID=30945&tstart=15]
might have some useful information if you're having trouble.

h1. Getting Started

   * Get yourself an EMR account.  If you're already using EC2, then you can do this from [Amazon's AWS Managment
Console|https://console.aws.amazon.com/], which has a tab for running EMR.
   * Get the [ElasticFox|https://addons.mozilla.org/en-US/firefox/addon/11626] and
[S3Fox|https://addons.mozilla.org/en-US/firefox/search?q=s3fox&cat=all] Firefox extensions. 
These will make it easy to monitor running EMR instances, upload code and data, and download results.
   * Download the Ruby command line client for EMR.  You can do things from the GUI, but when you're in the midst of
trying to get something running, the CLI client will make life a lot easier.
   * Have a look at [Common Problems Running Job
Flows|http://developer.amazonwebservices.com/connect/thread.jspa?messageID=124694&#124694]
and [Developing and Debugging Job
Flows|http://developer.amazonwebservices.com/connect/message.jspa?messageID=124695#124695]
in the EMR forum at Amazon.  They were tremendously useful.
   * Make sure that you're up to date with the Mahout source.  The fix for [Issue
(Continue reading)

otis | 5 May 2009 00:25
Picon
Favicon

svn commit: r771475 - /lucene/mahout/trunk/core/src/main/java/org/apache/mahout/cf/taste/impl/similarity/BooleanTanimotoCoefficientSimilarity.java

Author: otis
Date: Mon May  4 22:25:50 2009
New Revision: 771475

URL: http://svn.apache.org/viewvc?rev=771475&view=rev
Log:
- Removed null checks which should be caught by the instanceof checks at the beginning of the method

Modified:
    lucene/mahout/trunk/core/src/main/java/org/apache/mahout/cf/taste/impl/similarity/BooleanTanimotoCoefficientSimilarity.java

Modified: lucene/mahout/trunk/core/src/main/java/org/apache/mahout/cf/taste/impl/similarity/BooleanTanimotoCoefficientSimilarity.java
URL: http://svn.apache.org/viewvc/lucene/mahout/trunk/core/src/main/java/org/apache/mahout/cf/taste/impl/similarity/BooleanTanimotoCoefficientSimilarity.java?rev=771475&r1=771474&r2=771475&view=diff
==============================================================================
---
lucene/mahout/trunk/core/src/main/java/org/apache/mahout/cf/taste/impl/similarity/BooleanTanimotoCoefficientSimilarity.java (original)
+++
lucene/mahout/trunk/core/src/main/java/org/apache/mahout/cf/taste/impl/similarity/BooleanTanimotoCoefficientSimilarity.java
Mon May  4 22:25:50 2009
 <at>  <at>  -54,10 +54,6  <at>  <at> 

    <at> Override
   public double userSimilarity(User user1, User user2) {
-
-    if (user1 == null || user2 == null) {
-      throw new IllegalArgumentException("user1 or user2 is null");
-    }
     if (!(user1 instanceof BooleanPrefUser && user2 instanceof BooleanPrefUser)) {
       throw new IllegalArgumentException();
     }
(Continue reading)

srowen | 5 May 2009 13:43
Picon
Favicon

svn commit: r771673 - /lucene/mahout/trunk/core/src/main/java/org/apache/mahout/cf/taste/impl/recommender/BooleanUserGenericUserBasedRecommender.java

Author: srowen
Date: Tue May  5 11:43:05 2009
New Revision: 771673

URL: http://svn.apache.org/viewvc?rev=771673&view=rev
Log: (empty)

Modified:
    lucene/mahout/trunk/core/src/main/java/org/apache/mahout/cf/taste/impl/recommender/BooleanUserGenericUserBasedRecommender.java

Modified: lucene/mahout/trunk/core/src/main/java/org/apache/mahout/cf/taste/impl/recommender/BooleanUserGenericUserBasedRecommender.java
URL: http://svn.apache.org/viewvc/lucene/mahout/trunk/core/src/main/java/org/apache/mahout/cf/taste/impl/recommender/BooleanUserGenericUserBasedRecommender.java?rev=771673&r1=771672&r2=771673&view=diff
==============================================================================
---
lucene/mahout/trunk/core/src/main/java/org/apache/mahout/cf/taste/impl/recommender/BooleanUserGenericUserBasedRecommender.java (original)
+++
lucene/mahout/trunk/core/src/main/java/org/apache/mahout/cf/taste/impl/recommender/BooleanUserGenericUserBasedRecommender.java
Tue May  5 11:43:05 2009
 <at>  <at>  -176,7 +176,6  <at>  <at> 
     if (theNeighborhood.isEmpty()) {
       return Double.NaN;
     }
-    double preference = 0.0;
     double totalSimilarity = 0.0;
     for (User user : theNeighborhood) {
       if (!user.equals(theUser)) {
 <at>  <at>  -185,13 +184,12  <at>  <at> 
         if (pref != null) {
           double theSimilarity = similarity.userSimilarity(theUser, user) + 1.0;
           if (!Double.isNaN(theSimilarity)) {
(Continue reading)

srowen | 5 May 2009 13:49
Picon
Favicon

svn commit: r771675 - in /lucene/mahout/trunk/core/src/main/java/org/apache/mahout/cf/taste/hadoop: RecommenderJob.java RecommenderMapper.java

Author: srowen
Date: Tue May  5 11:49:41 2009
New Revision: 771675

URL: http://svn.apache.org/viewvc?rev=771675&view=rev
Log: (empty)

Modified:
    lucene/mahout/trunk/core/src/main/java/org/apache/mahout/cf/taste/hadoop/RecommenderJob.java
    lucene/mahout/trunk/core/src/main/java/org/apache/mahout/cf/taste/hadoop/RecommenderMapper.java

Modified: lucene/mahout/trunk/core/src/main/java/org/apache/mahout/cf/taste/hadoop/RecommenderJob.java
URL: http://svn.apache.org/viewvc/lucene/mahout/trunk/core/src/main/java/org/apache/mahout/cf/taste/hadoop/RecommenderJob.java?rev=771675&r1=771674&r2=771675&view=diff
==============================================================================
---
lucene/mahout/trunk/core/src/main/java/org/apache/mahout/cf/taste/hadoop/RecommenderJob.java (original)
+++
lucene/mahout/trunk/core/src/main/java/org/apache/mahout/cf/taste/hadoop/RecommenderJob.java
Tue May  5 11:49:41 2009
 <at>  <at>  -37,14 +37,14  <at>  <at> 
  * <p>Command line arguments are:</p>
  * <ol>
  *  <li>Fully-qualified class name of { <at> link Recommender} to use to make recommendations.
- *   Note that it must have a no-arg constructor.</li>
+ *   Note that it must have a constructor which takes a { <at> link org.apache.mahout.cf.taste.model.DataModel}
+ *   argument.</li>
  *  <li>Number of recommendations to compute per user</li>
  *  <li>Location of a text file containing user IDs for which recommendations should be computed,
  *   one per line</li>
  *  <li>Location of a data model file containing preference data, suitable for use with
(Continue reading)


Gmane