Andreas Veithen | 20 Apr 2013 10:16
Picon

Shutting down the Axis MoinMoin Wiki

All,

Over the past few weeks, our MoinMoin Wiki has increasingly become the
target for spam. It turns out that the Wiki only contains a single
page with content that doesn't need to be preserved:

http://wiki.apache.org/axis/FrontPage?action=LocalSiteMap

Also note that we own another Confluence Wiki.

If there are no objections, I will request infra to shut down the MoinMoin wiki.

Andreas
Gagandeep singh | 31 Mar 2013 07:18
Picon

New implementation of MLT

Hi folks

We started using the default implementation of MLT
(org.apache.solr.handler.MoreLikeThisHandler) recently and found that there
are a couple of things it lacks:

   1. Searching for terms in the same field as the original document:
      - the current implementation picks the top field to search an
      interesting term in based on docFreq, however this can give bad
results if
      say original product is from brand:"RED Valentino", and we end
up searching
      red in color field.
   2. Phrase boosts:
      - if product name is "business cards", then it makes sense to give a
      boost to the phrase boost to products which are also business cards.
   3. Support for bq, bf, fq, multiplicative boost:
      - you might want to filter out_of_stock products, give a
      multiplicative boost to a product based on their price
similarity / launch
      date.
   4. Support of explainOther

We had a use case for each of these and i ended up writing my own
MLTQueryParser which builds the MLT query for a given document. It also has
a new concept called childDocs. You can think of some documents as
products, and a collection of products can be though of as a category page.
You could search for similar documents based on the products a category
page has.

(Continue reading)

Chris Hostetter | 6 Dec 2012 19:30

ANNOUNCE: CFP Open For Lucene Revolution 2013: San Diego (April 29 - May 2)


http://lucenerevolution.org/

Lucene Revolution 2013 will take place at The Westin San Diego on April 29 
- May 2, 2013. Many of the brightest minds in open source search will 
convene at this 4th annual Lucene Revolution to discuss topics and trends 
driving the next generation of search. The conference will be preceded by 
two days of Apache Lucene, Solr and Big Data training.

The event’s agenda will be comprised of Apache Lucene/Solr and Big Data 
tutorials and speaker sessions, creating opportunities for developers, 
technologists and business leaders to explore and gain deeper 
understandings of the technologies connected with open source search.

Individuals are encouraged to submit proposals for technical talks that 
focus on Apache Lucene and Solr in the enterprise, Big Data, case studies, 
large-scale search, and data integration.

Guidelines for submissions...

http://lucenerevolution.org/2013/call-for-papers

-Hoss
Ross Gardler | 1 Dec 2012 16:13
Picon
Favicon
Gravatar

Fwd: Advice on inviting Apache insight to NSF SI2 meeting?

Anyone able to help the NSF out at a meeting in DC in Jan?

See below for more info.

On a personal note I've noticed an increased, and genuine, interest in how
to improve the impact of publicly funded research outputs from the NSF in
the last couple of years. We are even seeing important components turning
up in the Incubator. I hope we can send someone able to help them
understand how we do things around here. It's your (US) tax dollars going
into this work.

I'd normally do this myself but I have a clash on these dates.

Let me know privately if you are interested and I'll help you/them figure
out who the best fit is.

Ross

Sent from my tablet
---------- Forwarded message ----------
From: "James Howison" <jhowison@...>
Date: Nov 30, 2012 9:31 PM
Subject: Advice on inviting Apache insight to NSF SI2 meeting?
To: "Ross Gardler" <rgardler@...>

Hi Ross,

I'm organizing a panel at an upcoming meeting for the PIs of all the
projects funded by the SI2 funding program at the NSF (SI2, or SI^2, is the
main Scientific Software funding program at the moment).
(Continue reading)

Glenn Adams | 20 Oct 2012 16:34
Gravatar

[ANN] Apache XML Graphics Commons 1.5 Released

The Apache XML Graphics team is pleased to announce the immediate availability of Apache XML Graphics Commons Version 1.5 [1].

Apache XML Graphics Commons is a library that consists of several reusable components used by Apache FOP [2] and Apache Batik [3]. Many of these components may be easily used outside the domains of SVG and XSL-FO.

This release fixes a number of bugs and provides important performance improvements. For a more detailed list of changes see [4]. Source and binary distributions can be downloaded from an ASF Mirror at [5]. Further download information is available at [6]. Maven artifacts for this release are available at [7].


Glenn Adams | 17 May 2012 07:16
Gravatar

Moving XMLGraphics Products from Bugzilla to JIRA

In accordance with a positive vote [1] to transition the XMLGraphics products from Bugzilla to JIRA, I have filed an infrastructure task requesting this transition at [2]. If I can help accomplish this task in any way, please let me know.


Regards,
Glenn Adams (gadams at apache dot org)
Clay Leeds | 19 Apr 2012 03:02
Picon
Gravatar

Re: on changing fop documentation sources to markdown

I replaced the logo for all sites a month or so ago. 

I'm not at a place I can publish, but if someone can publish the PRODUCTION sites, the logo will show up (be
sure to clear cache!). 

Clay

"My religion is simple. My religion is kindness."
- HH The Dalai Lama of Tibet

On Apr 18, 2012, at 7:09 AM, Chris Bowditch <bowditch_chris <at> hotmail.com> wrote:

> On 18/04/2012 13:52, Clay Leeds wrote:
>> On Apr 18, 2012, at 5:12 AM, Chris Bowditch<bowditch_chris <at> hotmail.com>  wrote:
>>> On 18/04/2012 07:24, The Web Maestro wrote:
>>> 
>>> Hi Clay,
>>> 
>>>> I added the logo (in GIF, JPG, PNG&  SVG formats... ;-)
>>> Thanks, but I don't yet see it on the staging website. Is there a delay before that appears?
> 
> Hi Clay,
> 
>> Strange. The new logo showed up when I refreshed it. Perhaps it's your cache? Try loading only the logo.
> 
> Yes you are right. It was my browser cache. I can now see the updated logo.
> 
>> 
>>>> Sponsorship&  Thanks were already there. License is on the Legal page, which is there, but I've added it
to the sidebar as well, along with the Security page. ;-)
>>> Thanks. I can now see the 4 required links.
>>> 
>>>> I also got the Compliance table working. Unfortunately, the CMS is stripping the
'class="ForrestTable"', so the coloring is White-On-White (but if you select the text, you'll see the
content and layout is there).
>>>> 
>>>> As for the navigation menu, I'd like it to collapse most of the links, except the section you're in.
Anyone have a favorite jQuery menu they like for this? If not, I'll see about finding one...
>>> All the "TM" logos are missing from the content and headers though. It took me quite some time to add them
to all the pages. Will you be able to re-sync the content with the latest xdocs as it would take quite some
time to re-apply them and I want to tell the board that FOP, Commons and XML Graphics sites are now brand
compliant in the upcoming report.
>> The current LIVE site has it, so we should be good informing the board it's there, no?
>> 
>> Weird. When I added the content, I did an `svn up` to ensure it was recent content. I'm sure I'll have to
re-synch, anyway, so we'll see. I wish it were a caching thing!
> 
> I can see the TM logos in most of the content after clearing the cache. Just the XML Graphics top page doesn't
appear to have them now.
> 
> Thanks,
> 
> Chris
> 
>> 
>>> Thanks,
>>> 
>>> Chris
>> 
> 
Clay Leeds | 18 Apr 2012 14:52
Picon
Gravatar

Re: on changing fop documentation sources to markdown

On Apr 18, 2012, at 5:12 AM, Chris Bowditch <bowditch_chris <at> hotmail.com> wrote:
> On 18/04/2012 07:24, The Web Maestro wrote:
> 
> Hi Clay,
> 
>> I added the logo (in GIF, JPG, PNG & SVG formats... ;-)
> 
> Thanks, but I don't yet see it on the staging website. Is there a delay before that appears?

Strange. The new logo showed up when I refreshed it. Perhaps it's your cache? Try loading only the logo. 

>> Sponsorship & Thanks were already there. License is on the Legal page, which is there, but I've added it to
the sidebar as well, along with the Security page. ;-)
> 
> Thanks. I can now see the 4 required links.
> 
>> I also got the Compliance table working. Unfortunately, the CMS is stripping the
'class="ForrestTable"', so the coloring is White-On-White (but if you select the text, you'll see the
content and layout is there).
>> 
>> As for the navigation menu, I'd like it to collapse most of the links, except the section you're in. Anyone
have a favorite jQuery menu they like for this? If not, I'll see about finding one...
> 
> All the "TM" logos are missing from the content and headers though. It took me quite some time to add them to
all the pages. Will you be able to re-sync the content with the latest xdocs as it would take quite some time
to re-apply them and I want to tell the board that FOP, Commons and XML Graphics sites are now brand
compliant in the upcoming report.

The current LIVE site has it, so we should be good informing the board it's there, no?

Weird. When I added the content, I did an `svn up` to ensure it was recent content. I'm sure I'll have to
re-synch, anyway, so we'll see. I wish it were a caching thing!

> Thanks,
> 
> Chris
Clay Leeds | 17 Apr 2012 15:59
Picon
Gravatar

Re: on changing fop documentation sources to markdown

NOTE: Moving discussion to general <at> . Please make all further responses to general <at> . 

BACKGROUND:
We are discussing moving XML Graphics web site to ASF-CMS. You can see progress here:

http://xmlgraphics.staging.apache.org/

ToDo:
- Lots.
- Style & templating work
- Non-HTML content (figure out how to handle java-docs, download.cgi, demo stuff, etc.--might not be too
difficult, just a matter of committing to CMS content/ dirs?)

Done:
- most HTML content

Clay

"My religion is simple. My religion is kindness."
- HH The Dalai Lama of Tibet

On Apr 17, 2012, at 1:19 AM, Chris Bowditch <bowditch_chris <at> hotmail.com> wrote:

> On 15/04/2012 19:52, The Web Maestro wrote:
>> I just added most of the nav for FOP Development (0.95, 1.0, trunk/ and 'dev'):
> Hi Clay,
>> 
>> http://xmlgraphics.staging.apache.org/
>> 
>> As mentioned, there are likely missing things (like java-docs, download.cgi, Batik's DEMO, etc.)...
It'd be great if folks could take a look... I haven't figured out how to add other content, but It Might Just
Work(tm) if weupload it there via SVN...
> 
> Many thanks for working on this.
> 
>> 
>> Come to think of it, we should probably move this to general <at> xmlgraphics.apache.org
<mailto:general <at> xmlgraphics.apache.org>. Or is there a better mailing list? I'll refrain from
sending to other lists, until we figure out where it should go.
>> 
>> Any ideas where this discussion should move, since it entails changes to all XML Graphics Project web docs?
> 
> Yes this discussion should move to general <at>  as it will affect all sub projects of XML Graphics.
> 
> Thanks,
> 
> Chris
> 
>> 
>> Kind regards,
>> 
>> Clay Leeds
>> --
>> <the.webmaestro <at> gmail.com <mailto:the.webmaestro <at> gmail.com>> - <http://ourlil.com/>
>> My religion is simple. My religion is kindness.
>> - HH The 14th Dalai Lama of Tibet
>> 
>> 
>> On Sat, Apr 14, 2012 at 11:53 PM, The Web Maestro <the.webmaestro <at> gmail.com
<mailto:the.webmaestro <at> gmail.com>> wrote:
>> 
>>    I've updated the docs a bit, and gotten much (but not all!) of the
>>    FOP, Batik & Commons content into the CMS...
>> 
>>    We're still missing an adequate navigation system, so I did a
>>    preliminary job of getting a few links in the sidenav, but it's
>>    incomplete and ugly as sin. We'll need to build a mechanism to
>>    hide (collapse?) non-relevant links, but that shouldn't be too hard.
>> 
>>    We also need to figure out java-docs, download.cgi, and perhaps
>>    some other issues...
>> 
>>    Without further ado:
>> 
>>    http://xmlgraphics.staging.apache.org/
>> 
>> 
>>    Kind regards,
>> 
>>    Clay Leeds
>>    --
>>    <the.webmaestro <at> gmail.com <mailto:the.webmaestro <at> gmail.com>> -
>>    <http://ourlil.com/>
>>    My religion is simple. My religion is kindness.
>>    - HH The 14th Dalai Lama of Tibet
>> 
>> 
>>    On Thu, Apr 12, 2012 at 10:03 PM, Clay Leeds
>>    <the.webmaestro <at> gmail.com <mailto:the.webmaestro <at> gmail.com>> wrote:
>> 
>>        On Apr 12, 2012, at 6:41 AM, Glenn Adams <glenn <at> skynav.com
>>        <mailto:glenn <at> skynav.com>> wrote:
>>        > Agreed that removing forrest dependency is desirable.
>>        However, presumably the current xdocs would need to be
>>        converted to MD, in which case someone will need to construct
>>        an XSLT to do so. That begs the question of whether it would
>>        be necessary (at this time) to convert the source format to
>>        MD, or if an additional step in the CMS based process could
>>        merely perform that step automatically. If so, then it should
>>        not be necessary to change the authoring format at this time.
>>        It could be done as a separate step later.
>> 
>>        I am using Forrest 0.8 w markdown plugin. Conversion could be
>>        scripted, but that would negate the benefit of the CMS.
>> 
>>        > What I don't know yet is what we will lose from the
>>        conversion to MD in terms of ability to markup our source
>>        docs. Clearly, MD is not as semantically or syntactically rich
>>        as an XML based source. But do we lose anything of
>>        consequence? I don't know yet.
>>        >
>>        > One thing we may lose if we don't convert to MD is the
>>        ability to use CMS in-page editing. So that is a
>>        consideration. Perhaps that option is sufficient to justify
>>        other potential negatives in converting.
>>        >
>>        > G.
>> 
>>        One of my goals, was to see some discussion in the DEVers,
>>        before I complete the task of converting the docs. The
>>        MarkDown format is not nearly as semantic as xdoc, but it
>>        serves a different purpose.
>> 
>>        It'll take some time, and I'm still prepared to take it on.
>>        But I was hoping for some discussion ;-)
>> 
>> 
>> 
> 
Chris Bowditch | 17 Apr 2012 10:15
Picon
Favicon

[VOTE] Switch from Bugzilla to JIRA

Hi All,

We need to have a formal vote to decide if the XML Graphics project and 
all sub projects should switch the bug tracking system from Bugzilla to 
JIRA. The main benefits of which are:

1. JIRA has a more modern look and feel
2. Infrastructure are not equiped to support Bugzilla anymore as most 
Apache projects are based on JIRA. Therefore should be more able to 
respond to requests for changes.

The downside is that someone will have to work with infra <at>  to organize 
the import of bugs from BZ to JIRA. We then need to update the website 
links to point to JIRA instead of BZ. Glenn Adams has kindly volunteered 
to oversee the migration.

Heres my +1

The vote runs until 24th April.

Thanks,

Chris
Erik Hatcher | 10 Apr 2012 02:38
Picon

[CFP] Open Source Search Conference Oct 2, 2012

Sending this on behalf of my friends at BasisTech -

--------

Subject: Call for Presentations: Open Source Search Conference Oct. 2, 2012 (Chantilly, VA)

======================================
Call for Presentations & Save the Date
Open Source Search Conference Oct 2, 2012 
(tutorials Oct. 1) in Chantilly, VA
http://www.basistech.com/conference/2012/oss/
======================================

The second annual Open Source Search Conference will be held on October 2, 2012 in Chantilly, VA, and you are
invited to submit a presentation. The conference will be attended by government employees and
contractors who are evaluating, building, or using Apache Solr and other open source tools for search
applications throughout the government.

This event is a unique opportunity to share tips and ideas to overcome challenges working with open source
search projects. We are also looking for people who are interested in providing half- and full-day
tutorials on the day before the conference (October 1, 2012). The tutorials should provide hands-on
guidance for using or developing open source search applications.
For more information, visit: http://www.basistech.com/conference/2012/oss/

==Dates==
Conference: October 2, 2012
Tutorials: October 1, 2012

==Submission Instructions==
Please email submissions for conference presentations and tutorials to oss2012 <at> basistech.com by April
23, 2012.
To submit a presentation or tutorial, e-mail the following information:
1. Title
2. Author
3. Brief Biography
4. Description of presentation or tutorial (100-150 words)
5. Brief description of author’s experience with Apache Solr and/or other open source tools
6. Specify whether the presentation or tutorial is targeted towards users or developers

==Suggested Topics==
1. Large-scale Apache Solr
* Solr at exabyte scale 
* High-load deployments
* Complex queries
2. Analytic interfaces
* Geospatial search
* Iterative Analytics using Solr (index reprocessing, etc.)
* Exploring and Discovering Big Data with Solr
* Linguistic plug-in use and development
* Document clustering (semantic, field collapsing, dynamic faceting)
* Language identification
* Search in a multilingual site
* Sentiment analysis
3. Text Mining
* Text analytics processing
* Entity extraction
* Name matching
4. Security
* Access control
* Index encryption
5.Case studies and user experiences
* Migrating to Solr from other search engines
* Other topics

==About the Conference==
The Open Source Search Conference is sponsored by Basis Technology, which has been producing government
conferences since 2006 and focuses on topics including text analytics, human language technology, and
the nexus of language, culture and technology for the federal community. For more information about our
conferences, visit: http://www.basistech.com/conference.
Basis Technology provides software solutions for text analytics, information retrieval, and name
resolution in over 40 languages. Our customers include leading software vendors, content providers,
financial institutions, and government agencies in the defense and intelligence industry.


Gmane