Extension talk:CirrusSearch/2018
This page used the Structured Discussions extension to give structured discussions. It has since been converted to wikitext, so the content and history here are only an approximation of what was actually displayed at the time these comments were made.
Discussion related to the CirrusSearch MediaWiki extension.
See also the open tasks for CirrusSearch on phabricator.
Problem running updateSuggesterIndex.php
Using REL1_29
I already ran the forceSearchIndex.php script.
Look at the indices:
> curl -s localhost:9200/_cat/indices
green open wikidexwiki_content_first EgLUpqUuS66x1bUvIVtoKw 4 0 14106 2836 739.2mb 739.2mb
green open wikidexwiki_general_first L4Y2SfehQdCg2oaZiT2Ing 4 0 262131 62386 1.9gb 1.9gb
green open mw_cirrus_metastore_first 2cf0X6ZpQi6yr6ZE0-6jSA 1 0 3 2 8.5kb 8.5kb
Running updateSuggesterIndex.php fails:
> php extensions/CirrusSearch/maintenance/updateSuggesterIndex.php
Scanning available plugins... analysis-icu
Picking analyzer...spanish
Fetching Elasticsearch version...5.6.5...ok
Inferring index identifier...wikidexwiki_titlesuggest_first
Setting index identifier...wikidexwiki_titlesuggest_1515680487
2018-01-11 14:21:27 Waiting for the index to go green... Green!
2018-01-11 14:21:27 Setting max_docs to 14106
2018-01-11 14:21:27 Indexing 14106 documents from content with batchId: 1515680487 and scoring method: quality
10% done... 14% done... 24% done... 28% done... 38% done... 42% done... 46% done... 56% done... 60% done... 70% done... 74% done... 88% done... 92% done... 100% done...
2018-01-11 14:21:36 Indexing from content index done.
2018-01-11 14:21:36 Indexing 61 documents from general with batchId: 1515680487 and scoring method: quality
2018-01-11 14:21:36 Indexing from general index done.
2018-01-11 14:21:36 Enabling replicas...
2018-01-11 14:21:56 Waiting for the index to go green... Green!
2018-01-11 14:21:57 Updating tracking indexes...[cebd4be3bd4e9c30eefa478c] [no req] Exception from line 745 of ...extensions/CirrusSearch/maintenance/updateSuggesterIndex.php: meta store does not exist, you must index your data first
Backtrace:
#0 ...extensions/CirrusSearch/maintenance/updateSuggesterIndex.php(319): CirrusSearch\Maintenance\UpdateSuggesterIndex->updateVersions()
#1 ...extensions/CirrusSearch/maintenance/updateSuggesterIndex.php(240): CirrusSearch\Maintenance\UpdateSuggesterIndex->rebuild()
#2 ...maintenance/doMaintenance.php(111): CirrusSearch\Maintenance\UpdateSuggesterIndex->execute()
#3 ...extensions/CirrusSearch/maintenance/updateSuggesterIndex.php(793): require_once(string)
#4 {main}
The last HTTP request I saw it make before failing was:
HEAD /mw_cirrus_metastore/version HTTP/1.1
Host: 127.0.0.1:9200
Accept: */*
Accept-Encoding: deflate, gzip

HTTP/1.1 400 Bad Request
content-type: text/plain; charset=UTF-8
content-length: 73
I looked again at the indices and a bogus wikidexwiki_titlesuggest_1515680487 has been created:
green open wikidexwiki_titlesuggest_1515680487 LQlgfNreSsumHQRxL9hvcg 4 0 18997 0 7.1mb 7.1mb
green open mw_cirrus_metastore_first 2cf0X6ZpQi6yr6ZE0-6jSA 1 0 3 2 8.5kb 8.5kb
green open wikidexwiki_general_first L4Y2SfehQdCg2oaZiT2Ing 4 0 262131 62386 1.9gb 1.9gb
green open wikidexwiki_content_first EgLUpqUuS66x1bUvIVtoKw 4 0 14106 2836 739.2mb 739.2mb
I don't know why it tries to get the mw_cirrus_metastore index while all my indices seem to have a _first suffix... Ciencia Al Poder (talk) 14:26, 11 January 2018 (UTC)
- CirrusSearch relies on index aliases, and error messages may sometimes refer to these aliases instead of the actual indices. In this case it perhaps complains because the alias mw_cirrus_metastore does not point to mw_cirrus_metastore_first.
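For reference, a quick way to list every alias and the concrete index it resolves to (a sketch assuming Elasticsearch listens on localhost:9200):
curl -s localhost:9200/_cat/aliases?v   # one row per alias: alias name, target index, filter/routing columns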
- What you describe in your message sounds like a bug in CirrusSearch.
- Could you provide us more information by dumping the output of:
curl -s localhost:9200/mw_cirrus_metastore_first/_aliases?pretty
- If there are no aliases for this index you may try to fix it by running:
curl -XPOST localhost:9200/_aliases/ -d '{"actions": [{"add": { "alias": "mw_cirrus_metastore", "index": "mw_cirrus_metastore_first"}}]}'
- and rerun the updateSuggesterIndex.php script.
- Thanks for your feedback! DCausse (WMF) (talk) 17:13, 11 January 2018 (UTC)
- Another thing to check would be to verify that the Elastica version you are using is right, because I realized that the HTTP request you captured is wrong: it should be HEAD /mw_cirrus_metastore, not HEAD /mw_cirrus_metastore/version.
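For reference, the expected request can be reproduced by hand with curl, whose -I flag sends a HEAD request (a sketch assuming Elasticsearch listens on localhost:9200):
curl -sI localhost:9200/mw_cirrus_metastore   # 200 OK if the alias resolves, 404 if not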
- Did you get the Elastica extension with the REL1_29 tag as well?
Also could you update your message by adding the version of elasticsearch you use? (5.6.5)
- Thanks! DCausse (WMF) (talk) 17:40, 11 January 2018 (UTC)
- Both were downloaded for REL1_29
- Elastica is version 1.3.0.0. I still have the snapshots:
- Elastica-REL1_29-e2a9593.tar.gz
- CirrusSearch-REL1_29-5ca9036.tar.gz Ciencia Al Poder (talk) 17:50, 11 January 2018 (UTC)
> curl -s localhost:9200/mw_cirrus_metastore_first/_aliases?pretty
{
  "mw_cirrus_metastore_first" : {
    "aliases" : {
      "mw_cirrus_metastore" : { }
    }
  }
}
- The alias exists. I didn't know about aliases.
- So I've tried to do the same request but with GET instead of HEAD and... voila:
> curl localhost:9200/mw_cirrus_metastore/version?pretty
{
  "error" : {
    "root_cause" : [
      {
        "type" : "illegal_argument_exception",
        "reason" : "No endpoint or operation is available at [version]"
      }
    ],
    "type" : "illegal_argument_exception",
    "reason" : "No endpoint or operation is available at [version]"
  },
  "status" : 400
}
- Maybe the /version endpoint is only available in specific ES versions? This is mine:
- Ciencia Al Poder (talk) 17:48, 11 January 2018 (UTC)
> curl localhost:9200/?pretty
{
  "name" : "wikidexsearch1-n1",
  "cluster_name" : "wikidexsearch1",
  "cluster_uuid" : "evILbJFIQKKMpzIcuII1Bw",
  "version" : {
    "number" : "5.6.5",
    "build_hash" : "6a37571",
    "build_date" : "2017-12-04T07:50:10.466Z",
    "build_snapshot" : false,
    "lucene_version" : "6.6.1"
  },
  "tagline" : "You Know, for Search"
}
- I now remember that we had to bump elastica to a more recent version when we migrated from elasticsearch 5.3 to 5.5
- I'm afraid that if you want to run MW REL1_29 you'll have to try to downgrade elastic to the latest 5.3 version.
- Another hazardous solution would be to hack cirrus to work around this problem by changing the function in includes/Maintenance/MetaStoreIndex.php
- from:
public static function updateMetastoreVersions( Connection $connection, $indexBaseName, $indexTypeName ) {
	$index = self::getVersionType( $connection );
	if ( !$index->exists() ) { // <========== This line triggers the bug in elastica
		throw new \Exception( "meta store does not exist, you must index your data first" );
	}
	$index->addDocument( self::versionData( $connection, $indexBaseName, $indexTypeName ) );
}
- to:
public static function updateMetastoreVersions( Connection $connection, $indexBaseName, $indexTypeName ) {
	$index = self::getVersionType( $connection );
	if ( !$index->getIndex()->exists() ) { // hack to workaround incompatibility with elastic 5.5+
		throw new \Exception( "meta store does not exist, you must index your data first" );
	}
	$index->addDocument( self::versionData( $connection, $indexBaseName, $indexTypeName ) );
}
- NOTE: I don't suggest this solution unless you feel comfortable with PHP, and also because you may run into other issues in other parts of the code due to some incompatibilities between elastic 5.6 and the elastica version shipped with REL1_29. DCausse (WMF) (talk) 18:28, 11 January 2018 (UTC)
- Would it work if I use Elastica (extension) from master or 1.30? or CirrusSearch 1.30? (with MediaWiki 1.29)
- I was planning to upgrade MediaWiki soon, but wanted to prioritize the search before upgrading. Ciencia Al Poder (talk) 18:47, 11 January 2018 (UTC)
- Sadly after a quick check the elastica version used by the Elastica extension on REL1_30 is still 5.1.0 and 5.3.0 is needed. (https://github.com/wikimedia/mediawiki-extensions-Elastica/blob/REL1_30/composer.json#L21)
- The future 1.31 will have the proper version. DCausse (WMF) (talk) 19:19, 11 January 2018 (UTC)
- Ok, so I have to downgrade ES to 5.3. It would be good to clarify that on the Extension:CirrusSearch#Dependencies section, since it currently says 1.29 requires ES 5.3+. I even installed ES 6 before, but then the extension itself said it was not compatible. Ciencia Al Poder (talk) 20:20, 11 January 2018 (UTC)
- Sure, thanks again for your feedback. DCausse (WMF) (talk) 09:46, 12 January 2018 (UTC)
Suggestion: Add uploader / author to search results snippet
Issue:
As a user I'd like to see who uploaded a specific file so that I can see more of their content (without digging through file history).
Proposed solution
- Add an author to all search results (including regular pages); and / or
- For files, add only the most recent uploader. 197.218.91.135 (talk) 10:42, 14 February 2018 (UTC)
Suggestion: Make it possible to search by page author /contributor/ uploader
Problems
- As a user, I'd like to discover more files or content made / created by a specific user.
- As a user, I'd like to find specific content without paging through Special:listfiles.
Background
Currently there is no way to restrict search results to pages uploaded or created by a specific user. Paging through special:listfiles is not an activity any sane person would do for users with massive uploads, e.g. Special:ListFiles&dir=prev&user=Ruthven. Attempting to view massive new pages by a specific user (special:contribs) will also result in a timeout on a big enough wiki, especially if the namespace parameter is used.
Also, for regular pages, this provides a sensible and easy interface to see and count all (existing) pages created by a user as this would naturally include the matches.
Other use cases:
- Looking into discussions (Talk pages) participated
- Looking into pages they created with a specific keyword
- Readers looking into interesting pages or media initially created by a specific contributor
- Anti-vandalism - looking into pages created / edited by a specific user and containing a specific term.
Proposed solution
- Add a new search keyword "author:", e.g. "author:User1"; AND
- Add a new search keyword "contributor:" to list all pages a edited by particular user;
- Possibly make it possible to include more than one author, e.g. "author:User1|user2|..." or alternatively "author:User1 author:User2"
Note: A file page may be created before a file upload (by another user). So there may be a need to distinguish between an uploader and a file page creator. 197.218.91.135 (talk) 11:08, 14 February 2018 (UTC)
- Hm, this is an interesting proposal. Given that it's a more contributor-focused tool, I wonder if this might be more appropriate for the AdvancedSearch project Wikimedia Deutschland is working on. I don't think "contributor" is a current field, but it might be a welcome suggestion.
- It also sounds a little like maybe an updated Special:Contributions or Special:ListFiles would do a better job than CirrusSearch, given that Special:Search is so general and broad. If other folks from the Search team are reading this, please tell me if I'm wrong!
- IP, I'd be happy to create a phab task or two if you think that would be helpful.
- Humor: Or maybe we need a one-sided Interaction timeline! :) CKoerner (WMF) (talk) 20:30, 20 February 2018 (UTC)
- My guess is that perspective is based on a Wikipedia-centric view.
- The "contributor" keyword might be more related to editors, but readers are 100% interested in knowing the creator / uploader of a file or page in certain contexts. For instance, in wikibooks, one may be interested in stories (pages) created / published by a specific user. While for wikipedia itself, it often doesn't matter who created the page, knowing who uploaded a specific file is still useful, perhaps a particular user uploads images of new species of animals or some other interesting topic. That is entirely distinct from the photographers, who the uploader may or may not know, the reader may still be interested in seeing more of those rather than simply finding out who photographed one particular creature.
- In the "real" world, it is also very common for people to buy (read / view) books / movies from the same author / writers, exactly because they appreciate their expertise and / or writing style.
- > Special:Contributions or Special:ListFiles would do a better job than CirrusSearch, given that Special:Search is so general and broad.
- Not really. Remember that special:search gives the powerful ability to add extra filters that neither listfiles nor contributions will likely ever have, e.g. "keyword, title, geoip", etc. Also while people do enjoy deceiving themselves, the average person can't deal with vast amounts of data. Those pages have close to infinite paging as a poor man's alternative to the lack of a proper filtering capability.
- > IP, I'd be happy to create a phab task or two if you think that would be helpful.
- Feel free to create them. The feature suggestion is still valid, in my opinion. 197.218.84.219 (talk) 21:19, 22 February 2018 (UTC)
- Fair enough. :)
- I filed a task: https://phabricator.wikimedia.org/T188125
- > the average person can't deal with vast amounts of data.
- An author (to your exact example!) I enjoy once talked/wrote about this. He called it a problem of "filter failure".
- Oh, and since I can't Special:Thank you, let me just state it plainly. Thank you. CKoerner (WMF) (talk) 18:10, 23 February 2018 (UTC)
Version with MediaWiki 1.28
Hi,
Product | Version |
---|---|
MediaWiki | 1.28.2 |
PHP | 7.1.6 (apache2handler) |
MariaDB | 10.1.24-MariaDB |
ICU | 4.8.1.1 |
Elasticsearch | 2.4.5 |
In the download page (Special:ExtensionDistributor/CirrusSearch) I can only download CirrusSearch for versions 27, 29 and 30.
I have tried to install CirrusSearch with versions 27 and 30, but when I execute updateSearchIndexConfig.php it says that Elasticsearch version 2.4.5 is not supported.
On the CirrusSearch (Extension:CirrusSearch#Dependencies) page it says
- MediaWiki 1.28.x requires ElasticSearch 2.x.
Where can I download CirrusSearch for MediaWiki 1.28 and Elasticsearch 2.4.5?
Thanks 195.55.236.138 (talk) 09:55, 7 March 2018 (UTC)
- Use the version from the 1.28 branch. This should work.
- Still I believe it will be best for you to upgrade both MediaWiki and Elasticsearch to supported versions. [[kgh]] (talk) 15:09, 7 March 2018 (UTC)
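If fetching by branch is easier than the extension distributor snapshots, the standard Gerrit clone URLs work too (a sketch; adjust the branch to your MediaWiki version):
git clone -b REL1_28 https://gerrit.wikimedia.org/r/mediawiki/extensions/CirrusSearch
git clone -b REL1_28 https://gerrit.wikimedia.org/r/mediawiki/extensions/Elastica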
- Thanks for your response @Kghbln. Unluckily I am not allowed to upgrade MediaWiki.
- I have downloaded the version from the branch as you said, but now this error appears. I have searched for this error and seen other people with the same one, but no solutions :(
- content index...
- Fetching Elasticsearch version...2.4.5...ok
- Scanning available plugins...none
- Inferring index identifier...[9034f831a5ee88edf680c617] [no req] Error from line 34 of /opt/lampp/htdocs/wiki/extensions/Elastica/vendor/ruflin/elastica/lib/Elastica/Exception/ResponseException.php: Wrong parameters for Elastica\Exception\ResponseException([string $message [, long $code [, Throwable $previous = NULL]]])
- Backtrace:
- #0 /opt/lampp/htdocs/wiki/extensions/Elastica/vendor/ruflin/elastica/lib/Elastica/Exception/ResponseException.php(34): Exception->__construct(array)
- #1 /opt/lampp/htdocs/wiki/extensions/Elastica/vendor/ruflin/elastica/lib/Elastica/Transport/Http.php(159): Elastica\Exception\ResponseException->__construct(Elastica\Request, Elastica\Response)
- #2 /opt/lampp/htdocs/wiki/extensions/Elastica/vendor/ruflin/elastica/lib/Elastica/Request.php(171): Elastica\Transport\Http->exec(Elastica\Request, array)
- #3 /opt/lampp/htdocs/wiki/extensions/Elastica/vendor/ruflin/elastica/lib/Elastica/Client.php(621): Elastica\Request->send()
- #4 /opt/lampp/htdocs/wiki/extensions/Elastica/vendor/ruflin/elastica/lib/Elastica/Status.php(163): Elastica\Client->request(string, string)
- #5 /opt/lampp/htdocs/wiki/extensions/Elastica/vendor/ruflin/elastica/lib/Elastica/Status.php(45): Elastica\Status->refresh()
- #6 /opt/lampp/htdocs/wiki/extensions/Elastica/vendor/ruflin/elastica/lib/Elastica/Client.php(454): Elastica\Status->__construct(Elastica\Client)
- #7 /opt/lampp/htdocs/wiki/extensions/CirrusSearch/includes/Maintenance/ConfigUtils.php(109): Elastica\Client->getStatus()
- #8 /opt/lampp/htdocs/wiki/extensions/CirrusSearch/includes/Maintenance/ConfigUtils.php(78): CirrusSearch\Maintenance\ConfigUtils->getAllIndicesByType(string)
- #9 /opt/lampp/htdocs/wiki/extensions/CirrusSearch/maintenance/updateOneSearchIndexConfig.php(260): CirrusSearch\Maintenance\ConfigUtils->pickIndexIdentifierFromOption(string, string)
- #10 /opt/lampp/htdocs/wiki/extensions/CirrusSearch/maintenance/updateSearchIndexConfig.php(58): CirrusSearch\Maintenance\UpdateOneSearchIndexConfig->execute()
- #11 /opt/lampp/htdocs/wiki/maintenance/doMaintenance.php(111): CirrusSearch\Maintenance\UpdateSearchIndexConfig->execute()
- #12 /opt/lampp/htdocs/wiki/extensions/CirrusSearch/maintenance/updateSearchIndexConfig.php(65): require_once(string)
- #13 {main} 195.55.236.138 (talk) 09:11, 8 March 2018 (UTC)
- I guess you are out of luck. I cannot tell what is wrong and I doubt that the developers will address issues for unsupported branches of MediaWiki. As a matter of fact I would already be happy if that was done for supported branches of MediaWiki. However, one never knows. [[kgh]] (talk) 12:28, 8 March 2018 (UTC)
- This looks like a bug in Elastica itself. Could you make sure that the Elastica extension is also on 1.28 and that composer update has been run properly? DCausse (WMF) (talk) 15:53, 9 March 2018 (UTC)
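A minimal sketch of that check, assuming the extensions live under extensions/ and were fetched with git:
cd extensions/Elastica
git checkout REL1_28        # match the MediaWiki branch
composer install --no-dev   # installs the ruflin/elastica version pinned for this branch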
Suggestion: Include creation date for files (including exif date) / pages in search result snippet (metadata)
Issue:
As a user, I expect the file upload date, and page creation date to be included in search results.
Background
As a user recently looking through files, I was confused about the date in the search results for files. I expected it to be the most recent upload date, but instead it is the last time the related file page was edited.
Other use cases:
- Verifying whether the information in the snippet is actually accurate. A page last edited 5 minutes ago is likely to contain a lot of inaccuracies and potential misinformation.
- An image upload date can easily be used to evaluate information, e.g. an image caption incorrectly claiming X happened on Y date.
Proposed solution
Include metadata in search results snippet (where appropriate / available):
- Exif date - for latest media upload
- Upload date - for latest media upload
- Page creation date 197.218.88.6 (talk) 11:24, 11 March 2018 (UTC)
Issue with MW 1.30 and SPARQL client class
Hi,
I have recently updated my wiki from 1.27.4 to 1.30. Since I was using CirrusSearch I had to update elasticsearch according to the new version. So now my installation is the following:
Product | Version |
---|---|
MediaWiki | 1.30.0 |
PHP | 7.0.25-0ubuntu0.16.04.1 (apache2handler) |
MySQL | 5.7.21-0ubuntu0.16.04.1 |
ICU | 55.1 |
Elasticsearch | 5.4.3 |
Lua | 5.1.5 |
I have installed the master version of both CirrusSearch and Elastica, updated composer and LocalSettings.php. Now, when I make a search on my wiki I get this error:
[aa787554751705ac2246e772] /mediawiki/index.php?title=Special%3ASearch&search=kircher&go=Go Error from line 14 of /var/lib/mediawiki/extensions/CirrusSearch/includes/ServiceWiring.php: Class 'MediaWiki\Sparql\SparqlClient' not found
Backtrace:
#0 [internal function]: MediaWiki\Services\ServiceContainer->{closure}(MediaWiki\MediaWikiServices)
#1 /var/lib/mediawiki/includes/services/ServiceContainer.php(360): call_user_func_array(Closure, array)
#2 /var/lib/mediawiki/includes/services/ServiceContainer.php(344): MediaWiki\Services\ServiceContainer->createService(string)
#3 /var/lib/mediawiki/extensions/CirrusSearch/includes/Parser/FullTextKeywordRegistry.php(77): MediaWiki\Services\ServiceContainer->getService(string)
#4 /var/lib/mediawiki/extensions/CirrusSearch/includes/Searcher.php(276): CirrusSearch\Parser\FullTextKeywordRegistry->__construct(CirrusSearch\SearchConfig)
#5 /var/lib/mediawiki/extensions/CirrusSearch/includes/Searcher.php(318): CirrusSearch\Searcher->buildFullTextSearch(string, boolean)
#6 /var/lib/mediawiki/extensions/CirrusSearch/includes/CirrusSearch.php(384): CirrusSearch\Searcher->searchText(string, boolean)
#7 /var/lib/mediawiki/extensions/CirrusSearch/includes/CirrusSearch.php(175): CirrusSearch->searchTextReal(string, CirrusSearch\SearchConfig)
#8 /var/lib/mediawiki/includes/specials/SpecialSearch.php(319): CirrusSearch->searchText(string)
#9 /var/lib/mediawiki/includes/specials/SpecialSearch.php(185): SpecialSearch->showResults(string)
#10 /var/lib/mediawiki/includes/specialpage/SpecialPage.php(522): SpecialSearch->execute(NULL)
#11 /var/lib/mediawiki/includes/specialpage/SpecialPageFactory.php(578): SpecialPage->run(NULL)
#12 /var/lib/mediawiki/includes/MediaWiki.php(287): SpecialPageFactory::executePath(Title, RequestContext)
#13 /var/lib/mediawiki/includes/MediaWiki.php(851): MediaWiki->performRequest()
#14 /var/lib/mediawiki/includes/MediaWiki.php(523): MediaWiki->main()
#15 /var/lib/mediawiki/index.php(43): MediaWiki->run()
#16 {main}
It seems there is an issue with the mediawiki SPARQL client; I have read something about this in a recent discussion, but it doesn't help me. Any ideas about how to solve this issue?
Thanks,
Lorenzo Loman87 (talk) 11:30, 14 March 2018 (UTC)
- Are you sure that you are using CirrusSearch on REL1_30? SparqlClient was added in 1.31 in both core and CirrusSearch. DCausse (WMF) (talk) 09:51, 15 March 2018 (UTC)
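One way to double-check which version a checkout actually is (a sketch assuming the extension was fetched with git rather than as a tarball):
cd extensions/CirrusSearch
git rev-parse --abbrev-ref HEAD   # should print REL1_30 rather than master
git log -1 --oneline              # newest commit on that checkout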
- Hi,
- thanks for your answer. I downloaded CirrusSearch using the extension distributor, so I guess it is the right version. Anyway I will do some other attempts and see what happens... Loman87 (talk) 13:03, 21 March 2018 (UTC)
number_format_exception: For input string: "0,7" (solved)
RESOLVED
Bug in cirrus: T189877, will be backported to 1.30 soon; see the thread for a workaround.
The following discussion is closed. Please do not modify it. Subsequent comments should be made on the appropriate discussion page. No further edits should be made to this discussion.
Hi! I have a trouble after upgrade my MW to 1.30.
Search backend error during prefix search for 'search query here' after 3: number_format_exception: For input string: "0,7"
I deleted the indices in elasticsearch, then created them again following the instructions, but I still have this error :(
Need help :(
Product | Version |
---|---|
MediaWiki | 1.30.0 |
PHP | 7.1.14 (fpm-fcgi) |
MariaDB | 10.1.31-MariaDB-1~xenial |
Elasticsearch | 5.3.3 |
- Or we can receive
- Search backend error during full_text search for 'gdfg' after 2: number_format_exception: For input string: "0,5"
- Why does it happen? StRiANON (talk) 11:42, 24 March 2018 (UTC)
- Have you made any change to the CirrusSearch configuration? I wonder if there are some weights passed to elastic that use a comma instead of a period as the decimal separator.
- One way to help us determine if the problem is related to number format would be to paste your config (it can be dumped using api.php?action=cirrus-config-dump).
- If the error happens for fulltext search, could you also paste the output of the search result page, adding &cirrusDumpQuery to the search URL.
- Thanks! DCausse (WMF) (talk) 13:54, 24 March 2018 (UTC)
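For reference, both dumps can also be fetched from the command line; the wiki URL below is only a placeholder:
# dump the effective CirrusSearch configuration
curl -s 'https://wiki.example.org/w/api.php?action=cirrus-config-dump&format=json'
# dump the elasticsearch query built for a fulltext search
curl -s 'https://wiki.example.org/w/index.php?search=gdfg&fulltext=1&cirrusDumpQuery'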
- You can see it here
- I didn't change CirrusSearch params, and for elasticsearch added only a few general rules:
script.inline: true
script.stored: true
action.auto_create_index: false
- And the fulltext error's result: https://pastebin.com/sT3Vydx7 StRiANON (talk) 14:14, 24 March 2018 (UTC)
- While your config seems sane, I see a weight with a comma in the fulltext query:
"weight": "0,2"
- This might cause issues on the elastic side. I suspect a bug in cirrus or some underlying library that transforms this weight to a string using the system locale.
- Out of curiosity: is your system using a LOCALE set to something that uses a comma for the decimal separator?
- A quick workaround would be to set:
$wgCirrusSearchDefaultNamespaceWeight = 1;
$wgCirrusSearchTalkNamespaceWeight = 1;
- So that we stick to non-decimal numbers.
- I may have found the culprit in Cirrus code, I'll followup there with a fix.
- Thanks for your report. DCausse (WMF) (talk) 14:35, 24 March 2018 (UTC)
- Finally detected this trouble. Thanks for the idea about the locale. The problem was in $wgShellLocale, whose behavior changed in 1.30: it now affects LC_ALL instead of LC_CTYPE as previously, so the decimal separator now leaks into scripts. I just removed this param and now all is ok. StRiANON (talk) 17:40, 24 March 2018 (UTC)
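For anyone hitting this later: PHP's float-to-string conversion honored the process locale until PHP 8.0 changed this, so under a comma-decimal locale a weight of 0.2 is serialized as "0,2". A minimal demonstration from a shell, assuming a comma-decimal locale such as de_DE.UTF-8 is installed:
php -r 'setlocale(LC_ALL, "de_DE.UTF-8"); echo 0.2;'   # prints 0,2
php -r 'setlocale(LC_ALL, "C"); echo 0.2;'             # prints 0.2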
- No, my locale is en_US.UTF-8, checked with printf - it uses a dot.
- Unfortunately, the solution didn't help :( I added these two params, then deleted and created the indices again - still the same error.
- Then I added more rules for search weights:
- And again deleted and created the indices. And still have this trouble - look here o.O StRiANON (talk) 16:41, 24 March 2018 (UTC)
$wgCirrusSearchDefaultNamespaceWeight = 1;
$wgCirrusSearchTalkNamespaceWeight = 1;
$wgCirrusSearchWeights = [
	'title' => 20,
	'redirect' => 15,
	'category' => 8,
	'heading' => 5,
	'opening_text' => 3,
	'text' => 1,
	'auxiliary_text' => 1,
	'file_text' => 1,
];
$wgCirrusSearchPrefixWeights = [
	'title' => 10,
	'redirect' => 1,
	'title_asciifolding' => 7,
	'redirect_asciifolding' => 1,
];
Support for Elasticsearch 6.x.x?
Hi there,
I am running a modern instance of Elasticsearch, specifically, 6.2.3. I noticed that only 5.x.x versions of Elasticsearch are supported with this extension. Are there any plans to bring the extension up-to-date with the new generation of Elastic?
Cheers! TorontonianOnlines (talk) 19:41, 26 March 2018 (UTC)
- The docu says "MediaWiki 1.31.x requires ElasticSearch 5.5+." I read this in a way that 6.x.x will be supported via CirrusSearch for MW 1.31 which is due end of May. Keeping fingers crossed. [[kgh]] (talk) 20:14, 26 March 2018 (UTC)
- That would be wonderful news! As is, I have been informed I am not allowed to use Cirrus at my org. TorontonianOnlines (talk) 20:23, 26 March 2018 (UTC)
- Elastic does not guarantee compatibility between major versions. In fact it's nearly impossible for us to support multiple major versions of elastic (there are too many breaking changes).
- So I'm sorry to say that no, MW 1.31 won't support elastic 6.x :(
- Back to the original question: yes, we have plans to upgrade to elastic 6.x, but the timeline is not yet very precise. DCausse (WMF) (talk) 08:56, 27 March 2018 (UTC)
- Thanks for clarifying. Apparently I was in high hopes for MW 1.31+ because of 6.x. :| I just fixed the docu. [[kgh]] (talk) 09:04, 27 March 2018 (UTC)
updateSearchIndexConfig.php ( Elastic Search version 5.3 ) and MW 1.30
RESOLVED
Bug in cirrus, see T191493.
The following discussion is closed. Please do not modify it. Subsequent comments should be made on the appropriate discussion page. No further edits should be made to this discussion.
We have been trying to execute updateSearchIndexConfig.php and it fails with the following:
php updateSearchIndexConfig.php
content index...
Fetching Elasticsearch version...5.3.2...ok
Scanning available plugins...PHP Warning: Invalid argument supplied for foreach() in /opt/bitnami/apps/mediawiki/htdocs/extensions/CirrusSearch/includes/Maintenance/ConfigUtils.php on line 130
Warning: Invalid argument supplied for foreach() in /opt/bitnami/apps/mediawiki/htdocs/extensions/CirrusSearch/includes/Maintenance/ConfigUtils.php on line 130
PHP Warning: Invalid argument supplied for foreach() in /opt/bitnami/apps/mediawiki/htdocs/extensions/CirrusSearch/includes/Maintenance/ConfigUtils.php on line 130
Warning: Invalid argument supplied for foreach() in /opt/bitnami/apps/mediawiki/htdocs/extensions/CirrusSearch/includes/Maintenance/ConfigUtils.php on line 130
PHP Warning: Invalid argument supplied for foreach() in /opt/bitnami/apps/mediawiki/htdocs/extensions/CirrusSearch/includes/Maintenance/ConfigUtils.php on line 130
Warning: Invalid argument supplied for foreach() in /opt/bitnami/apps/mediawiki/htdocs/extensions/CirrusSearch/includes/Maintenance/ConfigUtils.php on line 130
none
Inferring index identifier...bitnami_mediawiki_content_first
Picking analyzer...english
Validating number of shards...ok
Validating replica range...ok
Validating shard allocation settings...done
Validating max shards per node...ok
Validating analyzers...ok
Validating mappings...
Validating mapping...ok
Validating aliases...
Validating bitnami_mediawiki_content alias...ok
Validating bitnami_mediawiki alias...ok
Updating tracking indexes...done
general index...
Fetching Elasticsearch version...5.3.2...ok
Scanning available plugins...PHP Warning: Invalid argument supplied for foreach() in /opt/bitnami/apps/mediawiki/htdocs/extensions/CirrusSearch/includes/Maintenance/ConfigUtils.php on line 130
Warning: Invalid argument supplied for foreach() in /opt/bitnami/apps/mediawiki/htdocs/extensions/CirrusSearch/includes/Maintenance/ConfigUtils.php on line 130
PHP Warning: Invalid argument supplied for foreach() in /opt/bitnami/apps/mediawiki/htdocs/extensions/CirrusSearch/includes/Maintenance/ConfigUtils.php on line 130
Warning: Invalid argument supplied for foreach() in /opt/bitnami/apps/mediawiki/htdocs/extensions/CirrusSearch/includes/Maintenance/ConfigUtils.php on line 130
PHP Warning: Invalid argument supplied for foreach() in /opt/bitnami/apps/mediawiki/htdocs/extensions/CirrusSearch/includes/Maintenance/ConfigUtils.php on line 130
Warning: Invalid argument supplied for foreach() in /opt/bitnami/apps/mediawiki/htdocs/extensions/CirrusSearch/includes/Maintenance/ConfigUtils.php on line 130
none
Inferring index identifier...bitnami_mediawiki_general_first
Picking analyzer...english
Validating number of shards...ok
Validating replica range...ok
Validating shard allocation settings...done
Validating max shards per node...ok
Validating analyzers...ok
Validating mappings...
Validating mapping...ok
Validating aliases...
Validating bitnami_mediawiki_general alias...ok
Validating bitnami_mediawiki alias...ok
Updating tracking indexes...done
Deleting namespaces...done
Indexing namespaces...done 164.144.248.26 (talk) 15:45, 3 April 2018 (UTC)
- If this is possible for you, could you paste the output of the command:
curl -s localhost:9200/_nodes?pretty
- replace localhost with the hostname of one node of your elasticsearch cluster.
- Thanks. DCausse (WMF) (talk) 08:37, 4 April 2018 (UTC)
- $ curl -s vpc-np-es-psd.us-east-1.ps.amazonaws.com:80/_nodes?pretty
- {
- "_nodes" : {
- "total" : 3,
- "successful" : 3,
- "failed" : 0
- },
- "cluster_name" : "265365382492:haystack-np-es",
- "nodes" : {
- "mpy2CEk2TLWnBUGbSExtJA" : {
- "name" : "mpy2CEk",
- "version" : "5.3.2",
- "build_hash" : "Unknown",
- "total_indexing_buffer" : 427753472,
- "roles" : [ "master", "data", "ingest" ],
- "os" : {
- "refresh_interval_in_millis" : 1000,
- "available_processors" : 2,
- "allocated_processors" : 2
- },
- "process" : {
- "refresh_interval_in_millis" : 1000,
- "id" : 9734,
- "mlockall" : true
- },
- "jvm" : {
- "pid" : 9734,
- "start_time_in_millis" : 1522089530084,
- "mem" : {
- "heap_init_in_bytes" : 4294967296,
- "heap_max_in_bytes" : 4277534720,
- "non_heap_init_in_bytes" : 2555904,
- "non_heap_max_in_bytes" : 0,
- "direct_max_in_bytes" : 4277534720
- },
- "using_compressed_ordinary_object_pointers" : "true"
- },
- "thread_pool" : {
- "force_merge" : {
- "type" : "fixed",
- "min" : 1,
- "max" : 1,
- "queue_size" : -1
- },
- "fetch_shard_started" : {
- "type" : "scaling",
- "min" : 1,
- "max" : 4,
- "keep_alive" : "5m",
- "queue_size" : -1
- },
- "listener" : {
- "type" : "fixed",
- "min" : 1,
- "max" : 1,
- "queue_size" : -1
- },
- "index" : {
- "type" : "fixed",
- "min" : 2,
- "max" : 2,
- "queue_size" : 200
- },
- "refresh" : {
- "type" : "scaling",
- "min" : 1,
- "max" : 1,
- "keep_alive" : "5m",
- "queue_size" : -1
- },
- "generic" : {
- "type" : "scaling",
- "min" : 4,
- "max" : 128,
- "keep_alive" : "30s",
- "queue_size" : -1
- },
- "warmer" : {
- "type" : "scaling",
- "min" : 1,
- "max" : 1,
- "keep_alive" : "5m",
- "queue_size" : -1
- },
- "search" : {
- "type" : "fixed",
- "min" : 4,
- "max" : 4,
- "queue_size" : 1000
- },
- "flush" : {
- "type" : "scaling",
- "min" : 1,
- "max" : 1,
- "keep_alive" : "5m",
- "queue_size" : -1
- },
- "fetch_shard_store" : {
- "type" : "scaling",
- "min" : 1,
- "max" : 4,
- "keep_alive" : "5m",
- "queue_size" : -1
- },
- "management" : {
- "type" : "scaling",
- "min" : 1,
- "max" : 5,
- "keep_alive" : "5m",
- "queue_size" : -1
- },
- "get" : {
- "type" : "fixed",
- "min" : 2,
- "max" : 2,
- "queue_size" : 1000
- },
- "bulk" : {
- "type" : "fixed",
- "min" : 2,
- "max" : 2,
- "queue_size" : 200
- },
- "snapshot" : {
- "type" : "scaling",
- "min" : 1,
- "max" : 1,
- "keep_alive" : "5m",
- "queue_size" : -1
- }
- },
- "modules" : [ {
- "name" : "aggs-matrix-stats",
- "version" : "5.3.2",
- "description" : "Adds aggregations whose input are a list of numeric fields and output includes a matrix.",
- "classname" : "org.elasticsearch.search.aggregations.matrix.MatrixAggregationPlugin"
- }, {
- "name" : "ingest-common",
- "version" : "5.3.2",
- "description" : "Module for ingest processors that do not require additional security permissions or have large dependencies and resources",
- "classname" : "org.elasticsearch.ingest.common.IngestCommonPlugin"
- }, {
- "name" : "lang-expression",
- "version" : "5.3.2",
- "description" : "Lucene expressions integration for Elasticsearch",
- "classname" : "org.elasticsearch.script.expression.ExpressionPlugin"
- }, {
- "name" : "lang-mustache",
- "version" : "5.3.2",
- "description" : "Mustache scripting integration for Elasticsearch",
- "classname" : "org.elasticsearch.script.mustache.MustachePlugin"
- }, {
- "name" : "lang-painless",
- "version" : "5.3.2",
- "description" : "An easy, safe and fast scripting language for Elasticsearch",
- "classname" : "org.elasticsearch.painless.PainlessPlugin"
- }, {
- "name" : "percolator",
- "version" : "5.3.2",
- "description" : "Percolator module adds capability to index queries and query these queries by specifying documents",
- "classname" : "org.elasticsearch.percolator.PercolatorPlugin"
- }, {
- "name" : "reindex",
- "version" : "5.3.2",
- "description" : "The Reindex module adds APIs to reindex from one index to another or update documents in place.",
- "classname" : "org.elasticsearch.index.reindex.ReindexPlugin"
- }, {
- "name" : "transport-netty3",
- "version" : "5.3.2",
- "description" : "Netty 3 based transport implementation",
- "classname" : "org.elasticsearch.transport.Netty3Plugin"
- }, {
- "name" : "transport-netty4",
- "version" : "5.3.2",
- "description" : "Netty 4 based transport implementation",
- "classname" : "org.elasticsearch.transport.Netty4Plugin"
- } ],
- "ingest" : {
- "processors" : [ {
- "type" : "append"
- }, {
- "type" : "attachment"
- }, {
- "type" : "convert"
- }, {
- "type" : "date"
- }, {
- "type" : "date_index_name"
- }, {
- "type" : "dot_expander"
- }, {
- "type" : "fail"
- }, {
- "type" : "foreach"
- }, {
- "type" : "grok"
- }, {
- "type" : "gsub"
- }, {
- "type" : "join"
- }, {
- "type" : "json"
- }, {
- "type" : "kv"
- }, {
- "type" : "lowercase"
- }, {
- "type" : "remove"
- }, {
- "type" : "rename"
- }, {
- "type" : "script"
- }, {
- "type" : "set"
- }, {
- "type" : "sort"
- }, {
- "type" : "split"
- }, {
- "type" : "trim"
- }, {
- "type" : "uppercase"
- }, {
- "type" : "user_agent"
- } ]
- }
- },
- "LLLQg4hgTtu2FEmnV_inTA" : {
- "name" : "LLLQg4h",
- "version" : "5.3.2",
- "build_hash" : "Unknown",
- "total_indexing_buffer" : 427753472,
- "roles" : [ "master", "data", "ingest" ],
- "os" : {
- "refresh_interval_in_millis" : 1000,
- "available_processors" : 2,
- "allocated_processors" : 2
- },
- "process" : {
- "refresh_interval_in_millis" : 1000,
- "id" : 9842,
- "mlockall" : true
- },
- "jvm" : {
- "pid" : 9842,
- "start_time_in_millis" : 1522089504901,
- "mem" : {
- "heap_init_in_bytes" : 4294967296,
- "heap_max_in_bytes" : 4277534720,
- "non_heap_init_in_bytes" : 2555904,
- "non_heap_max_in_bytes" : 0,
- "direct_max_in_bytes" : 4277534720
- },
- "using_compressed_ordinary_object_pointers" : "true"
- },
- "thread_pool" : {
- "force_merge" : {
- "type" : "fixed",
- "min" : 1,
- "max" : 1,
- "queue_size" : -1
- },
- "fetch_shard_started" : {
- "type" : "scaling",
- "min" : 1,
- "max" : 4,
- "keep_alive" : "5m",
- "queue_size" : -1
- },
- "listener" : {
- "type" : "fixed",
- "min" : 1,
- "max" : 1,
- "queue_size" : -1
- },
- "index" : {
- "type" : "fixed",
- "min" : 2,
- "max" : 2,
- "queue_size" : 200
- },
- "refresh" : {
- "type" : "scaling",
- "min" : 1,
- "max" : 1,
- "keep_alive" : "5m",
- "queue_size" : -1
- },
- "generic" : {
- "type" : "scaling",
- "min" : 4,
- "max" : 128,
- "keep_alive" : "30s",
- "queue_size" : -1
- },
- "warmer" : {
- "type" : "scaling",
- "min" : 1,
- "max" : 1,
- "keep_alive" : "5m",
- "queue_size" : -1
- },
- "search" : {
- "type" : "fixed",
- "min" : 4,
- "max" : 4,
- "queue_size" : 1000
- },
- "flush" : {
- "type" : "scaling",
- "min" : 1,
- "max" : 1,
- "keep_alive" : "5m",
- "queue_size" : -1
- },
- "fetch_shard_store" : {
- "type" : "scaling",
- "min" : 1,
- "max" : 4,
- "keep_alive" : "5m",
- "queue_size" : -1
- },
- "management" : {
- "type" : "scaling",
- "min" : 1,
- "max" : 5,
- "keep_alive" : "5m",
- "queue_size" : -1
- },
- "get" : {
- "type" : "fixed",
- "min" : 2,
- "max" : 2,
- "queue_size" : 1000
- },
- "bulk" : {
- "type" : "fixed",
- "min" : 2,
- "max" : 2,
- "queue_size" : 200
- },
- "snapshot" : {
- "type" : "scaling",
- "min" : 1,
- "max" : 1,
- "keep_alive" : "5m",
- "queue_size" : -1
- }
- },
- "modules" : [ {
- "name" : "aggs-matrix-stats",
- "version" : "5.3.2",
- "description" : "Adds aggregations whose input are a list of numeric fields and output includes a matrix.",
- "classname" : "org.elasticsearch.search.aggregations.matrix.MatrixAggregationPlugin"
- }, {
- "name" : "ingest-common",
- "version" : "5.3.2",
- "description" : "Module for ingest processors that do not require additional security permissions or have large dependencies and resources",
- "classname" : "org.elasticsearch.ingest.common.IngestCommonPlugin"
- }, {
- "name" : "lang-expression",
- "version" : "5.3.2",
- "description" : "Lucene expressions integration for Elasticsearch",
- "classname" : "org.elasticsearch.script.expression.ExpressionPlugin"
- }, {
- "name" : "lang-mustache",
- "version" : "5.3.2",
- "description" : "Mustache scripting integration for Elasticsearch",
- "classname" : "org.elasticsearch.script.mustache.MustachePlugin"
- }, {
- "name" : "lang-painless",
- "version" : "5.3.2",
- "description" : "An easy, safe and fast scripting language for Elasticsearch",
- "classname" : "org.elasticsearch.painless.PainlessPlugin"
- }, {
- "name" : "percolator",
- "version" : "5.3.2",
- "description" : "Percolator module adds capability to index queries and query these queries by specifying documents",
- "classname" : "org.elasticsearch.percolator.PercolatorPlugin"
- }, {
- "name" : "reindex",
- "version" : "5.3.2",
- "description" : "The Reindex module adds APIs to reindex from one index to another or update documents in place.",
- "classname" : "org.elasticsearch.index.reindex.ReindexPlugin"
- }, {
- "name" : "transport-netty3",
- "version" : "5.3.2",
- "description" : "Netty 3 based transport implementation",
- "classname" : "org.elasticsearch.transport.Netty3Plugin"
- }, {
- "name" : "transport-netty4",
- "version" : "5.3.2",
- "description" : "Netty 4 based transport implementation",
- "classname" : "org.elasticsearch.transport.Netty4Plugin"
- } ],
- "ingest" : {
- "processors" : [ {
- "type" : "append"
- }, {
- "type" : "attachment"
- }, {
- "type" : "convert"
- }, {
- "type" : "date"
- }, {
- "type" : "date_index_name"
- }, {
- "type" : "dot_expander"
- }, {
- "type" : "fail"
- }, {
- "type" : "foreach"
- }, {
- "type" : "grok"
- }, {
- "type" : "gsub"
- }, {
- "type" : "join"
- }, {
- "type" : "json"
- }, {
- "type" : "kv"
- }, {
- "type" : "lowercase"
- }, {
- "type" : "remove"
- }, {
- "type" : "rename"
- }, {
- "type" : "script"
- }, {
- "type" : "set"
- }, {
- "type" : "sort"
- }, {
- "type" : "split"
- }, {
- "type" : "trim"
- }, {
- "type" : "uppercase"
- }, {
- "type" : "user_agent"
- } ]
- }
- },
- "Zjmu23EvSkmxx2Bp2D6Tpw" : {
- "name" : "Zjmu23E",
- "version" : "5.3.2",
- "build_hash" : "Unknown",
- "total_indexing_buffer" : 427753472,
- "roles" : [ "master", "data", "ingest" ],
- "os" : {
- "refresh_interval_in_millis" : 1000,
- "available_processors" : 2,
- "allocated_processors" : 2
- },
- "process" : {
- "refresh_interval_in_millis" : 1000,
- "id" : 9785,
- "mlockall" : true
- },
- "jvm" : {
- "pid" : 9785,
- "start_time_in_millis" : 1522089516908,
- "mem" : {
- "heap_init_in_bytes" : 4294967296,
- "heap_max_in_bytes" : 4277534720,
- "non_heap_init_in_bytes" : 2555904,
- "non_heap_max_in_bytes" : 0,
- "direct_max_in_bytes" : 4277534720
- },
- "using_compressed_ordinary_object_pointers" : "true"
- },
- "thread_pool" : {
- "force_merge" : {
- "type" : "fixed",
- "min" : 1,
- "max" : 1,
- "queue_size" : -1
- },
- "fetch_shard_started" : {
- "type" : "scaling",
- "min" : 1,
- "max" : 4,
- "keep_alive" : "5m",
- "queue_size" : -1
- },
- "listener" : {
- "type" : "fixed",
- "min" : 1,
- "max" : 1,
- "queue_size" : -1
- },
- "index" : {
- "type" : "fixed",
- "min" : 2,
- "max" : 2,
- "queue_size" : 200
- },
- "refresh" : {
- "type" : "scaling",
- "min" : 1,
- "max" : 1,
- "keep_alive" : "5m",
- "queue_size" : -1
- },
- "generic" : {
- "type" : "scaling",
- "min" : 4,
- "max" : 128,
- "keep_alive" : "30s",
- "queue_size" : -1
- },
- "warmer" : {
- "type" : "scaling",
- "min" : 1,
- "max" : 1,
- "keep_alive" : "5m",
- "queue_size" : -1
- },
- "search" : {
- "type" : "fixed",
- "min" : 4,
- "max" : 4,
- "queue_size" : 1000
- },
- "flush" : {
- "type" : "scaling",
- "min" : 1,
- "max" : 1,
- "keep_alive" : "5m",
- "queue_size" : -1
- },
- "fetch_shard_store" : {
- "type" : "scaling",
- "min" : 1,
- "max" : 4,
- "keep_alive" : "5m",
- "queue_size" : -1
- },
- "management" : {
- "type" : "scaling",
- "min" : 1,
- "max" : 5,
- "keep_alive" : "5m",
- "queue_size" : -1
- },
- "get" : {
- "type" : "fixed",
- "min" : 2,
- "max" : 2,
- "queue_size" : 1000
- },
- "bulk" : {
- "type" : "fixed",
- "min" : 2,
- "max" : 2,
- "queue_size" : 200
- },
- "snapshot" : {
- "type" : "scaling",
- "min" : 1,
- "max" : 1,
- "keep_alive" : "5m",
- "queue_size" : -1
- }
- },
- "modules" : [ {
- "name" : "aggs-matrix-stats",
- "version" : "5.3.2",
- "description" : "Adds aggregations whose input are a list of numeric fields and output includes a matrix.",
- "classname" : "org.elasticsearch.search.aggregations.matrix.MatrixAggregationPlugin"
- }, {
- "name" : "ingest-common",
- "version" : "5.3.2",
- "description" : "Module for ingest processors that do not require additional security permissions or have large dependencies and resources",
- "classname" : "org.elasticsearch.ingest.common.IngestCommonPlugin"
- }, {
- "name" : "lang-expression",
- "version" : "5.3.2",
- "description" : "Lucene expressions integration for Elasticsearch",
- "classname" : "org.elasticsearch.script.expression.ExpressionPlugin"
- }, {
- "name" : "lang-mustache",
- "version" : "5.3.2",
- "description" : "Mustache scripting integration for Elasticsearch",
- "classname" : "org.elasticsearch.script.mustache.MustachePlugin"
- }, {
- "name" : "lang-painless",
- "version" : "5.3.2",
- "description" : "An easy, safe and fast scripting language for Elasticsearch",
- "classname" : "org.elasticsearch.painless.PainlessPlugin"
- }, {
- "name" : "percolator",
- "version" : "5.3.2",
- "description" : "Percolator module adds capability to index queries and query these queries by specifying documents",
- "classname" : "org.elasticsearch.percolator.PercolatorPlugin"
- }, {
- "name" : "reindex",
- "version" : "5.3.2",
- "description" : "The Reindex module adds APIs to reindex from one index to another or update documents in place.",
- "classname" : "org.elasticsearch.index.reindex.ReindexPlugin"
- }, {
- "name" : "transport-netty3",
- "version" : "5.3.2",
- "description" : "Netty 3 based transport implementation",
- "classname" : "org.elasticsearch.transport.Netty3Plugin"
- }, {
- "name" : "transport-netty4",
- "version" : "5.3.2",
- "description" : "Netty 4 based transport implementation",
- "classname" : "org.elasticsearch.transport.Netty4Plugin"
- } ],
- "ingest" : {
- "processors" : [ {
- "type" : "append"
- }, {
- "type" : "attachment"
- }, {
- "type" : "convert"
- }, {
- "type" : "date"
- }, {
- "type" : "date_index_name"
- }, {
- "type" : "dot_expander"
- }, {
- "type" : "fail"
- }, {
- "type" : "foreach"
- }, {
- "type" : "grok"
- }, {
- "type" : "gsub"
- }, {
- "type" : "join"
- }, {
- "type" : "json"
- }, {
- "type" : "kv"
- }, {
- "type" : "lowercase"
- }, {
- "type" : "remove"
- }, {
- "type" : "rename"
- }, {
- "type" : "script"
- }, {
- "type" : "set"
- }, {
- "type" : "sort"
- }, {
- "type" : "split"
- }, {
- "type" : "trim"
- }, {
- "type" : "uppercase"
- }, {
- "type" : "user_agent"
- } ]
- }
- }
- } 164.144.252.28 (talk) 17:35, 4 April 2018 (UTC)
- Thanks, the response does not include the plugins section and this confuses CirrusSearch. I'll create a task to fix this.
- Unless you discovered other problems this should not affect the behavior of Cirrus. DCausse (WMF) (talk) 08:02, 5 April 2018 (UTC)
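For reference, the plugins section can be queried on its own; against a stock cluster the sketch below (assuming localhost:9200) returns a plugins array for each node, which the AWS-hosted endpoint above omits:
curl -s localhost:9200/_nodes/plugins?pretty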
updateSearchIndexConfig.php ( Elastic Search version 5.3 ) and MW 1.30
/CirrusSearch/maintenance$ php updateSearchIndexConfig.php --reindexAndRemoveOk --indexIdentifier now
content index...
Fetching Elasticsearch version...5.3.2...ok
Setting index identifier...bitnami_mediawiki_content_1522961068
Picking analyzer...english
Creating index...⧼Custom Analyzer [plain] failed to find filter under name [preserve_original_recorder]⧽ 164.144.248.27 (talk) 20:48, 5 April 2018 (UTC)
- This error is more problematic, out of curiosity did you just install the analysis-icu plugin?
- I'll file a task since it seems that cirrus wrongly assumes that if the analysis-icu is installed it can use some features provided by another plugin (wmf search-extra).
- We will try to backport the fix to 1.30 but in the meantime a possible workaround would be to install the search-extra plugin by running:
./bin/elasticsearch-plugin install org.wikimedia.search:extra:5.3.2
- On your elasticsearch nodes. DCausse (WMF) (talk) 22:17, 5 April 2018 (UTC)
- We are using an Amazon Elasticsearch domain for this work; we have not installed the plugin and do not have control over the ES domain. Nagaindukuri (talk) 14:20, 6 April 2018 (UTC)
- I see that ICU is supported by amazon and having ICU could explain the issue.
- Could you try to force disable ICU by setting
$wgCirrusSearchUseIcuFolding = 'no';
in your wiki configuration and see if it fixes the issue? DCausse (WMF) (talk) 16:54, 6 April 2018 (UTC)
- maintenance$ php updateSearchIndexConfig.php --reindexAndRemoveOk --indexIdentifier=now
- content index...
- Fetching Elasticsearch version...5.3.2...ok
- Scanning available plugins...array(3) {
- Setting index identifier...bitnami_mediawiki_content_1523286431
- Picking analyzer...english
- Creating index...ok
- Validating number of shards...ok
- Validating replica range...ok
- Validating shard allocation settings...done
- Validating max shards per node...ok
- Validating analyzers...ok
- Validating mappings...
- Validating mapping...different...corrected
- Validating aliases...
- Validating bitnami_mediawiki_content alias...is taken...
- Reindexing...
- Unknown reindex failure: 403 164.144.248.29 (talk) 15:09, 9 April 2018 (UTC)
- You seem not to be allowed to use the /_reindex endpoint; could you double-check your cluster settings or check with Amazon support?
- The AWS environment seems to be very restrictive; one workaround for you would be not to use the in-place reindex (--reindexAndRemoveOk) and to always reindex your wiki using:
updateSearchIndexConfig.php --startOver
- then use the forceSearchIndex.php script you used during the initial setup. DCausse (WMF) (talk) 17:15, 9 April 2018 (UTC)
- content index...
- Fetching Elasticsearch version...5.3.2...ok
- Scanning available plugins...
- Inferring index identifier...error
- Looks like the index has more than one identifier. You should delete all
- but the one of them currently active. Here is the list: bitnami_mediawiki_content_1523286244,bitnami_mediawiki_content_1523286240,bitnami_mediawiki_content_first,bitnami_mediawiki_content_1523285364,bitnami_mediawiki_content_1523285326,bitnami_mediawiki_content_1523286431,bitnami_mediawiki_content_1523285727,bitnami_mediawiki_content_1523286151 164.144.248.29 (talk) 18:47, 9 April 2018 (UTC)
- After the above, we ran php forceSearchIndex.php --skipLinks --indexOnSkip and it is still in progress. 164.144.248.29 (talk) 18:58, 9 April 2018 (UTC)
- We are seeing that indexes are not being created after running updateSearchIndexConfig.php --startOver Nagaindukuri (talk) 20:30, 10 April 2018 (UTC)
- Inferring index identifier...bitnami_mediawiki_content_first
- Picking analyzer...english
- Blowing away index to start over...ok
- Validating number of shards...ok
- Validating replica range...ok
- Validating shard allocation settings...done
- Validating max shards per node...ok
- Validating analyzers...ok
- Validating mappings...
- Validating mapping...different...corrected
- Validating aliases...
- Validating bitnami_mediawiki_content alias...alias is free...corrected
- Validating bitnami_mediawiki alias...alias not already assigned to this index...corrected
- Updating tracking indexes...done
- general index...
- Fetching Elasticsearch version...5.3.2...ok
- Scanning available plugins...array(3) {
- Inferring index identifier...bitnami_mediawiki_general_first
- Picking analyzer...english
- Blowing away index to start over...ok
- Validating number of shards...ok
- Validating replica range...ok
- Validating shard allocation settings...done
- Validating max shards per node...ok
- Validating analyzers...ok
- Validating mappings...
- Validating mapping...different...corrected
- Validating aliases...
- Validating bitnami_mediawiki_general alias...alias is free...corrected
- Validating bitnami_mediawiki alias...alias not already assigned to this index...corrected
- Updating tracking indexes...done
- Deleting namespaces...done
- Indexing namespaces...done
- We are currently running forceSearchIndex.php and we will update you .... sorry about this. Nagaindukuri (talk) 20:45, 10 April 2018 (UTC)
Search fails with the return boolean ( Amazon Elastic Search version 5.3 ) MW 1.30
Is there a way to test the search when the backend is Elasticsearch and the search type is CirrusSearch?
[ff7caa56e7c311254efb8e81] /index.php?search=SAP Error from line 474 of /opt/bitnami/apps/mediawiki/htdocs/includes/specials/SpecialSearch.php: Call to a member function searchContainedSyntax() on boolean
Backtrace:
#0 /opt/bitnami/apps/mediawiki/htdocs/includes/specials/SpecialSearch.php(384): SpecialSearch->showCreateLink(Title, integer, NULL, boolean)
#1 /opt/bitnami/apps/mediawiki/htdocs/includes/specials/SpecialSearch.php(185): SpecialSearch->showResults(string)
#2 /opt/bitnami/apps/mediawiki/htdocs/includes/specialpage/SpecialPage.php(522): SpecialSearch->execute(NULL)
#3 /opt/bitnami/apps/mediawiki/htdocs/includes/specialpage/SpecialPageFactory.php(578): SpecialPage->run(NULL)
#4 /opt/bitnami/apps/mediawiki/htdocs/includes/MediaWiki.php(287): SpecialPageFactory::executePath(Title, RequestContext)
#5 /opt/bitnami/apps/mediawiki/htdocs/includes/MediaWiki.php(851): MediaWiki->performRequest()
#6 /opt/bitnami/apps/mediawiki/htdocs/includes/MediaWiki.php(523): MediaWiki->main()
#7 /opt/bitnami/apps/mediawiki/htdocs/index.php(43): MediaWiki->run()
#8 {main} Nagaindukuri (talk) 14:27, 10 April 2018 (UTC)
- https://www.myproject.com/Main_Page?search=biw&title=Special:Search&profile=default&fulltext=1&cirrusDumpResult
- false Nagaindukuri (talk) 16:41, 19 April 2018 (UTC)
CirrusSearch - Special search fails even after creating index
AWS ES - 5.3.2
Bitnami Media wiki 1.30 and Cirrus search extension.
Internal error
[04e462859a269b1b57b048c5] /index.php?search=%22Hary%22 Error from line 474 of /opt/bitnami/apps/mediawiki/htdocs/includes/specials/SpecialSearch.php: Call to a member function searchContainedSyntax() on boolean
Backtrace:
#0 /opt/bitnami/apps/mediawiki/htdocs/includes/specials/SpecialSearch.php(384): SpecialSearch->showCreateLink(Title, integer, NULL, boolean)
#1 /opt/bitnami/apps/mediawiki/htdocs/includes/specials/SpecialSearch.php(185): SpecialSearch->showResults(string)
#2 /opt/bitnami/apps/mediawiki/htdocs/includes/specialpage/SpecialPage.php(522): SpecialSearch->execute(NULL)
#3 /opt/bitnami/apps/mediawiki/htdocs/includes/specialpage/SpecialPageFactory.php(578): SpecialPage->run(NULL)
#4 /opt/bitnami/apps/mediawiki/htdocs/includes/MediaWiki.php(287): SpecialPageFactory::executePath(Title, RequestContext)
#5 /opt/bitnami/apps/mediawiki/htdocs/includes/MediaWiki.php(851): MediaWiki->performRequest()
#6 /opt/bitnami/apps/mediawiki/htdocs/includes/MediaWiki.php(523): MediaWiki->main()
#7 /opt/bitnami/apps/mediawiki/htdocs/index.php(43): MediaWiki->run()
#8 {main} Nagaindukuri (talk) 18:49, 11 April 2018 (UTC)
- It is hard to tell what is happening behind this error.
- Would it be possible for you to investigate further using the debug options provided by CirrusSearch? Appending the following URI params to the search request URI will:
- output the elasticsearch result: &cirrusDumpResult
- output the elasticsearch query sent: &cirrusDumpQuery
- I'd also suggest trying to read the various logs you have access to such as mediawiki logs and elasticsearch logs, they may provide clearer information that would help to debug your issue. DCausse (WMF) (talk) 08:36, 12 April 2018 (UTC)
- We did a query dump and a result dump for Cirrus; here is the output below.
- {
- "description": "full_text search for 'breeding'",
- "path": "bitnami_mediawiki\/page\/_search",
- "params": {
- "timeout": "20s",
- "search_type": "dfs_query_then_fetch"
- },
- "query": {
- "_source": [
- "namespace",
- "title",
- "namespace_text",
- "wiki",
- "redirect.*",
- "timestamp",
- "text_bytes"
- ],
- "stored_fields": [
- "text.word_count"
- ],
- "query": {
- "bool": {
- "minimum_should_match": 1,
- "should": [
- {
- "query_string": {
- "query": "breeding",
- "fields": [
- "all.plain^1",
- "all^0.5"
- ],
- "auto_generate_phrase_queries": true,
- "phrase_slop": 0,
- "default_operator": "AND",
- "allow_leading_wildcard": true,
- "fuzzy_prefix_length": 2,
- "rewrite": "top_terms_boost_1024"
- }
- },
- {
- "multi_match": {
- "fields": [
- "all_near_match^2"
- ],
- "query": "breeding"
- }
- }
- ],
- "filter": [
- {
- "terms": {
- "namespace": [
- 0,
- 1,
- 2,
- 3,
- 4,
- 5,
- 6,
- 7,
- 8,
- 9,
- 10,
- 11,
- 12,
- 13,
- 14,
- 15,
- 3000,
- 3001
- ]
- }
- }
- ]
- }
- },
- "highlight": {
- "pre_tags": [
- "<span class=\"searchmatch\">"
- ],
- "post_tags": [
- "<\/span>"
- ],
- "fields": {
- "title": {
- "number_of_fragments": 0,
- "type": "fvh",
- "order": "score",
- "matched_fields": [
- "title",
- "title.plain"
- ]
- },
- "redirect.title": {
- "number_of_fragments": 1,
- "fragment_size": 10000,
- "type": "fvh",
- "order": "score",
- "matched_fields": [
- "redirect.title",
- "redirect.title.plain"
- ]
- },
- "category": {
- "number_of_fragments": 1,
- "fragment_size": 10000,
- "type": "fvh",
- "order": "score",
- "matched_fields": [
- "category",
- "category.plain"
- ]
- },
- "heading": {
- "number_of_fragments": 1,
- "fragment_size": 10000,
- "type": "fvh",
- "order": "score",
- "matched_fields": [
- "heading",
- "heading.plain"
- ]
- },
- "text": {
- "number_of_fragments": 1,
- "fragment_size": 150,
- "type": "fvh",
- "order": "score",
- "no_match_size": 150,
- "matched_fields": [
- "text",
- "text.plain"
- ]
- },
- "auxiliary_text": {
- "number_of_fragments": 1,
- "fragment_size": 150,
- "type": "fvh",
- "order": "score",
- "matched_fields": [
- "auxiliary_text",
- "auxiliary_text.plain"
- ]
- },
- "file_text": {
- "number_of_fragments": 1,
- "fragment_size": 150,
- "type": "fvh",
- "order": "score",
- "matched_fields": [
- "file_text",
- "file_text.plain"
- ]
- }
- },
- "highlight_query": {
- "query_string": {
- "query": "breeding",
- "fields": [
- "title.plain^20",
- "redirect.title.plain^15",
- "category.plain^8",
- "heading.plain^5",
- "opening_text.plain^3",
- "text.plain^1",
- "auxiliary_text.plain^0.5",
- "file_text.plain^0.5",
- "title^10",
- "redirect.title^7.5",
- "category^4",
- "heading^2.5",
- "opening_text^1.5",
- "text^0.5",
- "auxiliary_text^0.25",
- "file_text^0.25"
- ],
- "auto_generate_phrase_queries": true,
- "phrase_slop": 1,
- "default_operator": "AND",
- "allow_leading_wildcard": true,
- "fuzzy_prefix_length": 2,
- "rewrite": "top_terms_boost_1024"
- }
- }
- },
- "suggest": {
- "text": "breeding",
- "suggest": {
- "phrase": {
- "field": "suggest",
- "size": 1,
- "max_errors": 2,
- "confidence": 2,
- "real_word_error_likelihood": 0.95,
- "direct_generator": [
- {
- "field": "suggest",
- "suggest_mode": "always",
- "max_term_freq": 0.5,
- "min_doc_freq": 0,
- "prefix_length": 2
- }
- ],
- "highlight": {
- "pre_tag": "<em>",
- "post_tag": "<\/em>"
- },
- "smoothing": {
- "stupid_backoff": {
- "discount": 0.4
- }
- }
- }
- }
- },
- "stats": [
- "suggest",
- "full_text",
- "full_text_querystring"
- ],
- "size": 20,
- "rescore": [
- {
- "window_size": 8192,
- "query": {
- "query_weight": 1,
- "rescore_query_weight": 1,
- "score_mode": "multiply",
- "rescore_query": {
- "function_score": {
- "functions": [
- {
- "field_value_factor": {
- "field": "incoming_links",
- "modifier": "log2p",
- "missing": 0
- }
- },
- {
- "weight": 0.25,
- "filter": {
- "terms": {
- "namespace": [
- 1
- ]
- }
- }
- },
- {
- "weight": 0.05,
- "filter": {
- "terms": {
- "namespace": [
- 2,
- 7,
- 8,
- 15,
- 3001
- ]
- }
- }
- },
- {
- "weight": 0.0125,
- "filter": {
- "terms": {
- "namespace": [
- 3,
- 9
- ]
- }
- }
- },
- {
- "weight": 0.1,
- "filter": {
- "terms": {
- "namespace": [
- 4,
- 12
- ]
- }
- }
- },
- {
- "weight": 0.025,
- "filter": {
- "terms": {
- "namespace": [
- 5,
- 13
- ]
- }
- }
- },
- {
- "weight": 0.2,
- "filter": {
- "terms": {
- "namespace": [
- 6,
- 14,
- 3000
- ]
- }
- }
- },
- {
- "weight": 0.005,
- "filter": {
- "terms": {
- "namespace": [
- 10
- ]
- }
- }
- },
- {
- "weight": 0.00125,
- "filter": {
- "terms": {
- "namespace": [
- 11
- ]
- }
- }
- }
- ]
- }
- }
- }
- }
- ]
- },
- "options": {
- "timeout": "20s",
- "search_type": "dfs_query_then_fetch"
- }
- } Nagaindukuri (talk) 15:00, 13 April 2018 (UTC)
- I don't see the output from elastic; were you able to get it by adding &cirrusDumpResult?
- Have you been able to dig into the various logs of MediaWiki and Elasticsearch? DCausse (WMF) (talk) 07:35, 16 April 2018 (UTC)
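- For instance (placeholder hostname), the param is appended to the wiki's Special:Search URL, not to the Elasticsearch endpoint:
https://mywiki.example.org/index.php?title=Special:Search&search=breeding&fulltext=1&cirrusDumpResult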
- We did try that and it was of no help.
- bitnami@haystack-np:~$ curl -XGET 'https://aws-vpc-endpoint.es.amazonaws.com/_cat/indices?v
- > '
- curl: (3) Illegal characters found in URL
- bitnami@haystack-np:~$ curl -XGET 'https://aws-vpc-endpoint.es.amazonaws.com/_cat/indices?v'
- health status index uuid pri rep docs.count docs.deleted store.size pri.store.size
- green open mw_cirrus_metastore hJzCVBxfSYydggh-kMdviA 5 1 3 2 43.5kb 21.8kb
- green open .kibana fnMbVw4qRgmhONnEaeulMg 1 1 1 0 6.3kb 3.1kb
- green open bitnami_mediawiki_titlesuggest_1523285988 4Sx-yINMQROzxfZPzO97iQ 4 2 0 0 1.5kb 520b
- green open bitnami_mediawiki_content_first CpJo6tFERl6ahJZtAmCTHQ 4 2 0 0 1.5kb 520b
- green open bitnami_mediawiki_general_first SR2QSsH2R8-bHaQd7WS7cw 4 2 19 0 28.5kb 9.5kb
- green open exampleindex xa5hnV13SPuzFVBe__Tccg 5 1 1 0 12.8kb 6.4kb
- green open mw_cirrus_metastore_first yMPJqJ2fRf2ULU7GaFBiGw 1 2 1 0 10.7kb 3.5kb
- bitnami@haystack-np:~$ curl -XGET 'https://aws-vpc-endpoint.es.amazonaws.com/exampleindex/_search?q=user:cooper&pretty'
- {
- "took" : 1,
- "timed_out" : false,
- "_shards" : {
- "total" : 5,
- "successful" : 5,
- "failed" : 0
- },
- "hits" : {
- "total" : 0,
- "max_score" : null,
- "hits" : [ ]
- }
- }
- bitnami@haystack-np:~$ curl -XGET 'https://aws-vpc-endpoint.es.amazonaws.com/bitnami_mediawiki_general_first/_search?q=user:cooper&pretty'
- {
- "took" : 2,
- "timed_out" : false,
- "_shards" : {
- "total" : 4,
- "successful" : 4,
- "failed" : 0
- },
- "hits" : {
- "total" : 0,
- "max_score" : null,
- "hits" : [ ]
- }
- }
- bitnami@haystack-np:~$ curl -XGET 'https://aws-vpc-endpoint.es.amazonaws.com/bitnami_mediawiki_titlesuggest_1523285988/_search?q=user:cooper&pretty'
- {
- "took" : 9,
- "timed_out" : false,
- "_shards" : {
- "total" : 4,
- "successful" : 4,
- "failed" : 0
- },
- "hits" : {
- "total" : 0,
- "max_score" : null,
- "hits" : [ ]
- }
- }
- bitnami@haystack-np:~$ curl -XGET 'https://aws-vpc-endpoint.es.amazonaws.com/bitnami_mediawiki_content_first/_search?q=user:cooper&pretty'
- {
- "took" : 6,
- "timed_out" : false,
- "_shards" : {
- "total" : 4,
- "successful" : 4,
- "failed" : 0
- },
- "hits" : {
- "total" : 0,
- "max_score" : null,
- "hits" : [ ]
- }
- }
- bitnami@haystack-np:~$ curl -XGET 'https://aws-vpc-endpoint.es.amazonaws.com/mw_cirrus_metastore/_search?q=user:cooper&pretty'
- {
- "took" : 4,
- "timed_out" : false,
- "_shards" : {
- "total" : 5,
- "successful" : 5,
- "failed" : 0
- },
- "hits" : {
- "total" : 0,
- "max_score" : null,
- "hits" : [ ]
- }
- }
- bitnami@haystack-np:~$ curl -XGET 'https://aws-vpc-endpoint.es.amazonaws.com/bitnami_mediawiki_content_first/_search?q=user:cooper&cirrusDumpResult&pretty'
- {
- "error" : {
- "root_cause" : [
- {
- "type" : "illegal_argument_exception",
- "reason" : "request [/bitnami_mediawiki_content_first/_search] contains unrecognized parameter: [cirrusDumpResult]"
- }
- ],
- "type" : "illegal_argument_exception",
- "reason" : "request [/bitnami_mediawiki_content_first/_search] contains unrecognized parameter: [cirrusDumpResult]"
- },
- "status" : 400
- } Nagaindukuri (talk) 20:06, 16 April 2018 (UTC)
- Your indices seem to be empty, except for 19 docs in the general index for your wiki (bitnami_mediawiki). Have you tried searching on all namespaces to see if you can display one of these results from MediaWiki?
- Also note that cirrusDumpResult is a URI param for MediaWiki, not Elasticsearch.
curl -XGET 'https://aws-vpc-endpoint.es.amazonaws.com/bitnami_mediawiki_general/_search?pretty'
- should display a few of the results you seem to have indexed properly; use one of the words you see there to search on every namespace using the MediaWiki Special:Search.
- If you see some results, CirrusSearch is working. DCausse (WMF) (talk) 08:06, 17 April 2018 (UTC)
- Thank you, and yes, we did the testing; see the result below.
- https://www.myproject.com/Main_Page?search=biw&title=Special:Search&profile=default&fulltext=1
- [98eaa51221b461b76c67b7fa] /Main_Page?search=biw&title=Special:Search&profile=default&fulltext=1 Error from line 474 of /opt/bitnami/apps/mediawiki/htdocs/includes/specials/SpecialSearch.php: Call to a member function searchContainedSyntax() on boolean
- Backtrace:
- #0 /opt/bitnami/apps/mediawiki/htdocs/includes/specials/SpecialSearch.php(384): SpecialSearch->showCreateLink(Title, integer, NULL, boolean)
- #1 /opt/bitnami/apps/mediawiki/htdocs/includes/specials/SpecialSearch.php(185): SpecialSearch->showResults(string)
- #2 /opt/bitnami/apps/mediawiki/htdocs/includes/specialpage/SpecialPage.php(522): SpecialSearch->execute(NULL)
- #3 /opt/bitnami/apps/mediawiki/htdocs/includes/specialpage/SpecialPageFactory.php(578): SpecialPage->run(NULL)
- #4 /opt/bitnami/apps/mediawiki/htdocs/includes/MediaWiki.php(287): SpecialPageFactory::executePath(Title, RequestContext)
- #5 /opt/bitnami/apps/mediawiki/htdocs/includes/MediaWiki.php(851): MediaWiki->performRequest()
- #6 /opt/bitnami/apps/mediawiki/htdocs/includes/MediaWiki.php(523): MediaWiki->main()
- #7 /opt/bitnami/apps/mediawiki/htdocs/index.php(43): MediaWiki->run()
- #8 {main} 164.144.252.28 (talk) 16:38, 19 April 2018 (UTC)
- https://www.myproject.com/Main_Page?search=biw&title=Special:Search&profile=default&fulltext=1&cirrusDumpResult
- false Nagaindukuri (talk) 18:33, 19 April 2018 (UTC)
Suggestion: Expose and filter content by page views
Issue:
As a user, I'm interested in finding the most likely relevant pages based on page views.
Background
When searching for content such as media, there is no way to filter and choose images based on their "perceived" relevance. For example, if I want to find pictures of cats, chances are that commons has millions (https://www.mediawiki.org/w/index.php?title=Special:Search&profile=images&search=cat+filetype%3Aimage&fulltext=1&searchToken=ybda1dsf9dxkay60nn78qap2). It will also have a lot of irrelevant results that the user must sift through.
Other use cases
- This could be used to enhance the media insertion dialog using a better search parameter.
- Partly replace https://tools.wmflabs.org/massviews/
- Partly replace https://tools.wmflabs.org/mediaviews/?range=latest-20&files=
- Make it possible for editors to find popular topics to work on, e.g. with Special:LintErrors (and insource:) one could easily start working on the most visible pages
Proposed solution
- Add either a numerical count or a "bar" indicating page views / popularity.
- Add a keyword to filter and sort them, e.g.: "pageviews:>5000"
- 197.218.83.43 (talk) 09:59, 3 May 2018 (UTC)
Search finds files by filename but no content within PDFs
I've set up a MediaWiki 1.30.0 with CirrusSearch 0.2 and Elastica 1.3.0.0 as extensions, as well as PdfHandler.
The search itself in the wiki is working fine - finds text from wiki pages as well as filenames.
But my main goal is to search WITHIN the PDF files, which should be possible using CirrusSearch ... I installed Elasticsearch 5.4.3, which is running well as a service on my Windows 10 machine.
Running the maintenance with:
C:\wamp64\www\mediawiki\extensions\CirrusSearch\maintenance\updateSearchIndexConfig.php --reindexAndRemoveOk --indexIdentifier=now
it runs for about a minute and seems to parse my 5 PDF files, currently uploaded in my wiki.
As a result I get this, which doesn't look like an error:
content index... Fetching Elasticsearch version...5.4.3...ok Scanning available plugins...none Setting index identifier...my_wiki_content_1525866681 Picking analyzer...english Creating index...ok Validating number of shards...ok Validating replica range...ok Validating shard allocation settings...done Validating max shards per node...ok Validating analyzers...ok Validating mappings... Validating mapping...different...corrected Validating aliases... Validating my_wiki_content alias...is taken... Reindexing... Started reindex task: NzCI7echStm_vnG6pyK0_w:4310 Task: NzCI7echStm_vnG6pyK0_w:4310 Search Retries: 0 Bulk Retries: 0 Indexed: 0 / 3 Task: NzCI7echStm_vnG6pyK0_w:4310 Search Retries: 0 Bulk Retries: 0 Indexed: 3 / 3 Verifying counts...done Optimizing...Done Validating number of shards...ok Validating replica range...ok Validating shard allocation settings...done Validating max shards per node...is 12 but should be unlimited...corrected Waiting for all shards to start... active:4/4 relocating:0 initializing:0 unassigned:0 Swapping alias...done Removing old indices... my_wiki_content_first...done Validating my_wiki alias...alias not already assigned to this index...corrected Validating number of shards...ok Validating replica range...ok Validating shard allocation settings...done Validating max shards per node...ok Updating tracking indexes...done general index... Fetching Elasticsearch version...5.4.3...ok Scanning available plugins...none Setting index identifier...my_wiki_general_1525866712 Picking analyzer...english Creating index...ok Validating number of shards...ok Validating replica range...ok Validating shard allocation settings...done Validating max shards per node...ok Validating analyzers...ok Validating mappings... Validating mapping...different...corrected Validating aliases... Validating my_wiki_general alias...is taken... Reindexing... Started reindex task: NzCI7echStm_vnG6pyK0_w:4468 Task: NzCI7echStm_vnG6pyK0_w:4468 Search Retries: 0 Bulk Retries: 0 Indexed: 0 / 5 Task: NzCI7echStm_vnG6pyK0_w:4468 Search Retries: 0 Bulk Retries: 0 Indexed: 5 / 5 Verifying counts...done Optimizing...Done Validating number of shards...ok Validating replica range...ok Validating shard allocation settings...done Validating max shards per node...is 12 but should be unlimited...corrected Waiting for all shards to start... active:4/4 relocating:0 initializing:0 unassigned:0 Swapping alias...done Removing old indices... my_wiki_general_first...done Validating my_wiki alias...alias not already assigned to this index...corrected Validating number of shards...ok Validating replica range...ok Validating shard allocation settings...done Validating max shards per node...ok Updating tracking indexes...done Deleting namespaces...done Indexing namespaces...done
But in the end, the search in the wiki doesn't return any results from searching within the pdf files.
Is there something I missed? 213.211.236.242 (talk) 12:10, 9 May 2018 (UTC)
- I think that a maintenance script has to be run to index PDF files that were previously uploaded.
- If you try to upload a new file, will its content be properly searched?
- If yes, I think you need to run refreshImageMetadata.php -f and rebuildImages.php -f, as shown below.
- See the debugging section in PdfHandler. DCausse (WMF) (talk) 10:06, 11 May 2018 (UTC)
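- For instance, run from the MediaWiki installation directory (the paths are assumed; -f is the flag suggested above):
php maintenance/refreshImageMetadata.php -f
php maintenance/rebuildImages.php -f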
- Thanks for the reply.
- If I upload new PDFs, their content is not found either.
- Running refreshImageMetadata.php -f I get an error for every uploaded file like (translated from German): "Wrong syntax: file name, directory ...".
- Diving into the code of refreshImageMetadata.php, I saw that $row->img_name only contains the pure filename xyz.pdf without any folder information.
- The error occurs around line 170 at: $file->upgradeRow(); 213.211.236.242 (talk) 12:37, 16 May 2018 (UTC)
Elastic Search Can't Find Java
RESOLVED | |
Elasticsearch and Java are mandatory dependencies |
The following discussion is closed. Please do not modify it. Subsequent comments should be made on the appropriate discussion page. No further edits should be made to this discussion.
The following command (from https://www.elastic.co/guide/en/elasticsearch/reference/current/zip-targz.html)
./bin/elasticsearch
gives the following error:
/home/gunsywtx/public_html/extensions/elasticsearch-6.2.4$ ./bin/elasticsearch which: no java in (/home/gunsywtx/perl5/bin:/usr/local/cpanel/3rdparty/lib/path-bin:/usr/local/cpanel/3rdparty/lib/path-bin:/usr/local/jdk/bin:/usr/local/cpanel/3rdparty/lib/path-bin:/usr/local/bin:/bin:/usr/bin:/usr/local/sbin:/usr/sbin:/sbin:/opt/cpanel/composer/bin:/usr/local/bin:/usr/X11R6/bin:/opt/puppetlabs/bin:/opt/dell/srvadmin/bin:/home/gunsywtx/bin) could not find java; set JAVA_HOME or ensure java is in PATH
Any fix? Johnywhy (talk) 12:22, 3 June 2018 (UTC)
- Just found out my shared web-host does not have Java.
- Any way to use CirrusSearch without Java?
- thx Johnywhy (talk) 12:32, 3 June 2018 (UTC)
- No. [[kgh]] (talk) 12:54, 3 June 2018 (UTC)
CirrusSearch Only Partially Indexing
I posted this on the discussion for Help:CirrusSearch but am doing it here as well to see if I might find a solution.
I have a wiki running on a dev server with the following:
Product | Version |
---|---|
MediaWiki | 1.27.4 |
PHP | 5.6.25 (apache2handler) |
MariaDB | 5.5.56-MariaDB |
Elasticsearch | 1.7.6 |
Recently installed CirrusSearch, and it works as expected except for one issue: it's only returning a partial number of pages in the search results. For example, there are about 200 pages (yeah, it's not big) in the main namespace, but only 20 are returned. Likewise, there are about 1800 images, but only 160 are returned. I increased the memory for elasticsearch, but that had no discernible effect. Elastica is up and running. Null edits force the changes through, but I'd rather not do this 1K+ times.
Any ideas/suggestion as to how to fix this? Thanks in advance. 199.16.64.3 (talk) 15:59, 11 June 2018 (UTC)
- I think the first step would be to know if the problem is at index time or search time.
- Could you tell us if the output of the forceSearchIndex.php maintenance script is sane compared to the number of docs you have? (It outputs: Indexed a total of XYZ pages at Y/s.)
- To troubleshoot the issue I'd suggest that you paste the output of these commands:
- To know how many docs have been indexed, you can ask Elasticsearch with:
curl localhost:9200/wiki_name/_count?pretty
- Having the list of indices in Elasticsearch might also help to troubleshoot the issue:
curl localhost:9200/_cat/indices
- An example search query sent by Cirrus to Elasticsearch: you can obtain it by appending &cirrusDumpQuery to the search results page URL.
- Thanks! DCausse (WMF) (talk) 07:39, 12 June 2018 (UTC)
- Thanks for the response! I'll get on this soon and respond in the next day or so. 199.16.64.3 (talk) 14:09, 12 June 2018 (UTC)
- Ok this is what I got running the commands.
- After running forceSearchIndex.php --skipLinks --indexOnSkip:
Skipping page with no content: 896 [wikidatabase] Indexed 9 pages ending at 900 at 18/second
- After running forceSearchIndex.php --skipParse:
Indexed a total of 3716 pages at 197/second
- After running curl localhost:9200/wiki_name/_count?pretty:
{ "error" : "IndexMissingException[[mediawiki] missing]", "status" : 404 }
- But when running the command as curl localhost:9200/_count?pretty:
- { "count" : 938, "_shards" : { "total" : 10, "successful" : 10, "failed" : 0 } }
- When running the command curl localhost:9200/_cat/indices:
green open mediawiki_cirrussearch_frozen_indexes 1 0 0 0 144b 144b green open mw_cirrus_versions 1 0 2 2 3.5kb 3.5kb green open wikidatabase_general_first 4 0 800 574 8.1mb 8.1mb green open wikidatabase_content_first 4 0 136 18 41.7mb 41.7mb
- And this is the object returned when appending &cirrusDumpQuery to search for example term "rock":
{"description":"full_text search for 'rock'","path":"wikidatabase\/page\/_search","params":{"search_type":"dfs_query_then_fetch","timeout":"20s"},"query":{"_source":["id","title","namespace","redirect.*","timestamp","text_bytes"],"fields":"text.word_count","query":{"filtered":{"query":{"bool":{"minimum_number_should_match":1,"should":[{"query_string":{"query":"rock","fields":["all.plain^1","all^0.5"],"auto_generate_phrase_queries":true,"phrase_slop":0,"default_operator":"AND","allow_leading_wildcard":true,"fuzzy_prefix_length":2,"rewrite":"top_terms_boost_1024"}},{"multi_match":{"fields":["all_near_match^2"],"query":"rock"}}]}},"filter":{"terms":{"namespace":[0,102,108]}}}},"highlight":{"pre_tags":["</nowiki><nowiki><span class=\"searchmatch\">"],"post_tags":["<\/span>"],"fields":{"title":{"number_of_fragments":0,"type":"fvh","order":"score","matched_fields":["title","title.plain"]},"redirect.title":{"number_of_fragments":1,"fragment_size":10000,"type":"fvh","order":"score","matched_fields":["redirect.title","redirect.title.plain"]},"category":{"number_of_fragments":1,"fragment_size":10000,"type":"fvh","order":"score","matched_fields":["category","category.plain"]},"heading":{"number_of_fragments":1,"fragment_size":10000,"type":"fvh","order":"score","matched_fields":["heading","heading.plain"]},"text":{"number_of_fragments":1,"fragment_size":150,"type":"fvh","order":"score","no_match_size":150,"matched_fields":["text","text.plain"]},"auxiliary_text":{"number_of_fragments":1,"fragment_size":150,"type":"fvh","order":"score","matched_fields":["auxiliary_text","auxiliary_text.plain"]}},"highlight_query":{"query_string":{"query":"rock","fields":["title.plain^20","redirect.title.plain^15","category.plain^8","heading.plain^5","opening_text.plain^3","text.plain^1","auxiliary_text.plain^0.5","title^10","redirect.title^7.5","category^4","heading^2.5","opening_text^1.5","text^0.5","auxiliary_text^0.25"],"auto_generate_phrase_queries":true,"phrase_slop":1,"default_operator":"AND","allow_leading_wildcard":true,"fuzzy_prefix_length":2,"rewrite":"top_terms_boost_1024"}}},"suggest":{"text":"rock","suggest":{"phrase":{"field":"suggest","size":1,"max_errors":2,"confidence":2,"real_word_error_likelihood":0.95,"direct_generator":[{"field":"suggest","suggest_mode":"always","max_term_freq":0.5,"min_doc_freq":0,"prefix_length":2}],"highlight":{"pre_tag":"<em>","post_tag":"<\/em>"},"smoothing":{"stupid_backoff":{"discount":0.4}}}}},"stats":["suggest","full_text"],"size":20,"rescore":[{"window_size":8192,"query":{"query_weight":1,"rescore_query_weight":1,"score_mode":"multiply","rescore_query":{"function_score":{"functions":[{"field_value_factor":{"field":"incoming_links","modifier":"log2p","missing":0}},{"weight":"0.2","filter":{"terms":{"namespace":[102,108]}}}]}}}}]},"options":{"search_type":"dfs_query_then_fetch","timeout":"20s"}}
- <b>Notice</b>: Uncommitted DB writes (transaction from DatabaseBase::query (User::loadFromDatabase)). in <b>/opt/rh/httpd24/root/var/www/html/mediawiki/includes/db/Database.php</b> on line <b>3306</b><br />
- Thanks! 104.162.109.170 (talk) 22:51, 15 June 2018 (UTC)
- I don't see anything obviously wrong in the outputs you've pasted.
- You mentioned that your wiki has 200 pages and about 1800 images, but the _count reports 938 docs being indexed in total (including some non-page data such as namespace names and other metadata).
- I would suggest finding a page/image that you are unable to find via search and narrowing the investigation down to it to understand why it's not indexed. To do this, pick a random image/page and search for a few words from its title; if you cannot find it using Special:Search (be careful to select the proper namespaces), then you have found a bogus page.
- Then try to identify its page id by appending the ?action=info URI param to the page URL.
- Using this page id, try to run:
forceSearchIndex.php --fromId ID --toId ID+1
- to see if the maint script is able to repopulate this particular page.
- You may also want to run the sanitizer, which will try to identify and fix inconsistencies in the index:
saneitizer.php
- So in the end it's unclear to me what is causing this behavior; I don't see any errors except the Notice: Uncommitted DB writes that you pasted at the end of the message. Do you remember which command generated this error?
- Good luck! DCausse (WMF) (talk) 08:39, 22 June 2018 (UTC)
ElasticSearch 5.6.10 "missing authentication token"
RESOLVED | |
was using the xpack security plugin without a license |
The following discussion is closed. Please do not modify it. Subsequent comments should be made on the appropriate discussion page. No further edits should be made to this discussion.
Hi, I'm trying to upgrade a MediaWiki deployment. I'm using MediaWiki 1.31.0, the corresponding CirrusSearch, and ElasticSearch 5.6.10. When starting with a fresh instance of ElasticSearch, I'm getting this error:
$ php updateSearchIndexConfig.php content index... Fetching Elasticsearch version... Unexpected Elasticsearch failure. Elasticsearch failed in an unexpected way. This is always a bug in CirrusSearch. Error type: Elastica\Exception\ResponseException Message: security_exception: missing authentication token for REST request [/] Trace: #0 /var/www/html/mediawiki-1.31.0/extensions/Elastica/vendor/ruflin/elastica/lib/Elastica/Request.php(193): Elastica\Transport\Http->exec(Object(Elastica\Request), Array) #1 /var/www/html/mediawiki-1.31.0/extensions/Elastica/vendor/ruflin/elastica/lib/Elastica/Client.php(674): Elastica\Request->send() #2 /var/www/html/mediawiki-1.31.0/extensions/CirrusSearch/includes/Maintenance/ConfigUtils.php(45): Elastica\Client->request('') #3 /var/www/html/mediawiki-1.31.0/extensions/CirrusSearch/maintenance/updateOneSearchIndexConfig.php(227): CirrusSearch\Maintenance\ConfigUtils->checkElasticsearchVersion() #4 /var/www/html/mediawiki-1.31.0/extensions/CirrusSearch/maintenance/updateSearchIndexConfig.php(58): CirrusSearch\Maintenance\UpdateOneSearchIndexConfig->execute() #5 /var/www/html/mediawiki-1.31.0/maintenance/doMaintenance.php(94): CirrusSearch\Maintenance\UpdateSearchIndexConfig->execute() #6 /var/www/html/mediawiki-1.31.0/extensions/CirrusSearch/maintenance/updateSearchIndexConfig.php(65): require_once('/var/www/html/m...') #7 {main}The docs don't seem to say anything about needing an authentication token. What am I doing wrong? 2620:11E:1000:120:1792:B56B:4655:6029 (talk) 19:55, 26 June 2018 (UTC)
- You seem to be using XPack security.
- Does it resolve your issue if you set:
xpack.security.enabled: false
- in your elasticsearch.yml config file? DCausse (WMF) (talk) 08:33, 27 June 2018 (UTC)
- Indeed, the root cause was that I was using the non-open-source elasticsearch docker images. Switched to the -oss ones and this problem went away. 2620:11E:1000:120:1792:B56B:4655:6029 (talk) 21:24, 11 July 2018 (UTC)
How to skip template metadata in search results?
Hi,
Most of the pages on my website use a template with metadata at the top of the page. The standard MW search engine didn't include the metadata in the search results, but CirrusSearch/Elasticsearch does.
The information in the metadata fields is important, both as an overview of the content and in searches, but it does not look pretty in the search results (e.g. "Published: 2001-08-03 Keywords: Some words Author: Some name Summary: Some sentences").
Is there a way to get the search result to look better when pages use a template?
Maybe it's possible to only show the metafield "Summary" as a first choice, and if it's empty, then show the start of the article that comes after the metadata field?
or
Just show the content of the article (skip the meta fields), like the standard MW search do?
Thanks for any advice on how to do this. Pretor~nowiki (talk) 04:50, 9 July 2018 (UTC)
- Have you looked into the possibility of excluding some content from the search index?
- Please see Help:CirrusSearch#Exclude_content_from_the_search_index DCausse (WMF) (talk) 20:14, 25 July 2018 (UTC)
How to config CirrusSearch with multiple wikis (same host)
I'm using this guide (with a few changes) and got the wikis and Parsoid + WikiEditor working, but not CirrusSearch. It always complains: An error has occurred while searching: We could not complete your search due to a temporary problem. Please try again later. Unfortunately, I didn't find any tutorials or solutions that work, or at least show in a log what's wrong. All the scripts run without errors (updateSearchIndexConfig.php, forceSearchIndex.php, ...), but it always points to my old setup :(
How to fix that? 14.231.223.128 (talk) 07:19, 21 July 2018 (UTC)
- Perhaps it's because you load CirrusSearch before setting $wgDBname?
- I think the proper order would be (see the sketch after this list):
- set $wgDBname
- load CirrusSearch
- set the CirrusSearch config DCausse (WMF) (talk) 09:07, 23 July 2018 (UTC)
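- A minimal LocalSettings.php sketch of that ordering; the database name and server address are placeholders, and the load style mirrors the one used elsewhere on this page:
$wgDBname = 'examplewiki'; // 1. set the wiki's database name first
wfLoadExtension( 'Elastica' );
require_once "$IP/extensions/CirrusSearch/CirrusSearch.php"; // 2. then load CirrusSearch
$wgCirrusSearchServers = [ 'localhost' ]; // 3. then set its configuration
$wgSearchType = 'CirrusSearch';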
wmf extra plugin
I read here regarding ICU folding:
- Requires the ICU plugin installed and a recent wmf extra plugin (>= 2.3.4)
What and where is the wmf extra plugin? Spiros71 (talk) 10:01, 22 July 2018 (UTC)
- This is an elasticsearch plugin, it is located here: https://gerrit.wikimedia.org/r/plugins/gitiles/search/extra/+/master (this page contains instructions on how to install it). DCausse (WMF) (talk) 07:10, 23 July 2018 (UTC)
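- Installation typically goes through Elasticsearch's standard plugin tool; the Maven coordinates below are an illustrative assumption, so check the plugin's own instructions for the build matching your Elasticsearch version:
./bin/elasticsearch-plugin install org.wikimedia.search:extra:5.6.5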
Suggestion: Show exact search string used
Issue
As a user I'd like to know exactly what was searched for.
Background
A common problem with search engines is that they unilaterally rewrite search strings for the user and sometimes show unexpected, incorrect search results.
Example: Search string "Weird-stuff"
Expected
Search results include a note "searching for 'weird' AND 'stuff'". Alternatively, show normal results but suggest that the user include quotes ("weird-stuff").
Actual
- en.Wikipedia: https://en.wikipedia.org/w/index.php?search=weird-stuff
- Google : www.google.com/search?q=weird-stuff
- Phabricator: https://secure.phabricator.com/search/query/QrVJhKn8_rnJ/#R
Note: 1 & 2 currently remove the "-" and doesn't even inform the user, secure.phabricator shows expected results.
Example 2: Advanced search string (using engine specific keywords)
- -calendar intitle:weird - https://en.wikipedia.org/w/index.php?search=weird
- -calendar title:weird - https://secure.phabricator.com/search/query/V06rHk5LER_U/#R
- -calendar allintitle:weird - //www.google.com/search?q=allintitle:weird
Note: secure.phabricator currently clearly notes strings that are excluded, and emphasizes that titles are matched rather than a string "title".
Proposed solution
For simple strings, show exactly what strings were searched for (e.g. if symbols were dropped); for keyword searches, include the matched keywords. 197.218.92.1 (talk) 11:28, 25 July 2018 (UTC)
- I'm afraid that there would be too many variations of the search string to display, as it is processed differently depending on the field it is looking in. So in the end we would have to display about a dozen variations of the search query for every identified word.
- I think the approach that is generally accepted is to have a large search scope by default and provide the necessary syntax to allow searchers to narrow the search results.
- Here are the features we usually use to make stricter searches:
- wrap words in double quotes: "weird stuff" will force weird stuff to appear close to each other and disable stemming
- use insource:"weird stuff" to search only the source text (excluding template transclusion)
- use insource:/weird-stuff/ (slow) to search exactly for weird-stuff (used to search specific queries where punctuation is important)
- Please see https://blog.wikimedia.org/2017/11/06/searching-techniques/.
- Concerning what Phabricator does: "explaining" the search query syntax is a good idea and we may try to display this in the future.
- But it's unlikely that we'll be able to display all the variations attempted when analyzing a word (case folding, diacritics removal, stemming,...). DCausse (WMF) (talk) 14:28, 25 July 2018 (UTC)
- Yes, it is not necessary to indicate that stemming and case folding are applied. The primary idea here is to give users a glimpse of what exactly is being searched; for example, with the allintitle: keyword, Google adds buttons to indicate when a word is not included at all in the search ("missing: weird").
- I'd say that even a simple explanation of the search would be immensely useful. Consider the case that someone copy-pastes a random phrase like "book -shakespeare" or "gadget:the+movie" and search results immediately exclude lots of data. In the latter case this is completely strange.
- > I think the approach that is generally accepted is to have a large search scope by default and provide the necessary syntax to allow searchers to narrow the search results.
- People can't fix or narrow down things if they aren't even aware why it isn't working. They'll either retry and give up, claim that the search engine is broken, or search for it elsewhere.
- The basic idea would be something like this:
- Indicate when certain symbols are dropped, e.g. "stuff !@#$%^&*" -> "stuff"
- Show search tokens separately, especially when they are originally one token, e.g. allez-vous -> "allez vous"
- Show a separate message when all search terms are discarded, e.g. compare "special:search/$$$" vs Special:Search/insource:/\$\$\$/.
- Maybe some indication when special search keywords are active - Phabricator oddly drops them from the "searched for".
- 197.218.80.177 (talk) 16:38, 25 July 2018 (UTC)
- I get your point and I agree that it may be frustrating.
- But I worry about the technical implications since nothing is really trivial in search.
- For instance, when you say that the dollar sign is dropped from the search query, this is only partially true: the dollar is kept to match the titles $O$ or $.
- It simply does not match anything in the content of the page.
- As for the suggestion for adding "missing: weird" like what google does, it's really powerful but as of today we require all the terms to match. What is probably misleading is that all the words that matched may not be obvious to find in the original doc for the searcher:
- the text snippet may not always highlight it (accuracy problem with limited space)
- it's perhaps part of something that is not directly visible on the page (content hidden behind a show/hide section, hidden category)
- poor ranking
- If someday we relax the query and allow some terms not to match we'll certainly have to do what google does to limit frustration in certain cases.
- As for the punctuation I don't really know how to make this more fluent and less surprising for the user, perhaps what you suggest is the right solution but I still don't know how to decide which analyzer to run to show the "sanitized version" of the search string (as discussed before many different analyzers are run on the search query).
- Thanks for your suggestions. DCausse (WMF) (talk) 19:49, 25 July 2018 (UTC)
- Thanks for the explanations.
- I'd say the problem here is like attempting to look left and right at the same time. Wikis try to cater to pure readers who will never edit a page and don't care even a bit about advanced syntax, and editors who love these things. The end result is a tool that is not a great fit for either of these. For example, if the goal was only to make this interface intuitive for readers, then a colon would never have special meaning, and the search engine would always offer to escape whatever string the editor uses.
- Personally, I'm a fan of simplicity and using the simplest approach that works +80% of the time:
- If the string contains any unusual token (Loo$%^&:) then simply fall back to suggesting that the user run a search without it (e.g. "Showing results for "Loo $%^&:", try searching for "Loo" for potentially better results").
- If the string is plain alphanumeric or equivalent for non-latin languages show tokens separately (if applicable).
- Lastly, show a small hint whenever advanced filters are active.
- It would be good if there was a way to validate that "Loo $%^&:" and "Loo" will yield the same internal search query. But even if not, it would still be a good idea to suggest it anyway. Then it won't really matter if certain characters are dropped or not. This also covers almost all scenarios by always providing feedback the user can use to improve search results.
- I do realize that this might never be added due to the search complications mentioned. 197.218.89.228 (talk) 09:58, 26 July 2018 (UTC)
Suggestion: Add a character at the end and start of title for intitle regex (or suffixsearch)
Issue:
It seems very hard or nearly impossible to simply match the last words of a title using regex.
Background:
Looking through tons of docs about lucene search this seems to always be a strangely missing implementation. It seems that the primary reason is that it is time consuming or inefficient to create. However, adding a marker to every string would make this possible.
Proposed solutions:
- Add a character e.g. \n to the end (and start) of every title that can be matched by "\\n"
- Hide these in search results so it doesn't confuse regular users
This would make it possible to write stuff like "/.*suffix\\n/" and always match the end of the string.
Alternatively, there seems to be an idea about a suffix search: https://discuss.elastic.co/t/ends-with-operator-in-elastic-search/139352/3:
using a custom analyzer that reverses tokens using the reverse token filter, uses the edge n-gram token filter to generate reversed prefixes, and reverses the prefixes again to get suffixes using the reverse token filter
Or, perhaps there is a way with the current regex that isn't obvious? 197.218.80.174 (talk) 21:49, 30 July 2018 (UTC)
- Actually, any illegal title character (Help:Bad title) would suffice to indicate it; maybe "|" would be good enough. 197.218.80.174 (talk) 23:08, 30 July 2018 (UTC)
- Due to how our regex search is implemented, this is essentially a request to support start/end (^/$) in the regex syntax. I don't think it would be too hard to adjust our existing regex plugin to do that, it's simply not something that is handled currently.
- The addition of a start/end marker will still be useful for the query acceleration phase which reduces the number of documents we need to run the regex on. EBernhardson (WMF) (talk) 22:47, 1 August 2018 (UTC)
- Ah, now it makes sense. It is no wonder that the regex works differently from the description in https://www.elastic.co/guide/en/elasticsearch/reference/current/query-dsl-regexp-query.html#_standard_operators. So CirrusSearch doesn't anchor it by default like Lucene does?
- Adding such anchoring would also solve tasks like https://phabricator.wikimedia.org/T90090, and reduce or eliminate the need for https://phabricator.wikimedia.org/T12808.
- As it would also improve performance it seems like a generally good idea, assuming that it is not time consuming to add the relevant code. 197.218.92.173 (talk) 10:30, 2 August 2018 (UTC)
Indexes not updating after editing/creating article
Hi there,
I have set up CirrusSearch by following the installation instructions here (the only difference is, as I'm on MW 1.28.3, I downloaded the corresponding REL versions for both Elastica and CirrusSearch from GitHub).
The initial index is created perfectly; however, any edits to articles/templates or new article creation do not spawn any "cirrusSearchLinksUpdate" jobs in the job queue.
This is my setup:
Product | Version |
---|---|
MediaWiki | 1.28.3 |
PHP | 7.0.30-0ubuntu0.16.04.1 (fpm-fcgi) |
MySQL | 5.7.22-0ubuntu0.16.04.1 |
Elasticsearch | 2.3.3 |
REL1_28 for both:
CirrusSearch | 0.2 | GPL-2.0+ | Elasticsearch-powered search for MediaWiki | Nik Everett, Chad Horohoe, Erik Bernhardson and others |
Elastica | 1.3.0.0 | GPL-2.0+ | Base Elasticsearch functionality for other extensions by providing the Elastica library | Nik Everett and Chad Horohoe |
LocalSettings.php:
wfLoadExtension( 'Elastica' );
require_once "$IP/extensions/CirrusSearch/CirrusSearch.php";
$wgSearchType = 'CirrusSearch';
Any idea what's causing this?
Kind regards,
Viktor 84.105.220.61 (talk) 10:09, 6 August 2018 (UTC)
- It seems I'm facing the same issue, with MediaWiki 1.31, PHP 7.0.30-0+deb9u1, SQLite 3.16.2 and Elasticsearch 5.6.10. Aretni (talk) 09:48, 15 August 2018 (UTC)
- Did you switch $wgDisableSearchUpdate back to false after running the first maint scripts to populate the indices? See the sketch below. DCausse (WMF) (talk) 07:49, 11 September 2018 (UTC)
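- A minimal sketch of that two-phase toggle in LocalSettings.php (the comments are illustrative):
// Phase 1, only while building the initial index with the maintenance scripts:
// $wgDisableSearchUpdate = true;
// Phase 2, once updateSearchIndexConfig.php and forceSearchIndex.php have run:
$wgDisableSearchUpdate = false;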
cirrusDumpQuery for geosearch
RESOLVED | |
no debug param for GeoData query can be seen from the source code |
The following discussion is closed. Please do not modify it. Subsequent comments should be made on the appropriate discussion page. No further edits should be made to this discussion.
Does Nearby/geosearch use elasticsearch? I have been trying to see how geosearch queries to elastic search look and I keep getting an 'unrecognized parameter' error.
Here is a sample query I tried...
https://en.wikipedia.org/w/api.php?action=query&list=geosearch&gscoord=37.786952%7C-122.399523&gsradius=10000&gslimit=10&cirrusDumpQuery 2402:3A80:47A:8AF2:A577:374D:27A0:6729 (talk) 12:40, 29 August 2018 (UTC)
- Yes it uses elastic but the GeoData extension does not support this debug param.
- If you want to see what the query looks like, please take a look at the GeoData extension source code. DCausse (WMF) (talk) 13:16, 29 August 2018 (UTC)
- Thanks! Will check it out. 2402:3A80:47A:8AF2:A577:374D:27A0:6729 (talk) 13:23, 29 August 2018 (UTC)
Highlighting search term in end page
Is there a way to highlight the search term in the page chosen as a match by the end-user and jump the browser to that section? Given my (limited) knowledge of the MediaWiki architecture, that would be a tall order to implement, but in javascript, it could be relatively easy.
Thanks! Tinss (talk) 02:56, 13 September 2018 (UTC)
- Implementing this through the backend directly would indeed be a bit painful. Some JavaScript that parses the highlight and finds candidate(s) on the result page seems like a good hack-a-thon project for someone. EBernhardson (WMF) (talk) 17:54, 13 September 2018 (UTC)
- Ok. I've put that in my todo list. Once the widget is done, I'll share it with the MediaWiki community. Tinss (talk) 22:26, 13 September 2018 (UTC)
- Hi @Tinss, Did you actually create this? Nischayn22 (talk) 17:17, 22 April 2019 (UTC)
- Sorry no, I haven't had the time to do so. Tinss (talk) 21:56, 23 April 2019 (UTC)
Suggestion: Automatically suggest titles from other namespaces when search fails
Issue: Sometimes a user can't quite recall in what namespace a title exists.
Example: createaccountblock doesn't exist in the default searched namespaces, but currently there is a page on this wiki with that exact title.
Proposed solution
- Suggest exact title matches in other namespaces when no exact "title" match exists in default namespaces
- Add a prefix to all single-namespace searches, e.g. if someone searches for "dogsandcats" in the user namespace it should either show an existing page or a redlink prefixed by the namespace ("user:dogsandcats"). That would indirectly solve the complaint in Help talk:CirrusSearch/2018#h-This_is_the_most_worst_search_engine_on_the_internet-2018-06-09T18:49:00.000Z .
- Provide a hint to the user to search all namespaces instead. 197.235.89.142 (talk) 09:13, 15 September 2018 (UTC)
Cannot spawn child: CirrusSearch\Maintenance\IndexNamespaces
The following discussion is closed. Please do not modify it. Subsequent comments should be made on the appropriate discussion page. No further edits should be made to this discussion.
MediaWiki 1.31.0
PHP 7.1.8 (apache2handler) MySQL 5.6.10 elasticsearch 6.4.1
I'm trying to set up CirrusSearch and I'm running into issues.
$wgDisableSearchUpdate = true;
if ( !$wgDisableSearchUpdate ) {
	require_once( "$IP/extensions/CirrusSearch/CirrusSearch.php" );
	$wgCirrusSearchServers = array( 'server' );
	$wgSearchType = 'CirrusSearch';
	$wgCirrusSearchUseExperimentalHighlighter = false;
	$wgCirrusSearchOptimizeIndexForExperimentalHighlighter = false;
	$wgCirrusSearchEnableRegex = false;
	$wgCirrusSearchUseCompletionSuggester = 'no';
}
Error:
php extensions/CirrusSearch/maintenance/updateSearchIndexConfig.php indexing namespaces... Cannot spawn child: CirrusSearch\Maintenance\IndexNamespaces [f9201610b391868b0f987974] [no req] Error from line 675 of wiki/maintenance/Maintenance.php: Class 'CirrusSearch\Maintenance\IndexNamespaces' not found Backtrace: #0 extensions/CirrusSearch/includes/Maintenance/Maintenance.php(87): Maintenance->runChild(string, NULL) #1 extensions/CirrusSearch/maintenance/updateSearchIndexConfig.php(56): CirrusSearch\Maintenance\Maintenance->runChild(string) #2 wiki/maintenance/doMaintenance.php(94): CirrusSearch\Maintenance\UpdateSearchIndexConfig->execute() #3 extensions/CirrusSearch/maintenance/updateSearchIndexConfig.php(73): require_once(string) #4 {main} Legaulph (talk) 10:48, 27 September 2018 (UTC)
- I had to switch to elasticsearch 5.3.0.
- Now it is indexing Legaulph (talk) 17:35, 27 September 2018 (UTC)
- After the indexing I now get an error during wiki search:
- Legaulph (talk) 18:18, 27 September 2018 (UTC)
[W60ecicuWZeAtlHZ2ZlYYwAAAAM] /index.php?title=Special%3ASearch&search=PM&go=Go Error from line 57 of /app/mediawiki/extensions/CirrusSearch/includes/Search/SearchRequestBuilder.php: Call to undefined method Elastica\Query::setStoredFields() Backtrace: #0 /app/mediawiki/extensions/CirrusSearch/includes/Searcher.php(453): CirrusSearch\Search\SearchRequestBuilder->build() #1 /app/mediawiki/extensions/CirrusSearch/includes/Searcher.php(461): CirrusSearch\Searcher->buildSearch() #2 /app/mediawiki/extensions/CirrusSearch/includes/Searcher.php(199): CirrusSearch\Searcher->searchOne() #3 /app/mediawiki/extensions/CirrusSearch/includes/Hooks.php(542): CirrusSearch\Searcher->nearMatchTitleSearch(string) #4 /app/mediawiki/includes/Hooks.php(177): CirrusSearch\Hooks::onSearchGetNearMatch(string, NULL) #5 /app/mediawiki/includes/Hooks.php(205): Hooks::callHook(string, array, array, NULL) #6 /app/mediawiki/includes/search/SearchNearMatcher.php(123): Hooks::run(string, array) #7 /app/mediawiki/includes/search/SearchNearMatcher.php(32): SearchNearMatcher->getNearMatchInternal(string) #8 /app/mediawiki/includes/specials/SpecialSearch.php(253): SearchNearMatcher->getNearMatch(string) #9 /app/mediawiki/includes/specials/SpecialSearch.php(143): SpecialSearch->goResult(string) #10 /app/mediawiki/includes/specialpage/SpecialPage.php(522): SpecialSearch->execute(NULL) #11 /app/mediawiki/includes/specialpage/SpecialPageFactory.php(568): SpecialPage->run(NULL) #12 /app/mediawiki/includes/MediaWiki.php(288): SpecialPageFactory::executePath(Title, RequestContext) #13 /app/mediawiki/includes/MediaWiki.php(861): MediaWiki->performRequest() #14 /app/mediawiki/includes/MediaWiki.php(524): MediaWiki->main() #15 /app/mediawiki/index.php(42): MediaWiki->run() #16 {main}
- Now I have it: I installed Elasticsearch 5.6.2
- and ran php composer.phar update --no-dev in the Elastica folder. 148.177.1.215 (talk) 15:37, 28 September 2018 (UTC)
- After further analysis
Product | Version |
---|---|
MediaWiki | 1.31.0 |
PHP | 7.1.8 (apache2handler) |
MySQL | 5.6.10 |
ICU | 50.1.2 |
Elasticsearch | 5.6.12 |
Elastica | 1.3.0.0 (7019d96) 20:49, 13 April 2018 |
CirrusSearch | 0.2 |
- Starting an Elasticsearch install on Red Hat 7:
- I could not start with Elasticsearch 5.6.12 and run php $MW_INSTALL_PATH/extensions/CirrusSearch/maintenance/updateSearchIndexConfig.php as described in the readme.
- I had to install 2.3.3 and run php $MW_INSTALL_PATH/extensions/CirrusSearch/maintenance/updateSearchIndexConfig.php; this started the index and then failed, saying that 3.2.2 was not supported.
- I then re-installed Elasticsearch 5.6.12 and ran php $MW_INSTALL_PATH/extensions/CirrusSearch/maintenance/updateSearchIndexConfig.php, and this worked.
- Now I could finish the scripts described in the readme file. Legaulph (talk) 13:22, 1 October 2018 (UTC)
The discussion above is closed. Please do not modify it. No further edits should be made to this discussion.
action=cirrusDumpQuery is not working
Trying to troubleshoot why CirrusSearch is not returning any results in Special:Search. Pages are indexed properly. There is nothing in the error log. But during testing, we found out that certain actions from https://www.mediawiki.org/wiki/Extension:CirrusSearch#API are not working. '?action=cirrusdump' is working fine, but '?action=cirrusDumpQuery' doesn't return JSON content at all - it just redirects back to Special:Search. Any idea of where to look next? MediaWiki 1.31.0 Elasticsearch 5.6.12 CirrusSearch 0.2 Elastica 1.3.0.0 Lalquier (talk) 13:12, 1 October 2018 (UTC)
- cirrusDumpQuery is not a page action but a debug param; it must be used like URL&cirrusDumpQuery. DCausse (WMF) (talk) 15:42, 1 October 2018 (UTC)
- More on this issue. Using the right syntax, cirrusDumpQuery works fine on MW 1.30 with ES 2.4.2, but it is returning the search page instead of JSON results on MW 1.31.1 with ES 5.6.12. On that MW 1.31.1 instance, we checked that indexing is going well from wiki pages. I can run queries directly against the ES server. The issue seems to be in the 'last mile' between MW making the query to ES and rendering the search results. Lalquier (talk) 12:09, 18 October 2018 (UTC)
- @Lalquier same issue here - It simply doesn't work in 1.31.1 & ES 5.6.12 - Indexes are built no problem & I can query them from ES; however, setting Cirrus as the search type just doesn't work. 4.53.192.131 (talk) 14:30, 25 February 2019 (UTC)
- Okay, so if anyone runs into this: the solution (for me at least) was to make sure you're using CirrusSearch @ ad9a0d9 (REL1_31) and not master, or the version listed for MW 1.32. Anything after commit ad9a0d9 doesn't work properly with 1.31.
Product | Version |
---|---|
MediaWiki | 1.31.1 |
PHP | 7.2.15-0ubuntu0.18.04.1 (fpm-fcgi) |
MySQL | 5.7.25-0ubuntu0.18.04.2 |
ICU | 60.2 |
Elasticsearch | 5.6.15 |
4.53.192.131 (talk) 15:26, 25 February 2019 (UTC)
How to search content and page titles rather than just page titles with CirrusSearch
RESOLVED | |
php updateSearchIndexConfig.php --startOver && php forceSearchIndex.php |
The following discussion is closed. Please do not modify it. Subsequent comments should be made on the appropriate discussion page. No further edits should be made to this discussion.
Hi, I replaced the MediaWiki search with the CirrusSearch extension. It works very well; at least now I can find a lot more pages based on their titles. But for some reason I am not able to get any match based on page content. For example, a search for "This is my page content" would get zero matches and would suggest creating a page with the title "This is my page content". I already went through the Extension:CirrusSearch page and its further links, but I can't find the right option. Would be awesome if someone could point me in the right direction. I am running MediaWiki version 1.29.1, PHP 7.0.32, MySQL 5.7.24. Thank you :-) Rivaldez (talk) 15:42, 1 October 2018 (UTC)
- That shouldn't generally happen, or at least there is no supported option for full-text search to only search the title. Some things I might check:
- On an article page add `&action=cirrusdump` to the url, such as: https://en.wikipedia.org/wiki/foobar?action=cirrusdump. The main question would be if the `text` field is correctly populated here.
- On the search page add `&cirrusDumpQuery` to the url, such as: https://en.wikipedia.org/wiki/Special:Search?search=kennedy&fulltext=1&cirrusDumpQuery. The main question this would answer is if the expected fields are being included in the query sent to elasticsearch. EBernhardson (WMF) (talk) 19:46, 1 October 2018 (UTC)
- Thank you very much for your time!
- I did as you suggested and added ?action=cirrusdump to a page named docker
- {
- "_index": "db_wiki_db_content_first",
- "_type": "page",
- "_id": "2528",
- "_version": [],
- "_source": {
- "version": 11697,
- "wiki": "db_wiki_db",
- "namespace": 0,
- "namespace_text": "",
- "title": "Docker",
- "timestamp": "2017-11-29T06:51:34Z"
- }
- }
- If I see it correctly there is no text? So maybe I did the indexing wrong?
- the &cirrusDumpQuery was a lot of output therefore I pushed it to an github repo.
- For me it seemed to be correct. Detail in the link below.
- Link to cirrusDumpQuery Rivaldez (talk) 07:45, 2 October 2018 (UTC)
- That's a rather surprising doc to see. A doc will be built with that shape only if `skip links` and `skip parse` are set, which should only happen from the forceSearchIndex.php maintenance script when explicitly set.
- If you edit a page, does anything in the doc returned by action=cirrusdump change? In particular the `timestamp` field should update to the revision timestamp, and `version` should update to the new revision id. If not, that suggests live updates may not be working, which is the code path that can't skip generating fields like the revision text.
- One other thing to check would be if rebuilding the index works. There is a maintenance script `forceSearchIndex.php` which can be run with no options which will iterate over all pages in the wiki and index them. EBernhardson (WMF) (talk) 21:19, 3 October 2018 (UTC)
- Thank you for your support! I finally managed to get the search working right. It's awesome now!
- What I did wrong was indeed running forceSearchIndex.php with --skipLinks and --skipParse.
- I did this because I blindly followed the instructions of the README referenced in the Extension:CirrusSearch article.
- To fix it I just reran the following two commands.
- php updateSearchIndexConfig.php --startOver
- php forceSearchIndex.php
- Thanks again :-) Rivaldez (talk) 14:54, 5 October 2018 (UTC)
The discussion above is closed. Please do not modify it. No further edits should be made to this discussion.
What to do if ElasticSearch isn't running on port 9200
I installed Percona and it took over port 9200. I found ElasticSearch on port 9201 and had to look up how to configure CirrusSearch to use an alternative port. After searching through the code I found that I could do the following:
$wgCirrusSearchServers = [ [ 'host' => "127.0.0.1", 'port' => 9201 ] ];
☠MarkAHershberger☢(talk)☣ 00:59, 20 November 2018 (UTC)
Issue: Commons "sister-search" disabled?
Issue: Search results no longer return any results from Commons
Steps to reproduce
Expected: An image from Commons on the sidebar.
Actual: Not even a single mention of Commons results.
Notes: This might be related to the new deployment of Extension:AdvancedSearch; apparently this was only disabled on English Wikipedia (and maybe a few other wikis), https://phabricator.wikimedia.org/T163463. 197.218.80.183 (talk) 13:00, 28 November 2018 (UTC)
- Just a quick note regarding AdvancedSearch. You can disable the AdvancedSearch interface completely via the user settings (this is a brand new feature). When I do that and visit the link above I also do not get an image. So this is probably unrelated to AdvancedSearch and might be something else :-/
- Best wishes,
- Christoph Christoph Jauera (WMDE) (talk) 14:17, 28 November 2018 (UTC)
- That's a good point. I suppose it can also easily be seen by disabling JavaScript as well. The odd thing is that only Commons was disabled; considering that the sidebar brings in results from multiple different wikis, one would have expected all of them to stop working at the same time. It might be just a coincidence.
- I guess the feature isn't that popular, considering that it probably stopped working quite a long time ago and nobody noticed it. Or maybe the fact that it randomly pops up only when there are images makes it harder to notice. It might be good to always suggest searching Commons for an image.
- Anyway, thanks for double-checking it ... 197.218.80.183 (talk) 14:30, 28 November 2018 (UTC)
Suggestion: Provide a plain search (no analysis)
Issue: It is currently impossible to search for an exact string that contains certain symbols.
Steps to reproduce
- Search for content that is added by a template or contains symbols , e.g. " 〃", https://en.wikipedia.org/w/index.php?search=%22%E3%80%83%22&title=Special%3ASearch&profile=advanced&fulltext=1&advancedSearch-current=%7B%22namespaces%22%3A%5B0%5D%7D&ns0=1
- Go to the page and use the browser search to find it.
Expected: It should be possible to find basic symbols
Actual: Certain symbols are impossible to find
Proposed solution
- Add a plaintext keyword, e.g. "plaintext: 〃" .
This would do no analysis, no stemming, no normalization.
Notes: insource: doesn't always work for this because it can only detect content saved to the page (it can't extract transcluded content), and the default search can't address it because it is optimized for readers and tries its best to normalize searches. While looking for a way to escape Elasticsearch strings, I came across this possible solution: https://discuss.elastic.co/t/how-to-index-special-characters-and-search-those-special-characters-in-elasticsearch/42506. This could also improve the current limited "exact text" field in Extension:AdvancedSearch. 197.218.80.183 (talk) 16:02, 28 November 2018 (UTC)
- A more convincing real world example might be https://phabricator.wikimedia.org/T87136, and similar cases for other non-latin languages. 197.218.80.183 (talk) 16:12, 28 November 2018 (UTC)
- Hey IP, thanks for the suggestion. I've added a note to the phab task to mention this request. CKoerner (WMF) (talk) 15:36, 30 November 2018 (UTC)
- That task was only one use case. It will not solve the general problem; see https://www.mediawiki.org/w/index.php?title=Help%20talk%3AExtension%3AAdvancedSearch/2018#h-Issue%3A_Exact_this_search_does_not_match_exact_string-2018-11-28T14%3A41%3A00.000Z for a real world example. The problem is that while all these transformations do help in the general case, they don't always work properly for a multilingual platform like MediaWiki. So in that instance exact search will never be exact, because it will always be case insensitive, case folded, and have many tokens stripped.
- For instance, I randomly found a symbol (〆) while reading an article, and searched for it. Google finds many cases (Google: 〆 site:en.wikipedia.org), while English Wikipedia currently only finds a single one. The reason it even finds that character at all is because there is a redirect to it.
- The generic problem can probably only be solved by a different search keyword. 197.218.84.150 (talk) 10:44, 2 December 2018 (UTC)
- Yeah, the general case is different from the German daß/dass problem in that "non-word" symbols, like punctuation, are not going to be indexed even if we deal with ß/ss correctly.
- > This would do no analysis, no stemming, no normalization.
- I can see not doing stemming or normalization, but "analysis" includes tokenization, which is more or less breaking text up into words in English (and much more complex in Chinese and Japanese, for example). Would you want to skip tokenization, too?
- Without tokenization, would a search for "bot" return matches for "bot", "robot", "botulism", and "phlebotomy"? Would you want to be able to search on "ing te" and match "breaking text", but not "breaking  text" (with two spaces between the words)? Would you want searches for "text", "text,", "text.", and text" (with a trailing quotation mark) to all give different results? It sounds like the answer is yes, so I'll assume that's the case.
- The problem is that this kind of search is extremely expensive. For the current insource regex search, we index the text as trigrams (3-character sequences)—so "some text" is indexed as "som", "ome", "me " (with a final space), "e t" (with a space in the middle), " te" (with an initial space), "tex", and "ext". We try to find trigrams in a regex being searched to limit the number of documents we have to scan with the exact regex. That's why insource regex queries with only one character, or with really complex patterns with no plain text, almost always time out on English Wikipedia—they have to scan the entire document collection looking for the one character or the complex pattern. But insource queries for /ing text/ or /text\"/ have a chance—though apparently matching the trigram "ing" gives too many results in English and the query still times out! (A concrete sketch of this trigram indexing follows this reply.)
- Indexing every letter (or even every bigram) would lead to incredibly large indexes, with many index entries having millions of documents (most individual letters, all common short words like "in", "on", "an", "to", "of", and common grammatical inflections like "ed"). Right now you can search for "the" on English Wikipedia and get almost 5.7M hits. It works and doesn't time out because no post-processing of those documents is necessary to verify the hits—unlike a regex search, which still has to grep through the trigram results to make sure the pattern matches.
- An alternative might be to do tokenization such that no characters are lost, but the text is still divided into "words" and other tokens. In such a scenario, text." would probably be indexed as "text", ".", and the quote character ("), and a search for text." would not match, say, context.". There are still complications with whitespace, and a more efficient implementation that works on tokens (which is what the underlying search engine, Elasticsearch, is built to do) might still match both text . " and text.", because both have the three tokens "text", ".", and " in a row. A more exact implementation would find all documents with "text", ".", and " in them, and then scan for the exact string text." like the regex matching does—but that would have the same limitations and timeouts that the regex matching does.
- Unfortunately, your use cases are just not well supported by a full-text search engine, and that's what we have to work with. I don't think there's any way to justify the expense of supporting such an index. And even if we did build the indexes required, getting rid of timeouts and incomplete results would require significantly more servers dedicated to search.
- Even Google doesn't handle the 〃 case (Google: 〃 site:en.wikipedia.org). It drops the 〃 and gives roughly the same results as site:en.wikipedia.org alone (it actually gives a slightly lower results count—61.3M vs 61.5M—but the top 10 are identical and the top 1 doesn't contain 〃).
- Also, note that Google doesn't find every instance of 〆. The first result I get with an insource search on-wiki is Takeminakata, which has 〆 in the references. The Google results seem to be primarily instances of 〆 all by itself, though there are some others. (I'm not sure what the appropriate tokenization of 〆捕 is, for example, so it may get split up into 〆 and 捕; I just don't know.)
- I'm having some technical difficulties with my dev environment at the moment, so I can't check, but indexing 〆 by itself might be possible. It depends on whether it is eliminated by the tokenizer or by the normalization step. I think we could possibly prevent the normalization from normalizing tokens to nothing—which would probably apply to some other characters such as diacritics like ¨—but preventing the tokenizer from ignoring punctuation characters would be a different level of complexity. There are also questions of what such a hack would do to indexing speed and index sizes, so even if it is technically feasible, it might not be practically feasible. I'll try to look at it when my dev environment is back online. TJones (WMF) (talk) 17:48, 3 December 2018 (UTC)
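To make the trigram indexing described above concrete, here is a small standalone sketch that reproduces the listed trigrams for "some text". This only mirrors the idea; CirrusSearch does the real work inside Elasticsearch with an ngram tokenizer, not in PHP.

<?php
// Produce all overlapping 3-character sequences of a string (multibyte-safe).
function trigrams( string $text ): array {
	$chars = preg_split( '//u', $text, -1, PREG_SPLIT_NO_EMPTY );
	$grams = [];
	for ( $i = 0; $i + 3 <= count( $chars ); $i++ ) {
		$grams[] = implode( '', array_slice( $chars, $i, 3 ) );
	}
	return $grams;
}

// ["som", "ome", "me ", "e t", " te", "tex", "ext"]
print_r( trigrams( 'some text' ) );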
- >It sounds like the answer is yes, so I'll assume that's the case.
- In a perfect world, yes.
- > An alternative might be to do tokenization such that no characters are lost, but the text is still divided into "words" and other tokens. In such a scenario, text." would probably be indexed as "text", ".", and the quote character ("), and a search for text." […]
- Indeed, perfect is the enemy of good. It is acceptable to have a search that will always match full tokens separated by spaces. That's the suggested approach in the thread (https://discuss.elastic.co/t/how-to-index-special-characters-and-search-those-special-characters-in-elasticsearch/42506). It seems quite sensible to do so even for the general search. I mean, it is quite silly that the search engine is unable to search for something as simple as "c++". In such a case, one would expect it to match "c" AND "c++", and prioritize "c++".
- There are even more cases. For instance, many people (myself included) sometimes like to learn about Egyptian glyphs, and many of these convey meaning by themselves, yet searching for "☥" finds only one page, which is odd for something that can mean life. There are even weirder Egyptian symbols that I have no idea what they are called, and they tend to be hard to describe. Google finds millions across sites; for en.wikipedia it currently finds (google:"☥" site:en.wikipedia.org) about 500. It is a bit unfair to compare it to Google, because Google likely has sophisticated artificial intelligence algorithms that simply translate the "☥" to Ankh and also search using that. Interestingly, even Wikidata just drops the "☥".
- Anyway, there's no need to call it exact search; maybe it should just be called "tokensearch:" or something related to that, as long as it removes all the other unnecessary normalization. An alternative would be to enhance regex search to be able to work on the transcluded text (after the HTML is stripped). Unfortunately, the regex alternative is likely to be even more costly. 197.218.84.247 (talk) 21:29, 3 December 2018 (UTC)
- Sidenote:
- A pretty nifty side-effect of CirrusSearch's token stripping is that it even beats Google and Bing by showing some sensible results when someone searches for "〆okes". Google and Bing currently find nothing.
- Still, it would be more sensible to add a general note informing the user whenever a special character that may be silently dropped is searched for. 197.218.84.247 (talk) 22:11, 3 December 2018 (UTC)
- I'm hoping to think more about this and get to it tomorrow afternoon. I've got a few deadlines that need my attention, plus an opportunity to discuss it with others early tomorrow. Hope to be back in less than 24 hours!
- Edit: If you are free in about 18 hours, join us to discuss this. More info on this etherpad: https://etherpad.wikimedia.org/p/Search_Platform_Office_Hours TJones (WMF) (talk) 21:30, 4 December 2018 (UTC)
- Sorry for the delay getting back to you. This didn't come up in our discussion today, but I was able to get my dev environment working again (lesson learned: never install major OS updates if you want to be able to get any work done).
- I was able to test all three of ☥, 〃, and 〆 with the current English-language analysis chain. It's actually the tokenizer that removes them. Long ago this would have surprised me, but I've recently seen problems with other tokenizers, and I think a common tokenizer design pattern is to handle the characters you care about, ignore or break on everything else, and not really look too closely at the behavior on "foreign" characters—which causes problems in Wikipedias and Wiktionaries especially, since they are always full of "foreign" characters. Anyway, the standard Elasticsearch tokenizer doesn't seem to care about ☥, 〃, and 〆—it doesn't just drop them, it breaks on them (so x☥y is tokenized as "x" and "y").
- I set up a whitespace tokenizer–only analyzer, and it lets ☥, 〃, and 〆 pass through fine. However, it would not satisfy your C/C++ case: C++ would be tokenized as "C++" and would not match "C". And of course, our earlier examples of "text", "text,", "text.", and text" would all be indexed separately, as would C++. , "C++" , "C++ , and C++" , plus weird one-off tokens like "第31屆東京國際影展- (which does occur in English Wikipedia). (See the _analyze sketch after this reply.)
- So, while it is possible to use a whitespace tokenizer–only analyzer, I think the results would be counterintuitive to a lot of users, and I worry the required index for English Wikipedia would be huge. I'm not familiar with the super low-level implementation details of Elasticsearch, but adding extra occurrences of an existing token to an index generally uses less space than creating a new token, and there would be a lot of new tokens. We're already pushing the limits of our hardware (and are in the middle of re-architecting our search clusters to handle it better).
- To summarize: my best guess right now is that the results would disappoint lots of users (who wouldn't expect punctuation on a word to matter, or would want to find punctuation even when attached to a word)—though this is hard to test. I also think the index would be prohibitively large (especially for the number of users who would use such a feature)—the index size part is testable, but non-trivial, so I haven't done it; the number of users is unclear, but most special syntax and keywords are used quite infrequently overall, even if particular users use them very heavily.
- I'm sorry to disappoint—I'm always happy when on-wiki search does something better than Google!—but I don't think this is feasible given the likely cost/benefit ratio. Though if you want to open a Phabricator ticket, you can—and pointing back to this talk page would be helpful. I can't promise we'll be able to look at it in any more depth than I already have any time soon, though. TJones (WMF) (talk) 22:38, 5 December 2018 (UTC)
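The tokenizer behavior described in this reply can be reproduced against any local Elasticsearch 5.x instance with the _analyze API; a rough sketch (the analyze() helper is hypothetical, not part of CirrusSearch):

<?php
// Ask Elasticsearch to tokenize $text with the named tokenizer and return
// just the token strings.
function analyze( string $tokenizer, string $text ): array {
	$ctx = stream_context_create( [ 'http' => [
		'method' => 'POST',
		'header' => "Content-Type: application/json\r\n",
		'content' => json_encode( [ 'tokenizer' => $tokenizer, 'text' => $text ] ),
		'ignore_errors' => true,
	] ] );
	$res = file_get_contents( 'http://localhost:9200/_analyze', false, $ctx );
	return array_column( json_decode( $res, true )['tokens'] ?? [], 'token' );
}

// The standard tokenizer breaks on the rare characters: ["x", "y"]
print_r( analyze( 'standard', 'x☥y' ) );
// The whitespace tokenizer keeps them, punctuation and all:
// ["☥", "〃", "C++.", "\"C++\""]
print_r( analyze( 'whitespace', '☥ 〃 C++. "C++"' ) );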
- Hmm, too late. Hope it was a fruitful discussion...
- I do appreciate that it is a complicated problem that will likely not be addressed in the next 6 months, or might simply be deemed unfeasible. One could partially address it by doing what book authors do: create a glossary of 'important' tokens, and whenever search fails it could inform the user that "hey, the token you're searching for definitely exists, but search limits mean that it can't be displayed". 197.218.80.248 (talk) 22:40, 5 December 2018 (UTC)
- I replied shortly before your previous reply, so I missed the latest one. Anyway, your assessment seems pretty accurate, so there is probably little benefit to filing a task. Of course, other developers might have different ideas on how it could be implemented, or even the Elasticsearch developers might have some tricks up their sleeves to make it feasible. It is still something that would probably only benefit third parties who aren't bogged down by millions of documents.
- Personally, I'm a fan of simplicity, so if I were to code it, the emphasis would be on the differences rather than the similarities. While there are millions of documents with similar symbols, some tokens are just rare enough to make it useful. For instance, this discussion is currently probably one of the few places (if not the only one) in Wikimedia projects that actually has an "x☥y" string. It would also be enough to notify the user that X exists, rather than simply say "nothing was found", and that would in fact be quite trivial, even without Elasticsearch.
- To put it into perspective, English Wikipedia users (or bots) spend an extreme amount of time creating redirects for typos, for symbols, and for many other tokens. They probably learned to do this early on to address the limitations of the search engine. Other wikis aren't so lucky, so search there is probably considerably worse. My guess is that only places like Wiktionary, which by default contains so many synonyms, fare better. Considering that Wikidata sitelinks also contain a lot of aliases, they might also eventually be used to bridge the gap, if the issues of vandalism and potentially completely wrong information could be properly addressed.
- Anyway, thank you for your assessment, I certainly don't want to give you unnecessary work for something that is very likely to be unfeasible. The current regex search certainly addresses most use cases (except transcluded content).
197.218.84.1 (talk) 10:10, 6 December 2018 (UTC)
- Thanks for the discussion. It's an interesting problem, and some of the stuff we talked about here will definitely go into my future thoughts about evaluating and testing analyzers. TJones (WMF) (talk) 16:49, 6 December 2018 (UTC)
- I thought about this some more, and came up with the idea of a "rare character" index, which, in English, would ignore at least A-Z, a-z, 0-9, spaces, and most regular punctuation, but would index every instance of other characters. I talked it over with @DCausse (WMF), and he pointed out that it is not only possible, but would probably be much more manageable if the indexing were at the document level. (So you could search for documents containing both ☥ and 〆, but you could not specify a phrase like "☥ 〆" or "〆 ☥", or a single "word" like ☥☥ or our old friend x☥y.)
- I also think we could test this without a lot of development work by running offline simulations to calculate how big the index would be, and even build a test index on our search test servers, without writing any real code, by doing a poorly-implemented version with existing Elasticsearch features (a rough sketch follows below). More details are on the phab ticket I've opened to document all those ideas: T211824.
- If you have any ideas about specific use cases and how this would or would not help with them, reply here or on Phab!
- I can't promise we'll get to this any time soon, but at least it will be on our work board, mocking me, so I feel bad about not getting to it! 😁 TJones (WMF) (talk) 22:03, 12 December 2018 (UTC)
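A back-of-the-envelope version of the offline simulation mentioned above might look like the following. The "boring" character class here is an illustrative guess, not whatever T211824 ends up specifying.

<?php
// Document-level rare-character extraction: which rare characters does
// each page contain? Pages with only boring characters stay out entirely.
function rareChars( string $text ): array {
	// Treat ASCII letters, digits, whitespace, and common punctuation as boring.
	$stripped = preg_replace(
		'/[A-Za-z0-9\s.,;:!?\'"()\[\]{}<>\/\\\\|@#$%^&*_+=~`-]/u', '', $text );
	$chars = preg_split( '//u', $stripped, -1, PREG_SPLIT_NO_EMPTY );
	return array_values( array_unique( $chars ) );
}

$pages = [
	'Ankh' => 'The ankh ☥ is an ancient Egyptian symbol of life.',
	'Plain' => 'Nothing but boring characters here.',
];
foreach ( $pages as $title => $text ) {
	$rare = rareChars( $text );
	if ( $rare ) {
		echo "$title => " . implode( ' ', $rare ) . "\n"; // Ankh => ☥
	}
}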
- This seems like a reasonable outcome, and the idea is solid. For the unresolved questions:
- > Do we index the raw source of the document, or the version readers see?
- The raw source is already available using insource, so my suggestion is that this would only consider the reader's version.
- > Do we index just the text of the document, or also the auxiliary text and other transcluded text?
- Transcluded content seems like something that is definitely worthwhile. So perhaps all of the above if it is feasible.
- > It is possible (even desirable) that some documents would not be in this index because they have nothing but “boring” characters in them.
- Certainly desirable.
- >I can't promise we'll get to this any time soon, but at least it will be on our work board, mocking me, so I feel bad about not getting to it
- That's understandable. This would probably not be something used by the average user, but it would definitely make the search more complete, because it highlights the important difference between a generic search engine like Google and a specialized one that is used to identify encyclopedic/wiki content.
197.218.86.137 (talk) 19:53, 13 December 2018 (UTC)
- Another use-case might be counter-vandalism or small fixes. I seem to remember that when using VisualEditor on Linux, pasting something would often produce a "☁", e.g. like this article (https://en.wikipedia.org/w/index.php?title=Rabbit,_Run&oldid=863620024). Of course, emojis in articles are enough of a problem that there is an abuse filter blocking some of them (see Special:Tags, https://meta.wikimedia.org/wiki/Special:AbuseFilter/110).
- So it might be a good thing if it can act as a filter, e.g. "char:" would match all instances of special characters, and "-char:" would exclude them. This seems like a general feature that would help with a lot of things, for instance "-hascategory:" would be the equivalent of Special:UncategorizedPages , or "-linksto:" would be Special:DeadendPages, and so forth.
- Alternatively a separate keyword could be used if such a syntax seems odd, maybe "-matchkey:char", "matchkey:char", "-matchkey:category". 197.218.95.117 (talk) 11:10, 17 December 2018 (UTC)
- This might make the case for a generic emoji flag, maybe "char:emoji" that would match a smaller set of these things. A couple of funny related tasks:
- The abominable snowmen ☃ - https://phabricator.wikimedia.org/T59884 (https://fr.wikipedia.org/w/index.php?title=Lyc%C3%A9e_de_Saint-Just&curid=4133321&diff=154878524&oldid=154878318)
- Eerie clouds ☁ - https://phabricator.wikimedia.org/T126047
- Sunny days (☀) with umbrella(☂) - https://phabricator.wikimedia.org/T129310
- Real TV -> https://fr.wikipedia.org/w/index.php?title=Chace_Crawford&curid=2300505&diff=154721520&oldid=151044938
- 197.218.95.117 (talk) 11:43, 17 December 2018 (UTC)
- I think -char: would work. -insource: and the like already work, so that shouldn't be a problem. I'm not sure about category searches.
- I could see char:emoji being useful, but also really hard to implement. Here's an attempt at a general purpose emoji regex—that's pretty complicated! I can't find any widely defined Unicode regexes for emoji that are already built into Java or other programming languages. We could possibly look into it, though, if the time comes. I'll add it to the phab ticket. Thanks! TJones (WMF) (talk) 22:31, 17 December 2018 (UTC)
- > I think -char: would work. -insource: and the like already work, so that shouldn't be a problem. I'm not sure about category searches.
- You probably misunderstood. The negative operator does already work, but it doesn't work in instances where someone just wants to find all instances that exclude that keyword.
- For instance, if I want to find all articles that don't contain a link (e.g. like this; Monkey (slang) will be found as a false positive) or a category (e.g. https://en.wikipedia.org/w/index.php?search=monkey+-category%3A), it is downright impossible. Regex might get you close, but template transclusions can add extra links or categories or whatever. In fact, a completely empty page might still have links, as interwiki links can be added by Wikidata.
- Similarly, if one wants to search all articles that contain any "rare" character it will be impossible, just as it is right now.
- > I could see char:emoji being useful, but also really hard to implement. Here's an attempt at a general purpose emoji regex—that's pretty complicated! I can't find any widely defined Unicode regexes for emoji that are already built into Java or other programming languages.
char:😁|💃|💀
that would make it possible for users to define longer sequences without the awkward "char:x char:y" syntax. - For java there seems to be some ideas on how to deal with them:https://stackoverflow.com/a/32872406. 197.218.92.53 (talk) 11:16, 18 December 2018 (UTC)
- That's an interesting negative use of -insource. I'm not familiar with any syntax that allows you to search for a bare keyword or its negation, so I'm not really sure what you want it to mean. (In the search you linked to, it actually just omits articles with forms of the word insource: insourced, insourcing, etc.)
- I have foolishly started 4 more threads on this topic (on 3 village pumps and on Phabricator), but the idea of searching for multiple characters, character ranges, or Unicode blocks has come up elsewhere. There are issues of making the syntax consistent (a separate ongoing project is trying to revamp the search parser), determining whether a multi-character search is an implicit AND or OR, and being careful about search syntax that explodes into many individual searches on the back end. If we get far enough to actually implement a rare character index, we'll have to come back to the questions of specific syntax and the initial feature set supported.
TJones (WMF) (talk) 17:47, 18 December 2018 (UTC)
- Oh, it was copied incorrectly, the insource was meant to include a string: https://en.wikipedia.org/w/index.php?search=monkey+-insource%3A%2F%5C%5B%5C%5B%2F.
- Anyway, the point of the regex above was to find pages without links or categories containing the word monkey. In practice there are none; in theory that one match occurs because regex doesn't search transcluded content, and there are different ways to create a link. I'm not exactly sure of the correct terminology for those, but to put it into concrete words, or rather pseudo code (see the snippet below): in essence it discards all pages that contain any rare characters. Based on existing search keywords, the only way to find all pages with rare characters would be to spell them all out. Anyway, apparently there is one search keyword that works like that, "prefer-recent"; compare https://en.wikipedia.org/w/index.php?search=monkey+-prefer-recent%3A&title=Special%3ASearch&profile=advanced&fulltext=1&advancedSearch-current=%7B%22namespaces%22%3A%5B0%5D%7D&ns0=1 vs https://en.wikipedia.org/w/index.php?search=monkey&title=Special%3ASearch&profile=advanced&fulltext=1&advancedSearch-current=%7B%22namespaces%22%3A%5B0%5D%7D&ns0=1 .
var excludedfilter = "-char:";
var search_results = { "pageswithout_char": [1, 2], "pageswith_char": [3, 4] };
if excludedfilter == "-char:" then
    var pagesToSearch = search_results["pageswith_char"];
    return search("foo", pagesToSearch);
end
- While they look quite similar, the order is different, and the help page itself claims that prefer-recent can work without any specific parameters. Nonetheless, it is strange and error-prone syntax, so it seems more sensible to assign another keyword, or perhaps add a new URL parameter, maybe something like
?excludefilter=char|hascategory&includefilter=hastemplate.
- Generally, getting feedback from various places at least (in)validates the idea, and people are more comfy in their own wikis, so a single discussion here would probably not get much feedback even if links were posted. 197.218.92.53 (talk) 19:55, 18 December 2018 (UTC)
- Oops, the pseudo code should be more like this: 197.218.92.53 (talk) 19:58, 18 December 2018 (UTC)
var excludedfilter = "-char:";
var search_results = { "pageswithout_char": [1, 2], "pageswith_char": [3, 4] };
if excludedfilter == "-char:" then
    var pagesToSearch = search_results["pageswithout_char"];
    return search("foo", pagesToSearch);
end
- I get what you are saying now. Is this a theoretical exercise, or do you have a specific use case where finding all pages without any rare characters would be useful? I can't think of any. In the case of a page with no links, you could argue that almost every page should have some links, so those are pages that need improving. Same for categories. But what's the value of finding pages with no rare characters—other than maybe as a conjunct with a more expensive search to limit its scope? (Though, I'm not sure how limiting that would be, so it makes sense to check that out in initial investigation—I'll add it to the phab ticket.) TJones (WMF) (talk) 20:21, 18 December 2018 (UTC)
- > Is this a theoretical exercise, or do you have a specific use case where finding all pages without any rare characters would be useful?
- Well, excluding them is a theoretical exercise. However, including all pages with any rare character ("+char:") is a more useful query, especially if filtered by category. For the original use-case of this thread, if one wants to evaluate pages mentioning historical symbols, one way to find a subset of them would be to use something like that.
- One could also imagine that regular wiki editors would use such an index to add new symbols to their emoji abuse filter, or even track down (and clean up) vandalism that randomly uses multiple emojis. Cloudy or other unknown emojis could be identified this way.
- Right now the only way to find any of them is to deliberately search for them using regex or analyse the wiki dumps. 197.218.92.53 (talk) 21:49, 18 December 2018 (UTC)
- I've added the editing error and vandalism use cases for emoji search to the Phab ticket. TJones (WMF) (talk) 15:53, 19 December 2018 (UTC)
updateSearchIndexConfig.php unable to determine Elasticsearch version (MW 1.31 / ES 5.6)
I'm encountering the following error when running updateSearchIndexConfig.php on Elasticsearch 5.6:
C:\wwwroot\mediawiki-1.31.0\extensions\CirrusSearch\maintenance>php updateSearchIndexConfig.php
content index...
Fetching Elasticsearch version...unable to determine, aborting.
Phpinfo() shows cURL 7.59 as enabled. While I get the JSON response from Elasticsearch when I open http://localhost:9200/?pretty in the browser, I get a bad URL error when using curl, and I suspect this is contributing to the problem.
Browser response:
{
"name" : "xxxx",
"cluster_name" : "elasticsearch",
"cluster_uuid" : "cdAqTAxVRJiEUCvQbWLX0g",
"version" : {
"number" : "5.6.0",
"build_hash" : "781a835",
"build_date" : "2017-09-07T03:09:58.087Z",
"build_snapshot" : false,
"lucene_version" : "6.6.0"
},
"tagline" : "You Know, for Search"
}
Curl response:
C:\Users\mfg_rmnguyen>curl 'http://localhost:9200'
curl: (3) URL using bad/illegal format or missing URL
Can anyone give me tips on how to overcome the above?
-Richard MadX (talk) 08:33, 3 December 2018 (UTC)
- I think that when you test using a Windows command prompt, you have to remove the single quotes around the URL, so that curl 'http://localhost:9200/' becomes curl http://localhost:9200.
. - As for the problem with the maintenance script, could you post the changes you've made to your LocalSettings.php? DCausse (WMF) (talk) 09:35, 3 December 2018 (UTC)
- Thanks for the response. When I removed the single quotes you suggested, it gave me an HTML response containing the message:
- "Network Error (dns_unresolved_hostname) Your requested host 'localhost' could not be resolved by DNS."
- I tried removing my proxy config and adding 127.0.0.1 / localhost to the Windows hosts file but no change. I'll continue looking at this piece.
- This is the LocalSettings.php CirrusSearch config:
wfLoadExtension( 'Elastica' );
require_once "$IP/extensions/CirrusSearch/CirrusSearch.php";
$wgDisableSearchUpdate = true;
$wgCirrusSearchServers = array( 'localhost' );
- I also tried setting $wgCirrusSearchServers to the hostname, without any improvement. MadX (talk) 17:41, 3 December 2018 (UTC)
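For anyone hitting this later, a minimal standalone check (hypothetical script, not part of CirrusSearch) that mimics the maintenance script's first step of fetching the Elasticsearch version, so the raw failure is visible outside MediaWiki:

<?php
// Use 127.0.0.1 rather than localhost; given the dns_unresolved_hostname
// error above, that may sidestep the proxy/DNS problem (just a guess).
$raw = file_get_contents( 'http://127.0.0.1:9200/' );
if ( $raw === false ) {
	die( "Could not reach Elasticsearch at all.\n" );
}
$info = json_decode( $raw, true );
echo 'Elasticsearch version: ' . ( $info['version']['number'] ?? 'unknown' ) . "\n";

If that prints a version, pointing $wgCirrusSearchServers at the same address (e.g. $wgCirrusSearchServers = [ '127.0.0.1' ];) might be worth a try.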
CirrusSearch for MW1.31 with ICU plugin support?
I had posted a question in the past for MW 1.23. I have installed Elastica, Elasticsearch, and CirrusSearch, and I am not sure what I need to do now in order to create an index which is diacritics-insensitive, even for polytonic Greek. The last info I had was to install the analysis-icu plugin and the Extra Queries and Filters plugin, but I am not sure which versions of these and how (and their compatibility with Elasticsearch 5.6.13, which I installed). Spiros71 (talk) 12:20, 6 December 2018 (UTC)
- MW 1.31 should support Elasticsearch 5.6.13 and ICU folding; you need to install the two plugins you mentioned.
- Elasticsearch plugin versions generally follow Elasticsearch versions. The analysis-icu plugin, being maintained by Elastic itself, is always up to date. The extra plugin, being maintained by the WMF, is not guaranteed to be available for every Elasticsearch version. I've just released version 5.6.13, which should be compatible with the version of Elasticsearch you plan to use.
- So, assuming that Manual:$wgLanguageCode is set to el on this wiki, installing the analysis-icu and extra plugins should enable ICU folding everywhere (completion search and fulltext search). Note that a reindex is required.
- If the language code is not set to el, you can force-enable ICU folding by setting $wgCirrusSearchUseIcuFolding = 'yes';. DCausse (WMF) (talk) 16:54, 6 December 2018 (UTC)
- Thank you so much for the prompt reply. Both installed successfully. Spiros71 (talk) 21:04, 6 December 2018 (UTC)
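Putting this reply into a LocalSettings.php sketch (assuming MW 1.31 with the Elastica extension, and Elasticsearch 5.6.x with the analysis-icu and extra plugins already installed; a reindex is still required afterwards):

wfLoadExtension( 'Elastica' );
require_once "$IP/extensions/CirrusSearch/CirrusSearch.php";
$wgSearchType = 'CirrusSearch';
$wgLanguageCode = 'el'; // enables ICU folding implicitly, or...
$wgCirrusSearchUseIcuFolding = 'yes'; // ...force it for other language codes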
- Just in case this proves helpful to someone else: indexing would stop half-way, urging me to use $wgShowExceptionDetails = true; for debug info. After doing that and indexing again, this came up:
MWUnknownContentModelException from line 306 of public_html/includes/content/ContentHandler.php: The content model 'Scribunto' is not registered on this wiki.
See https://www.mediawiki.org/wiki/Content_handlers to find out which extensions handle this content model.
Backtrace:
#0 public_html/includes/content/ContentHandler.php(243): ContentHandler::getForModelID(string)
#1 public_html/includes/Title.php(4984): ContentHandler::getForTitle(Title)
#2 public_html/includes/parser/Parser.php(892): Title->getPageLanguage()
#3 public_html/includes/parser/Parser.php(2126): Parser->getTargetLanguage()
#4 public_html/includes/parser/Parser.php(2091): Parser->replaceInternalLinks2(string)
#5 public_html/includes/parser/Parser.php(1318): Parser->replaceInternalLinks(string)
#6 public_html/includes/parser/Parser.php(443): Parser->internalParse(string)
#7 public_html/includes/content/WikitextContent.php(323): Parser->parse(string, Title, ParserOptions, boolean, boolean, integer)
#8 public_html/includes/content/AbstractContent.php(516): WikitextContent->fillParserOutput(Title, integer, ParserOptions, boolean, ParserOutput)
#9 public_html/includes/content/ContentHandler.php(1324): AbstractContent->getParserOutput(Title, integer, ParserOptions)
#10 public_html/extensions/CirrusSearch/includes/Updater.php(363): ContentHandler->getParserOutputForIndexing(WikiPage, ParserCache)
#11 public_html/extensions/CirrusSearch/includes/Updater.php(204): CirrusSearch\Updater->buildDocumentsForPages(array, integer)
#12 public_html/extensions/CirrusSearch/maintenance/forceSearchIndex.php(218): CirrusSearch\Updater->updatePages(array, integer)
#13 public_html/maintenance/doMaintenance.php(94): CirrusSearch\ForceSearchIndex->execute()
#14 public_html/extensions/CirrusSearch/maintenance/forceSearchI
- This was resolved with:
UPDATE page SET page_content_model = 'wikitext' WHERE page_content_model = 'Scribunto'
- And after that reindexing was successful. Spiros71 (talk) 11:10, 8 December 2018 (UTC)
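If the affected pages really are Scribunto modules rather than mislabeled wikitext, an alternative (untested here) is to register the missing content model instead of rewriting it:

// In LocalSettings.php: load the extension that provides the 'Scribunto'
// content model instead of rewriting page_content_model in the database.
wfLoadExtension( 'Scribunto' ); // or the legacy require_once for older releases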
Failed connect to localhost:9200; Connection refused
Whilst it was working perfectly, today I saw that error (when running curl localhost:9200). Checking the logs, it seems that there was an automatic update to ES 5.6.14 and the extra plugin is not compatible?
[2018-12-12T00:26:35,535][INFO ][o.e.n.Node ] [1eBj8M8] stopping ...
[2018-12-12T00:26:35,554][INFO ][o.e.n.Node ] [1eBj8M8] stopped
[2018-12-12T00:26:35,554][INFO ][o.e.n.Node ] [1eBj8M8] closing ...
[2018-12-12T00:26:35,558][INFO ][o.e.n.Node ] [1eBj8M8] closed
[2018-12-12T00:26:36,273][ERROR][o.e.b.Bootstrap ] Exception
java.lang.IllegalArgumentException: plugin [extra] is incompatible with version [5.6.14]; was designed for version [5.6.13]
at org.elasticsearch.plugins.PluginInfo.readFromProperties(PluginInfo.java:146) ~[elasticsearch-5.6.14.jar:5.6.14]
Spiros71 (talk) 07:09, 12 December 2018 (UTC)
- Elasticsearch plugins must be upgraded every time you upgrade Elasticsearch.
- I'll push a 5.6.14 release of the extra plugin so that you can upgrade your installation. DCausse (WMF) (talk) 11:24, 12 December 2018 (UTC)
- Right, thank you! So I stop Elasticsearch, remove the old plugin, run bin/elasticsearch-plugin install org.wikimedia.search:extra:5.6.14, and then restart Elasticsearch? I am guessing I have to wait until you tell me it has been released.
- By the way, do we know which Elasticsearch version will be compatible with MW 1.32/1.33? Spiros71 (talk) 12:24, 12 December 2018 (UTC)
- I got distracted yesterday and forgot to send the 5.6.14 release; it should be available in a few hours. Yes, just running bin/elasticsearch-plugin install org.wikimedia.search:extra:5.6.14 should upgrade it, if I recall correctly.
- As for the MW/Elasticsearch compatibility matrix, we try to maintain this information in Extension:CirrusSearch (I've just updated it with the MW 1.32 information).
- MW 1.33 is likely to require Elasticsearch 6.x. DCausse (WMF) (talk) 08:33, 13 December 2018 (UTC)
- Thank you, David :) Installed just fine. curl localhost:9200 is returning results, but search is not kicking in.
- In the log I see:
[2018-12-13T13:43:37,516][INFO ][o.e.p.PluginsService ] [1eBj8M8] loaded plugin [analysis-icu]
[2018-12-13T13:43:37,516][INFO ][o.e.p.PluginsService ] [1eBj8M8] loaded plugin [extra]
[2018-12-13T13:43:38,330][INFO ][o.e.d.DiscoveryModule ] [1eBj8M8] using discovery type [zen]
[2018-12-13T13:43:38,658][INFO ][o.e.n.Node ] initialized
[2018-12-13T13:43:38,658][INFO ][o.e.n.Node ] [1eBj8M8] starting ...
[2018-12-13T13:43:38,737][INFO ][o.e.t.TransportService ] [1eBj8M8] publish_address {127.0.0.1:9300}, bound_addresses {[::1]:9300}, {127.0.0.1:9300}
[2018-12-13T13:43:41,773][INFO ][o.e.c.s.ClusterService ] [1eBj8M8] new_master {1eBj8M8}{1eBj8M8-TjCtNBEGctBh3A}{JV2zKRWlR_ia-4czzUe4Tg}{127.0.0.1}{127.0.0.1:9300}, reason: zen-disco-elected-as-master ([0] nodes joined)
[2018-12-13T13:43:41,784][INFO ][o.e.h.n.Netty4HttpServerTransport] [1eBj8M8] publish_address {127.0.0.1:9200}, bound_addresses {[::1]:9200}, {127.0.0.1:9200}
[2018-12-13T13:43:41,785][INFO ][o.e.n.Node ] [1eBj8M8] started
[2018-12-13T13:43:42,064][INFO ][o.e.g.GatewayService ] [1eBj8M8] recovered [3] indices into cluster_state
[2018-12-13T13:43:42,157][INFO ][o.e.c.r.a.AllocationService] [1eBj8M8] Cluster health status changed from [RED] to [GREEN] (reason: [shards started [[mw_cirrus_metastore_first][0]] ...]).
- In the deprecation log I see:
[2018-12-13T13:43:42,074][WARN ][o.e.d.i.m.TypeParsers ] field [include_in_all] is deprecated, as [_all] is deprecated, and will be disallowed in 6.0, use [copy_to] instead.
Spiros71 (talk) 11:56, 13 December 2018 (UTC)
- If your setup was returning results while running Elasticsearch 5.6.13, I see no reason it could not while running 5.6.14...
- Could you double check that the indices are populated correctly? (Using curl localhost:9200/_cat/indices?v should give you the number of docs: docs.count - docs.deleted.) DCausse (WMF) (talk) 13:53, 13 December 2018 (UTC)
- One never knows... even a (minor) new version might have introduced some sort of incompatibility...
- I reindexed (2 hours for 600,000 pages); still the same. When checking the data in /var/lib/elasticsearch I can see very small files of a few bytes each.
health status index uuid pri rep docs.count docs.deleted store.size pri.store.size
green open wiki_1_31_0_content_first KOLZXJrpTle_kDQOuS-sYQ 4 0 0 0 648b 648b
green open mw_cirrus_metastore_first KWF6MOHuSQOd-d4QCp9zbg 1 0 3 6 11.2kb 11.2kb
green open wiki_1_31_0_general_first NHAdshY6Sc-wa5s590gPHw 4 0 33 0 13.2kb 13.2kb
- As for LocalSettings.php, no change:
wfLoadExtension( 'Elastica' );
require_once "$IP/extensions/CirrusSearch/CirrusSearch.php";
$wgCirrusSearchUseIcuFolding = 'yes';
$wgSearchType = 'CirrusSearch';
Spiros71 (talk) 14:25, 13 December 2018 (UTC)
- The indices are empty.
- Anything in the MediaWiki logs?
- What did you run to populate the index, and do you still have the output of the command? DCausse (WMF) (talk) 15:53, 13 December 2018 (UTC)
- I ran these (as I had done in the past without any issues):
php updateSearchIndexConfig.php --startOver
php forceSearchIndex.php
- After running the second one there was the count of indexed pages, and then it completed without error.
- Not sure where to find the MW logs. Nothing recent in error.log in the installation directory. Spiros71 (talk) 16:02, 13 December 2018 (UTC)
- Finally, I downgraded to Elasticsearch 5.6.13 and the index was created. However, although I used $wgCirrusSearchUseIcuFolding = 'yes';, the diacritics-insensitive search has issues. For example, entering:
- ανθρωπος will not show ἄνθρωπος (nor άνθρωπος)
- anthropos will not show ánthrōpos (which is a redirect; I don't know if it gets penalized for being a redirect, or if diacritics stripping simply does not apply to extended Latin in the suggester, or if it simply has not been indexed successfully; I would guess the last, since even entering most of the word, i.e. ánthrōpo, will not bring it up as a suggestion, though it will bring up other words with the same letters but without diacritics, like anthropographos).
- On the contrary, entering ανθρωπος in en.wiktionary.org will show both ἄνθρωπος and άνθρωπος. Spiros71 (talk) 22:35, 13 December 2018 (UTC)
- One more reindexing from scratch resolved this. For some reason it created slightly bigger indexes. Spiros71 (talk) 19:59, 14 December 2018 (UTC)
Suggestion: When a query matches a redirect, search for both the page and its redirect
Issue: There are some searches which will return only a redirect, yet this redirect matches even more pages.
Proposed solution: Check whether the token is a redirect, and search for both:
- A user searches for a token, e.g. "☂"
- The search engine verifies if it matches (e.g exact match) a redirect (https://en.wikipedia.org/w/api.php?action=query&format=json&titles=%E2%98%82&redirects=1&converttitles=1), e.g. "umbrella"
- Then search for both, e.g. "☂" OR "umbrella"
Example:
- Compare https://en.wikipedia.org/w/index.php?search="☂" vs https://en.wikipedia.org/w/index.php?search=%22%E2%98%82%22+OR+%22umbrella%22&title=Special%3ASearch&go=Go
- Compare https://en.wikipedia.org/w/index.php?search=automobile vs https://en.wikipedia.org/w/index.php?search=automobile+OR+Car
- https://en.wikipedia.org/w/index.php?search=☥ vs https://en.wikipedia.org/w/index.php?search=☥+OR+Ankh
Note: It might be useful to give a user the option to only search for the token in case they deliberately want less results. Alternatively, this may trigger only when the search term doesn't contain a quote, e.g. "☂" vs ☂ . This will undoubtedly improve search results in many cases, including the ones discussed in Extension talk:CirrusSearch/2018#h-Suggestion:_Provide_a_plain_search_(no_analysis)-2018-11-28T16:02:00.000Z . 197.218.95.117 (talk) 14:45, 17 December 2018 (UTC)
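A client-side sketch of the three steps above, using the api.php endpoint from the example link (the helper name is invented, and a real implementation would live inside CirrusSearch rather than call the public API):

<?php
// Step 2: ask the API whether the search term is a redirect.
function resolveRedirect( string $term ): ?string {
	$url = 'https://en.wikipedia.org/w/api.php?action=query&format=json'
		. '&redirects=1&converttitles=1&titles=' . urlencode( $term );
	$data = json_decode( file_get_contents( $url ), true );
	return $data['query']['redirects'][0]['to'] ?? null;
}

$term = '☂';
$target = resolveRedirect( $term ); // "Umbrella" on English Wikipedia
// Step 3: search for both, e.g. "☂" OR "umbrella".
$query = $target === null ? "\"$term\"" : "\"$term\" OR \"$target\"";
echo $query . "\n";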
- My understanding is that this is a form of pseudo-relevance feedback. Pseudo-relevance feedback, more generally, uses the top-n results of an initial query to perform query expansion. This is something that might be investigated at some point, but it is a significant undertaking to do well. I've created https://phabricator.wikimedia.org/T215371 to potentially investigate. EBernhardson (WMF) (talk) 00:58, 6 February 2019 (UTC)