You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Currently we have cl_update_index, but I think django-elasticsearch might have something it supplies out of the box, or perhaps we'll need to make our own to get the performance we need. In any case, we'll need something like this so we can ingest the old data in the DB.
The text was updated successfully, but these errors were encountered:
According to #2676cl_update_index main functionality is to populate the index for the first time.
Django elasticsearch DSL has some built-in commands to populate existing data to the index.
It has a --parallel option that is based on elasticsearch-pyparallel_bulk that uses multiprocessing.pool.ThreadPool, by default it uses 4 threads and a chunk size of 500 objects to be sent to ES at once.
docker exec -it cl-django python /opt/courtlistener/manage.py search_index --rebuild --models audio.Audio --parallel
(Creates the index and populates it.)
Yeah, let's give it a try and see how far we can take it. It'll be great if it's good enough and we can use it without needing to get Celery involved (!)
Currently we have cl_update_index, but I think django-elasticsearch might have something it supplies out of the box, or perhaps we'll need to make our own to get the performance we need. In any case, we'll need something like this so we can ingest the old data in the DB.
The text was updated successfully, but these errors were encountered: