I set one up via YunoHost and it seems like it’s doing its job. Any tips? Has anyone set it up before?

  • zutto@lemmy.fedi.zutto.fi
    3 days ago

    Hi!

    I’ve been self-hosting Yacy for some years, even though I rarely use it (I’m mostly using Kagi these days).

    But some tips:

    • Set up something like this in your browser; it sends Yacy to crawl the pages you visit: https://github.com/JeremyRand/YaCyIndexerGreasemonkey
    • Get familiar with blacklists and try to find some public ones to filter out bad sites and adult content.
    • Tinker with Ranking & heuristics -> Solr boosts to get results that better fit your use case.
    • And in general, tinker with all the settings you can find!
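The first tip above (having Yacy crawl the pages you visit) boils down to sending your peer a crawl-start request for each URL. Here is a minimal sketch of building such a request in Python. The `Crawler_p.html` path and the `crawlingMode`, `crawlingURL`, `crawlingDepth` and `crawlingstart` parameter names are assumptions based on Yacy's crawl-start form, and `build_crawl_start_url` is a made-up helper name, so check them against your own instance before relying on this. The default `http://localhost:8090` base URL is also an assumption.

```python
from urllib.parse import urlencode


def build_crawl_start_url(page_url, yacy_base="http://localhost:8090", depth=0):
    """Build a URL that asks a Yacy peer to crawl `page_url`.

    ASSUMPTION: the path and parameter names mirror Yacy's crawl-start
    form (Crawler_p.html); verify them on your own peer.
    """
    params = {
        "crawlingMode": "url",    # crawl starting from a single URL
        "crawlingURL": page_url,  # the page that was just visited
        "crawlingDepth": depth,   # 0 = index only this page
        "crawlingstart": "1",     # assumed flag that submits the form
    }
    return f"{yacy_base.rstrip('/')}/Crawler_p.html?{urlencode(params)}"


if __name__ == "__main__":
    print(build_crawl_start_url("https://example.org/page"))
```

Pages ending in `_p` are normally admin-protected in Yacy, so an actual request would also need your peer's admin credentials. This sketch only builds the URL and does not send anything.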

    And not directly Yacy related, but you can also use your own Yacy through SearXNG, even in ‘private’ (non-P2P) mode.
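A front end like this just needs to query your peer over HTTP. As a rough sketch, assuming Yacy's `yacysearch.json` endpoint with `query`, `maximumRecords` and `resource` parameters, and a response shaped like `channels[].items[]` (both worth verifying on your own instance), a local-only query could look like this. The helper names and the `http://localhost:8090` default are made up for illustration.

```python
from urllib.parse import urlencode


def build_search_url(query, yacy_base="http://localhost:8090",
                     local_only=True, count=10):
    """Build a search URL for a Yacy peer.

    ASSUMPTION: endpoint and parameter names follow Yacy's JSON search
    interface; resource=local restricts the search to your own index.
    """
    params = {
        "query": query,
        "maximumRecords": count,
        "resource": "local" if local_only else "global",
    }
    return f"{yacy_base.rstrip('/')}/yacysearch.json?{urlencode(params)}"


def extract_results(payload):
    """Return (title, link) pairs from a decoded JSON response.

    ASSUMPTION: results live under payload["channels"][n]["items"].
    """
    results = []
    for channel in payload.get("channels", []):
        for item in channel.get("items", []):
            results.append((item.get("title", ""), item.get("link", "")))
    return results
```

To actually run a search, fetch the URL with `urllib.request.urlopen`, decode the body with `json.load`, and pass the result to `extract_results`.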

    • johntash@eviltoast.org
      3 days ago

      I haven’t used Yacy in a while, but I had configured it to auto-import and index links from linkding. I also imported my browser history to get started.
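The setup described above is not shown in the thread, so the following is only one possible way to do the linkding half of it. It assumes linkding's REST API returns bookmarks from `/api/bookmarks/` as paginated JSON with a `results` list whose entries carry a `url` field. The helper name `bookmark_urls` is made up, and how the URLs are then handed to Yacy is left open.

```python
def bookmark_urls(pages):
    """Collect unique bookmark URLs from decoded linkding API pages.

    ASSUMPTION: each page is a dict like
    {"results": [{"url": "https://..."}, ...], "next": ...}.
    Order of first appearance is preserved.
    """
    seen = set()
    urls = []
    for page in pages:
        for bookmark in page.get("results", []):
            url = bookmark.get("url")
            if url and url not in seen:
                seen.add(url)
                urls.append(url)
    return urls


if __name__ == "__main__":
    sample_pages = [
        {"results": [{"url": "https://example.org/a"},
                     {"url": "https://example.org/b"}]},
        {"results": [{"url": "https://example.org/a"}]},
    ]
    for url in bookmark_urls(sample_pages):
        print(url)
```

Each URL could then be submitted to your Yacy peer as a depth-0 crawl start, which keeps the index limited to pages you bookmarked on purpose.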

      I never tried the P2P option though, so I’m not sure how well it works. I was worried about indexing private sites by accident.

    • zutto@lemmy.fedi.zutto.fi
      3 days ago

      In general, to everyone who finds Yacy an interesting project: just give it a try!

      It’s relatively lightweight, and having millions of pages indexed does not take that much disk space. In my case, 3.5 million indexed pages take up only around 200 gigabytes.
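Those figures work out to roughly 57 KB of disk per indexed page. Here is a quick back-of-the-envelope sketch for estimating your own disk needs. It assumes decimal gigabytes and that index size grows linearly with page count, which is a simplification, and the function names are made up for illustration.

```python
def bytes_per_page(total_gigabytes, pages):
    """Average on-disk bytes per indexed page (decimal GB assumed)."""
    return total_gigabytes * 1e9 / pages


def estimated_gigabytes(pages, per_page_bytes):
    """Linear extrapolation of index size; real growth may differ."""
    return pages * per_page_bytes / 1e9


if __name__ == "__main__":
    per_page = bytes_per_page(200, 3_500_000)
    print(round(per_page))                                      # 57143
    print(round(estimated_gigabytes(10_000_000, per_page), 1))  # 571.4
```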

      Yacy is far from perfect, and it’s an ancient project. But it’s still alive and kicking!