InnerSource-Crawler
ActionsA GitHub Action to create a list of InnerSource repos within an organization
v1.0.4
LatestBy zkoppert
Tags
(2)This project creates a repos.json that can be utilized by the SAP InnerSource Portal. The current approach assumes that the repos that you want to show in the portal are available in a GitHub organization, and that they all are tagged with a certain topic.
If you need support using this project or have questions about it, please open up an issue in this repository. Requests made directly to GitHub staff or support team will be redirected here to open an issue. GitHub SLA's and support/services contracts do not apply to this repository.
- Create a repository to host this GitHub Action or select an existing repository
- Create the env values from the sample workflow below (GH_TOKEN, ORGANIZATION) with your information as repository secrets. More info on creating secrets can be found here. Note: Your GitHub token will need to have read/write access to all the repositories in the organization
- Copy the below example workflow to your repository and put it in the .github/workflows/directory with the file extension.yml(ie..github/workflows/crawler.yml)
- Don't forget to do something with the resulting repos.jsonfile. You can move it to another repository if needed or save it as a build artifact. This will all depend on what you are doing with it and what repository you are running this action out of.
name: InnerSource repo crawler
on:
  workflow_dispatch:
  schedule:
    - cron: '00 5 * * *'
jobs:
  build:
    name: InnerSource repo crawler
    runs-on: ubuntu-latest
    steps:
    - name: Checkout code
      uses: actions/checkout@v2
    
    - name: Run crawler tool
      uses: docker://ghcr.io/zkoppert/innersource-crawler:v1
      env:
        GH_TOKEN: ${{ secrets.GH_TOKEN }}
        ORGANIZATION: ${{ secrets.ORGANIZATION }}
        # for multiple topics, add them after a comma eg:
        # TOPIC: inner-source,actions,security,python
        TOPIC: inner-source- Copy .env-exampleto.env
- Fill out the .envfile with a token from a user that has access to the organization to scan (listed below). Tokens should have admin:org or read:org access.
- Fill out the .envfile with the exact topic name you are searching for
- Fill out the .envfile with the exact organization that you want to search in
- (Optional) Fill out the .envfile with the exact URL of the GitHub Enterprise that you want to search in. Keep empty if you want to search in the publiccf-workers-proxy-9e9.pages.dev.
- python -m pip install -r requirements.txt
- Run python3 ./crawler.py, which will create arepos.jsonfile containing the relevant metadata for the GitHub repos for the given topic
- Copy repos.jsonto your instance of the SAP-InnerSource-Portal and launch the portal as outlined in their installation instructions
InnerSource-Crawler is not certified by GitHub. It is provided by a third-party and is governed by separate terms of service, privacy policy, and support documentation.