Name		Name	Last commit message	Last commit date
Latest commit History 1,468 Commits
.circleci		.circleci
app		app
bin		bin
config		config
db		db
jslib		jslib
lib		lib
log		log
nginx		nginx
public		public
spec		spec
vendor/assets		vendor/assets
xsl		xsl
.dockerignore		.dockerignore
.gitignore		.gitignore
.gitleaks.toml		.gitleaks.toml
.lando.yml		.lando.yml
.rspec		.rspec
.ruby-version		.ruby-version
Dockerfile		Dockerfile
Dockerfile-jruby		Dockerfile-jruby
Gemfile		Gemfile
Gemfile.lock		Gemfile.lock
README.md		README.md
Rakefile		Rakefile
build_docker_image.sh		build_docker_image.sh
config.ru		config.ru
delete_by_id.sh		delete_by_id.sh
fetch_and_process_full_export.sh		fetch_and_process_full_export.sh
fetch_and_process_oai.sh		fetch_and_process_oai.sh
fetch_oai.rb		fetch_oai.rb
file_pipeline.rb		file_pipeline.rb
index_and_deletions.sh		index_and_deletions.sh
index_solr.sh		index_solr.sh
index_solr_file.sh		index_solr_file.sh
index_solr_hathi.sh		index_solr_hathi.sh
package-lock.json		package-lock.json
package.json		package.json
preprocess.sh		preprocess.sh
preprocess_hathi.sh		preprocess_hathi.sh
preprocess_hathi_file.sh		preprocess_hathi_file.sh
preprocess_oai.sh		preprocess_oai.sh
preprocess_oai_step1.sh		preprocess_oai_step1.sh
preprocess_oai_step2.sh		preprocess_oai_step2.sh
preprocess_step1.sh		preprocess_step1.sh
preprocess_step2.sh		preprocess_step2.sh
process_files.rb		process_files.rb
split.sh		split.sh

Repository files navigation

Nouveau Franklin

Development Setup

Clone this repo
Install Ruby 2.3.3 (rbenv recommended)
- You may have issues installing Ruby 2.3.3 in recent Linux distributions due to an OpenSSL version incompatibility. See this guide for help.
Run bundle install to install all gem dependencies.
Run npm install to install javascript libraries.
Edit the local_dev_env file and populate the variables with appropriate values. Then source it in your shell.
```
source local_dev_env
```
Run bundle exec rake db:migrate to initialize the database. You'll also have run this again whenever you pull code that includes new migrations (if you forget, Rails will raise an exception when serving requests because there are unloaded migrations.)
Configure Solr
- You can get the production Solr URL and use that, assuming you're on the Penn VPN
- Otherwise, you can run Penn's custom Solr locally using Lando
  - franklin:start to pull and start the Solr container
  - franklin:stop when you're done working
  - franklin:clean when things get weird
Start the rails server:
```
bundle exec rails s
```
Open up localhost:3000 in a browser. If everything went well, you should see the Franklin homepage.

Solr Indexing

This repository also contains Traject code for indexing MARC records into Solr. It isn't separate because we want to consolidate the MARC parsing logic, as some of it is used to generate display values on-the-fly at page render time.

We handle two types of data exports from Alma: full exports and incremental updates via OAI.

The commands in this section can be run directly, or in an application container. See the run_in_container.sh wrapper script in the ansible repository.

Full exports

Transfer the *.tar.gz files created by the Alma publishing job to the directory where they will be preprocessed and indexed. Run these commands:

./preprocess.sh /var/solr_input_data/alma_prod_sandbox/20170412_full allTitles

./index_solr.sh /var/solr_input_data/alma_prod_sandbox/20170412_full/processed

Incremental updates (OAI)

This runs via a cron job, which fetches the updates available via OAI since the last time the job was run.

./fetch_and_process_oai.sh /var/solr_input_data/alma_prod_sandbox/oai

If you do a full index using an older full data export, and you want to apply a set of already fetched and processed OAI updates manually, you can do so like this:

# run this for each dated directory
./index_and_deletions.sh /var/solr_input_data/alma_prod_sandbox/oai/allTitles/2017_04_10_00_00 allTitles

Building Docker Images

There is a build_docker_image.sh script you can use to build docker images from specific branches that have been freshly pulled from origin. It's intended to be run from a repository clone whose sole purpose is to do builds, so that the images aren't polluted with misc files you may have lying around. Run it with the branch name:

./build_docker_image.sh master
# remember to push to the registry afterwards! see the output of the script.

See the deploy-docker repository for Ansible scripts that build Docker images and deploy containers to the test and production environments.

Running Tests

Tests require a locally-installed version of Chrome to support feature specs

The usual ENV variables need to be set, for now

DL Chrome @ https://commondatastorage.googleapis.com/chromium-browser-snapshots/index.html?prefix=Linux_x64/737173/
Extract to PATH_OF_YOUR_CHOOSING
Precompile assets for test (why???): RAILS_ENV=test bundle exec rake assets:precompile
Start dockerized UPenn Solr rake franklin:start
Run suite: RAILS_ENV=test rspec

Auditing Secrets

You can use Gitleaks to check the repository for unencrypted secrets that have been committed.

docker run --rm --name=gitleaks -v $PWD:/code quay.io/upennlibraries/gitleaks:v1.23.0 -v --repo-path=/code --repo-config

Any leaks will be logged to stdout. You can add the --redact flag if you do not want to log the offending secrets.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Nouveau Franklin

Development Setup

Solr Indexing

Full exports

Incremental updates (OAI)

Building Docker Images

Running Tests

Auditing Secrets

About

Releases

Packages

Contributors 12

Languages

upenn-libraries/discovery-app

Folders and files

Latest commit

History

Repository files navigation

Nouveau Franklin

Development Setup

Solr Indexing

Full exports

Incremental updates (OAI)

Building Docker Images

Running Tests

Auditing Secrets

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 12

Languages

Packages