-
-
Notifications
You must be signed in to change notification settings - Fork 271
/
Copy pathJoin-Object.ps1
496 lines (406 loc) · 17.8 KB
/
Join-Object.ps1
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
function Join-Object
{
<#
.SYNOPSIS
Join data from two sets of objects based on a common value
.DESCRIPTION
Join data from two sets of objects based on a common value
For more details, see the accompanying blog post:
http://ramblingcookiemonster.github.io/Join-Object/
For even more details, see the original code and discussions that this borrows from:
Dave Wyatt's Join-Object - http://powershell.org/wp/forums/topic/merging-very-large-collections
Lucio Silveira's Join-Object - http://blogs.msdn.com/b/powershell/archive/2012/07/13/join-object.aspx
.PARAMETER Left
'Left' collection of objects to join. You can use the pipeline for Left.
The objects in this collection should be consistent.
We look at the properties on the first object for a baseline.
.PARAMETER Right
'Right' collection of objects to join.
The objects in this collection should be consistent.
We look at the properties on the first object for a baseline.
.PARAMETER LeftJoinProperty
Property on Left collection objects that we match up with RightJoinProperty on the Right collection
.PARAMETER RightJoinProperty
Property on Right collection objects that we match up with LeftJoinProperty on the Left collection
.PARAMETER LeftProperties
One or more properties to keep from Left. Default is to keep all Left properties (*).
Each property can:
- Be a plain property name like "Name"
- Contain wildcards like "*"
- Be a hashtable like @{Name="Product Name";Expression={$_.Name}}.
Name is the output property name
Expression is the property value ($_ as the current object)
Alternatively, use the Suffix or Prefix parameter to avoid collisions
Each property using this hashtable syntax will be excluded from suffixes and prefixes
.PARAMETER RightProperties
One or more properties to keep from Right. Default is to keep all Right properties (*).
Each property can:
- Be a plain property name like "Name"
- Contain wildcards like "*"
- Be a hashtable like @{Name="Product Name";Expression={$_.Name}}.
Name is the output property name
Expression is the property value ($_ as the current object)
Alternatively, use the Suffix or Prefix parameter to avoid collisions
Each property using this hashtable syntax will be excluded from suffixes and prefixes
.PARAMETER Prefix
If specified, prepend Right object property names with this prefix to avoid collisions
Example:
Property Name = 'Name'
Suffix = 'j_'
Resulting Joined Property Name = 'j_Name'
.PARAMETER Suffix
If specified, append Right object property names with this suffix to avoid collisions
Example:
Property Name = 'Name'
Suffix = '_j'
Resulting Joined Property Name = 'Name_j'
.PARAMETER Type
Type of join. Default is AllInLeft.
AllInLeft will have all elements from Left at least once in the output, and might appear more than once
if the where clause is true for more than one element in right, Left elements with matches in Right are
preceded by elements with no matches.
SQL equivalent: outer left join (or simply left join)
AllInRight is similar to AllInLeft.
OnlyIfInBoth will cause all elements from Left to be placed in the output, only if there is at least one
match in Right.
SQL equivalent: inner join (or simply join)
AllInBoth will have all entries in right and left in the output. Specifically, it will have all entries
in right with at least one match in left, followed by all entries in Right with no matches in left,
followed by all entries in Left with no matches in Right.
SQL equivalent: full join
.EXAMPLE
#
#Define some input data.
$l = 1..5 | Foreach-Object {
[pscustomobject]@{
Name = "jsmith$_"
Birthday = (Get-Date).adddays(-1)
}
}
$r = 4..7 | Foreach-Object{
[pscustomobject]@{
Department = "Department $_"
Name = "Department $_"
Manager = "jsmith$_"
}
}
#We have a name and Birthday for each manager, how do we find their department, using an inner join?
Join-Object -Left $l -Right $r -LeftJoinProperty Name -RightJoinProperty Manager -Type OnlyIfInBoth -RightProperties Department
# Name Birthday Department
# ---- -------- ----------
# jsmith4 4/14/2015 3:27:22 PM Department 4
# jsmith5 4/14/2015 3:27:22 PM Department 5
.EXAMPLE
#
#Define some input data.
$l = 1..5 | Foreach-Object {
[pscustomobject]@{
Name = "jsmith$_"
Birthday = (Get-Date).adddays(-1)
}
}
$r = 4..7 | Foreach-Object{
[pscustomobject]@{
Department = "Department $_"
Name = "Department $_"
Manager = "jsmith$_"
}
}
#We have a name and Birthday for each manager, how do we find all related department data, even if there are conflicting properties?
$l | Join-Object -Right $r -LeftJoinProperty Name -RightJoinProperty Manager -Type AllInLeft -Prefix j_
# Name Birthday j_Department j_Name j_Manager
# ---- -------- ------------ ------ ---------
# jsmith1 4/14/2015 3:27:22 PM
# jsmith2 4/14/2015 3:27:22 PM
# jsmith3 4/14/2015 3:27:22 PM
# jsmith4 4/14/2015 3:27:22 PM Department 4 Department 4 jsmith4
# jsmith5 4/14/2015 3:27:22 PM Department 5 Department 5 jsmith5
.EXAMPLE
#
#Hey! You know how to script right? Can you merge these two CSVs, where Path1's IP is equal to Path2's IP_ADDRESS?
#Get CSV data
$s1 = Import-CSV $Path1
$s2 = Import-CSV $Path2
#Merge the data, using a full outer join to avoid omitting anything, and export it
Join-Object -Left $s1 -Right $s2 -LeftJoinProperty IP_ADDRESS -RightJoinProperty IP -Prefix 'j_' -Type AllInBoth |
Export-CSV $MergePath -NoTypeInformation
.EXAMPLE
#
# "Hey Warren, we need to match up SSNs to Active Directory users, and check if they are enabled or not.
# I'll e-mail you an unencrypted CSV with all the SSNs from gmail, what could go wrong?"
# Import some SSNs.
$SSNs = Import-CSV -Path D:\SSNs.csv
#Get AD users, and match up by a common value, samaccountname in this case:
Get-ADUser -Filter "samaccountname -like 'wframe*'" |
Join-Object -LeftJoinProperty samaccountname -Right $SSNs `
-RightJoinProperty samaccountname -RightProperties ssn `
-LeftProperties samaccountname, enabled, objectclass
.NOTES
This borrows from:
Dave Wyatt's Join-Object - http://powershell.org/wp/forums/topic/merging-very-large-collections/
Lucio Silveira's Join-Object - http://blogs.msdn.com/b/powershell/archive/2012/07/13/join-object.aspx
Changes:
Always display full set of properties
Display properties in order (left first, right second)
If specified, add suffix or prefix to right object property names to avoid collisions
Use a hashtable rather than ordereddictionary (avoid case sensitivity)
.LINK
http://ramblingcookiemonster.github.io/Join-Object/
.FUNCTIONALITY
PowerShell Language
#>
[CmdletBinding()]
Param
(
[Parameter(Mandatory=$true,
ValueFromPipeLine = $true)]
[object[]] $Left,
# List to join with $Left
[Parameter(Mandatory=$true)]
[object[]] $Right,
[Parameter(Mandatory = $true)]
[string] $LeftJoinProperty,
[Parameter(Mandatory = $true)]
[string] $RightJoinProperty,
[object[]]$LeftProperties = '*',
# Properties from $Right we want in the output.
# Like LeftProperties, each can be a plain name, wildcard or hashtable. See the LeftProperties comments.
[object[]]$RightProperties = '*',
[validateset( 'AllInLeft', 'OnlyIfInBoth', 'AllInBoth', 'AllInRight')]
[Parameter(Mandatory=$false)]
[string]$Type = 'AllInLeft',
[string]$Prefix,
[string]$Suffix
)
Begin
{
function AddItemProperties($item, $properties, $hash)
{
if ($null -eq $item)
{
return
}
foreach($property in $properties)
{
$propertyHash = $property -as [hashtable]
if($null -ne $propertyHash)
{
$hashName = $propertyHash["name"] -as [string]
$expression = $propertyHash["expression"] -as [scriptblock]
$expressionValue = $expression.Invoke($item)[0]
$hash[$hashName] = $expressionValue
}
else
{
foreach($itemProperty in $item.psobject.Properties)
{
if ($itemProperty.Name -like $property)
{
$hash[$itemProperty.Name] = $itemProperty.Value
}
}
}
}
}
function TranslateProperties
{
[cmdletbinding()]
param(
[object[]]$Properties,
[psobject]$RealObject,
[string]$Side)
foreach($Prop in $Properties)
{
$propertyHash = $Prop -as [hashtable]
if($null -ne $propertyHash)
{
$hashName = $propertyHash["name"] -as [string]
$expression = $propertyHash["expression"] -as [scriptblock]
$ScriptString = $expression.tostring()
if($ScriptString -notmatch 'param\(')
{
Write-Verbose "Property '$HashName'`: Adding param(`$_) to scriptblock '$ScriptString'"
$Expression = [ScriptBlock]::Create("param(`$_)`n $ScriptString")
}
$Output = @{Name =$HashName; Expression = $Expression }
Write-Verbose "Found $Side property hash with name $($Output.Name), expression:`n$($Output.Expression | out-string)"
$Output
}
else
{
foreach($ThisProp in $RealObject.psobject.Properties)
{
if ($ThisProp.Name -like $Prop)
{
Write-Verbose "Found $Side property '$($ThisProp.Name)'"
$ThisProp.Name
}
}
}
}
}
function WriteJoinObjectOutput($leftItem, $rightItem, $leftProperties, $rightProperties)
{
$properties = @{}
AddItemProperties $leftItem $leftProperties $properties
AddItemProperties $rightItem $rightProperties $properties
New-Object psobject -Property $properties
}
#Translate variations on calculated properties. Doing this once shouldn't affect perf too much.
foreach($Prop in @($LeftProperties + $RightProperties))
{
if($Prop -as [hashtable])
{
foreach($variation in ('n','label','l'))
{
if(-not $Prop.ContainsKey('Name') )
{
if($Prop.ContainsKey($variation) )
{
$Prop.Add('Name',$Prop[$Variation])
}
}
}
if(-not $Prop.ContainsKey('Name') -or $Prop['Name'] -like $null )
{
Throw "Property is missing a name`n. This should be in calculated property format, with a Name and an Expression:`n@{Name='Something';Expression={`$_.Something}}`nAffected property:`n$($Prop | out-string)"
}
if(-not $Prop.ContainsKey('Expression') )
{
if($Prop.ContainsKey('E') )
{
$Prop.Add('Expression',$Prop['E'])
}
}
if(-not $Prop.ContainsKey('Expression') -or $Prop['Expression'] -like $null )
{
Throw "Property is missing an expression`n. This should be in calculated property format, with a Name and an Expression:`n@{Name='Something';Expression={`$_.Something}}`nAffected property:`n$($Prop | out-string)"
}
}
}
$leftHash = @{}
$rightHash = @{}
# Hashtable keys can't be null; we'll use any old object reference as a placeholder if needed.
$nullKey = New-Object psobject
$bound = $PSBoundParameters.keys -contains "InputObject"
if(-not $bound)
{
[System.Collections.ArrayList]$LeftData = @()
}
}
Process
{
#We pull all the data for comparison later, no streaming
if($bound)
{
$LeftData = $Left
}
Else
{
foreach($Object in $Left)
{
[void]$LeftData.add($Object)
}
}
}
End
{
foreach ($item in $Right)
{
$key = $item.$RightJoinProperty
if ($null -eq $key)
{
$key = $nullKey
}
$bucket = $rightHash[$key]
if ($null -eq $bucket)
{
$bucket = New-Object System.Collections.ArrayList
$rightHash.Add($key, $bucket)
}
$null = $bucket.Add($item)
}
foreach ($item in $LeftData)
{
$key = $item.$LeftJoinProperty
if ($null -eq $key)
{
$key = $nullKey
}
$bucket = $leftHash[$key]
if ($null -eq $bucket)
{
$bucket = New-Object System.Collections.ArrayList
$leftHash.Add($key, $bucket)
}
$null = $bucket.Add($item)
}
$LeftProperties = TranslateProperties -Properties $LeftProperties -Side 'Left' -RealObject $LeftData[0]
$RightProperties = TranslateProperties -Properties $RightProperties -Side 'Right' -RealObject $Right[0]
#I prefer ordered output. Left properties first.
[string[]]$AllProps = $LeftProperties
#Handle prefixes, suffixes, and building AllProps with Name only
$RightProperties = foreach($RightProp in $RightProperties)
{
if(-not ($RightProp -as [Hashtable]))
{
Write-Verbose "Transforming property $RightProp to $Prefix$RightProp$Suffix"
@{
Name="$Prefix$RightProp$Suffix"
Expression=[scriptblock]::create("param(`$_) `$_.'$RightProp'")
}
$AllProps += "$Prefix$RightProp$Suffix"
}
else
{
Write-Verbose "Skipping transformation of calculated property with name $($RightProp.Name), expression:`n$($RightProp.Expression | out-string)"
$AllProps += [string]$RightProp["Name"]
$RightProp
}
}
$AllProps = $AllProps | Select -Unique
Write-Verbose "Combined set of properties: $($AllProps -join ', ')"
foreach ( $entry in $leftHash.GetEnumerator() )
{
$key = $entry.Key
$leftBucket = $entry.Value
$rightBucket = $rightHash[$key]
if ($null -eq $rightBucket)
{
if ($Type -eq 'AllInLeft' -or $Type -eq 'AllInBoth')
{
foreach ($leftItem in $leftBucket)
{
WriteJoinObjectOutput $leftItem $null $LeftProperties $RightProperties | Select $AllProps
}
}
}
else
{
foreach ($leftItem in $leftBucket)
{
foreach ($rightItem in $rightBucket)
{
WriteJoinObjectOutput $leftItem $rightItem $LeftProperties $RightProperties | Select $AllProps
}
}
}
}
if ($Type -eq 'AllInRight' -or $Type -eq 'AllInBoth')
{
foreach ($entry in $rightHash.GetEnumerator())
{
$key = $entry.Key
$rightBucket = $entry.Value
$leftBucket = $leftHash[$key]
if ($null -eq $leftBucket)
{
foreach ($rightItem in $rightBucket)
{
WriteJoinObjectOutput $null $rightItem $LeftProperties $RightProperties | Select $AllProps
}
}
}
}
}
}